Intelligence and Achievement

Help Questions

AP Psychology › Intelligence and Achievement

Questions 1 - 10
1

A researcher examines whether an intelligence test predicts job training completion; what is being assessed?

Fixed IQ determinism, because training success is fully predetermined by inherited intelligence and cannot be influenced by instruction.

Predictive validity, because the test is evaluated by how well scores forecast later performance in training outcomes.

Gardner’s bodily-kinesthetic intelligence, because job training completion depends mainly on physical coordination rather than cognition.

Split-half reliability, because predicting training completion requires correlating two halves of the test to show it measures intelligence.

Explanation

Predictive validity is evaluated by examining how well test scores forecast future performance on relevant criteria. In this case, the researcher is assessing whether intelligence test scores can predict success in job training completion, which represents a future outcome that should theoretically be related to cognitive ability. This type of validity study involves a time delay between test administration and outcome measurement, distinguishing it from concurrent validity studies. Strong predictive validity would support the practical utility of the intelligence test for selection or placement decisions. The Flynn effect demonstrates that cognitive abilities can change over time, but this doesn't eliminate the utility of tests for prediction within reasonable time frames. Understanding predictive validity is crucial for evidence-based decision making in educational and occupational settings. Different theories of intelligence may suggest that different types of cognitive abilities predict different outcomes.

2

A student’s performance drops when testing conditions are noisy; what does this suggest about the score interpretation?

Noise reduces validity by definition, because validity is the same as consistency and any distraction makes a test invalid.

Noise cannot matter, because IQ is fixed; any score drop must be impossible if the test is standardized to mean 100, SD 15.

Noise disproves g, because a general intelligence factor cannot exist if performance varies across testing environments.

Situational factors can add measurement error, potentially lowering reliability of the obtained score without necessarily changing the construct definition.

Explanation

Environmental noise during testing can introduce random error that reduces the reliability of the obtained score without necessarily changing what the test is designed to measure. Situational factors like noise, distractions, fatigue, or anxiety can interfere with performance and lead to scores that underestimate an individual's true ability level. This highlights the importance of standardized testing conditions for obtaining reliable measurements. The score may not accurately reflect the person's typical performance capability under optimal conditions. Understanding these factors is crucial for appropriate test interpretation - a single low score under poor conditions should be interpreted cautiously and may warrant retesting under better conditions. The Flynn effect operates at the population level over time, while situational factors affect individual performance at specific testing sessions. Different theories of intelligence recognize that optimal assessment requires minimizing factors that can interfere with demonstration of cognitive abilities. This understanding supports the importance of standardized administration procedures and careful consideration of testing conditions when interpreting scores.

3

Which scenario best illustrates achievement rather than aptitude testing in a classroom setting?

A reasoning test designed to predict future success in calculus, regardless of what students have already been taught.

A math final exam assessing skills taught this term, focusing on what students have learned rather than their potential to learn.

A test declared valid because students earn similar scores on retesting, even if it does not match the curriculum.

An IQ test proving fixed intelligence, because achievement cannot change and should remain constant despite instruction.

Explanation

A math final exam assessing skills taught during the term is a classic example of achievement testing because it measures what students have already learned rather than their potential to learn new material. Achievement tests focus on evaluating current knowledge, skills, or competencies that have been acquired through instruction or experience. This contrasts with aptitude tests, which aim to predict future learning or performance potential. Both types of tests can be reliable and valid when properly constructed, but they serve different purposes in educational and psychological assessment. The distinction is important for test selection and score interpretation. Modern understanding recognizes that both achievement and aptitude can be influenced by educational experiences and environmental factors rather than being completely fixed. The Flynn effect demonstrates that even supposedly stable cognitive abilities can change over time, and theories of intelligence help explain why comprehensive assessment might require multiple types of measures.

4

A psychologist compares two students’ IQs: 85 and 115; how many SDs apart are they on mean 100, SD 15?

They are 2 SDs apart, because the 30-point difference equals $30/15=2$ standard deviations on a mean 100, SD 15 scale.

They are 0 SDs apart, because reliability implies all scores reflect the same true intelligence regardless of observed differences.

They are 3 SDs apart, because IQ differences compound over time as intelligence remains fixed and cannot be influenced by schooling.

They are 1 SD apart, because the difference of 30 points equals one standard deviation on the typical IQ distribution.

Explanation

To find the distance between two scores in standard deviation units, we subtract the scores and divide by the standard deviation: (115 - 85) ÷ 15 = 30 ÷ 15 = 2 standard deviations. This calculation is fundamental to interpreting scores on any standardized measure. Understanding standard deviation units allows for meaningful comparison of scores across different tests and populations. On a normal distribution, approximately 68% of scores fall within one standard deviation of the mean, 95% within two standard deviations, and 99.7% within three standard deviations. This mathematical relationship is crucial for determining how unusual or typical a particular score is. The Flynn effect shows that population means can shift over time, requiring periodic renorming, but the mathematical relationships between scores and standard deviations remain constant within any given norming sample.

5

A test’s items all measure the same skill and correlate strongly with each other; what property is indicated?

Sternberg’s triarchic theory, because correlated items show analytic, creative, and practical intelligence are identical abilities.

Fixed IQ measurement, because consistent items prove intelligence is unchangeable and unaffected by practice or education.

Internal consistency reliability, because high inter-item correlations suggest the items measure a similar construct even if validity is unknown.

Content validity, because strong inter-item correlations guarantee the test fully samples the domain and predicts external outcomes.

Explanation

Internal consistency reliability measures whether test items that are supposed to measure the same construct actually correlate with each other. When items on a test correlate strongly with one another, this suggests they are all tapping into the same underlying ability or trait, indicating good internal consistency. This is typically measured using statistics like Cronbach's alpha or split-half reliability. High internal consistency provides evidence that the test is measuring a coherent construct, though it doesn't guarantee the test is measuring what it claims to measure (validity). A test could have high internal consistency but still be measuring something other than the intended construct. This type of reliability is particularly important during test development, as it helps identify items that may not belong with the others and ensures the final test measures a unified concept rather than a mixture of unrelated abilities.

6

An IQ test is standardized to mean 100, SD 15; what score is two SDs above average?

A score of 145, because Gardner’s multiple intelligences predicts many people exceed 140 when schools teach to their strengths.

A score of 130, because it is two standard deviations above 100 using an SD of 15 on the typical IQ scale.

A score of 160, because IQ is fixed and extreme scores reflect unchangeable genetic endowment rather than measurement conventions.

A score of 115, because one standard deviation above the mean indicates very superior intelligence and strong test reliability.

Explanation

On a standardized IQ scale with mean 100 and standard deviation 15, calculating scores at specific standard deviation distances is straightforward arithmetic. Two standard deviations above the mean equals 100 + (2 × 15) = 130. This demonstrates how IQ scores are distributed on the normal curve, where approximately 95% of scores fall within two standard deviations of the mean. Understanding this standardization is crucial for interpreting IQ scores in both clinical and educational settings. The Flynn effect shows that population averages can shift over time, requiring periodic renorming to maintain the mean at 100. Gardner's multiple intelligences theory and concepts about fixed intelligence don't change the mathematical relationship between standard deviations and score interpretation.

7

Aptitude tests are intended to predict future performance; what is the key distinction from achievement tests?

Aptitude tests are more reliable because they are always valid, while achievement tests are invalid unless they show test-retest consistency.

Achievement tests estimate potential to learn new material, whereas aptitude tests measure what a student has already learned in school.

Aptitude tests prove IQ is fixed, whereas achievement tests change with schooling and therefore cannot be standardized to mean 100, SD 15.

Aptitude tests aim to forecast future performance, whereas achievement tests assess current knowledge or skills already acquired.

Explanation

Aptitude tests are specifically designed to predict future performance or learning potential, while achievement tests measure what has already been learned or accomplished. This fundamental distinction reflects different testing purposes: aptitude tests aim to forecast how well someone might perform in future learning situations, while achievement tests assess current knowledge and skills. Both types of tests can be reliable and valid when properly constructed and standardized to appropriate populations. The notion that IQ is completely fixed is not supported by modern research, which shows that environmental factors, education, and intervention can influence cognitive performance. Understanding this distinction helps educators and psychologists select appropriate assessments for their specific purposes, whether predicting future success or evaluating current competency.

8

A teacher wants a test that samples all units taught this semester; which validity is most relevant?

Test-retest reliability, because sampling all units ensures students would earn identical scores if tested again next week.

Fixed IQ validity, because a comprehensive semester test reveals stable intelligence that cannot be improved by studying.

Content validity, because the assessment should cover the full range of material taught rather than a narrow subset.

Spearman’s g, because covering all units guarantees the test measures general intelligence rather than specific learned knowledge.

Explanation

Content validity refers to how well a test samples the full domain of knowledge or skills it's intended to measure. A teacher designing a semester exam wants to ensure the test adequately covers all the material taught, not just a narrow subset of topics. This type of validity is typically evaluated through expert judgment rather than statistical analysis. Content validity is particularly important for achievement tests, where the goal is to assess mastery of specific curricula or domains. Unlike predictive or concurrent validity, content validity focuses on the representativeness of test items rather than correlations with external criteria. Good content validity helps ensure fair assessment by preventing students from being disadvantaged if they happen to have studied topics that aren't represented on the test, while also ensuring that high scores reflect comprehensive understanding rather than lucky guessing on a narrow selection of material.

9

A student excels at music and interpersonal skills but average on logic puzzles; which theory best fits this profile?

High reliability, because consistent performance in music indicates the test is valid and measures intelligence rather than practice effects.

Spearman’s g, because all cognitive abilities are driven by one factor, so strengths must appear equally across domains.

Fixed IQ doctrine, because domain strengths show intelligence is permanent and cannot be influenced by instruction or cultural opportunities.

Gardner’s multiple intelligences, emphasizing distinct abilities like musical and interpersonal intelligence that may not align with logical-mathematical performance.

Explanation

Gardner's theory of multiple intelligences proposes that intelligence consists of several relatively independent abilities, including musical, interpersonal, logical-mathematical, linguistic, and others. A student who excels in music and interpersonal skills but performs averagely on logic puzzles exemplifies this theory's core premise that individuals can have distinct strength profiles across different intellectual domains. This contrasts with Spearman's g theory, which emphasizes a single general intelligence factor underlying all cognitive abilities. The Flynn effect describes population-level changes over time, and modern research shows that intelligence can be influenced by education and environment. Gardner's theory has been influential in education, encouraging recognition of diverse talents and alternative approaches to instruction that capitalize on different intellectual strengths.

10

A test predicts first-year college GPA from high school juniors’ scores; what validity is being evaluated?

Fixed IQ certainty, because accurate prediction proves intelligence cannot change and therefore future academic performance is predetermined.

Predictive validity, because the test is judged by how well earlier scores forecast a later real-world outcome like college GPA.

Internal consistency reliability, because predicting GPA requires the test to measure the same trait at two different time points.

Gardner’s theory, because GPA prediction depends on multiple intelligences, each standardized to mean 100, SD 15 independently.

Explanation

Predictive validity is demonstrated when test scores successfully forecast future outcomes or performance in relevant real-world situations. A test that predicts first-year college GPA from high school scores shows it can anticipate future academic success, which is a key form of criterion-related validity. This type of validity is particularly important for selection and placement decisions in education and employment. The time gap between test administration and outcome measurement is what distinguishes predictive validity from concurrent validity, where measures are taken simultaneously. Strong predictive validity provides confidence that test scores have practical utility beyond the testing situation itself. This concept is fundamental to understanding why aptitude tests are valuable - their worth lies primarily in their ability to forecast future performance rather than just describe current abilities.

Page 1 of 8