Psychological comments: Measurement errors

Friday, 27 June 2014

Measurement errors

One popular criticism of intelligence testing is that scores could be affected by motivation and levels of practice. By implication, those who are not motivated to take the test will do badly and will be unfairly judged, to the detriment of any society which uses intelligence test results as a ticket of admission to education or employment. By further implication, such lack of motivation may apply most strongly to those who are poorer and most dispirited for other reasons.

Test administrators know all that. They make sure that subjects understand the test and take and pass the practice items, and they encourage subjects before, during and after each test (guided by protocols as to what help and encouragement is permissible). They also ensure that at least 6 months elapse between face to face testing sessions, and that alternate forms are used if testing has to be conducted sooner. Hence psychometric reports talk about the person’s level of engagement, the amount of effort they show, and the specific problems they may have encountered. If there are significant problems, the results are either set aside or labelled as under-estimates, and further testing carried out later usually resolves the issue. Monitoring is easier in face to face testing, but item analysis gives some insight into lack of effort in group tests. Group tests often have more practice items, and care is taken to provide good quality test settings. By following all these procedures, practice effects and motivational differences are reduced, but not eliminated entirely. It is still possible that some low results may be due to low motivation, and also that some high results might be due to lucky guessing. How big could these effects be?

Assume for a moment that motivational and practice effects have an influence, and that to the true low scores of less able people must be added the false low scores of those who found the test boring, pointless, and not worth bothering about. People like me, for example. I would rather watch clothes dry on a cloudy day than take most intelligence tests.

If that were true, IQs would under-predict real life successes in things which were intrinsically interesting: getting good qualifications so as to get on in life, making money, and becoming famous.

If motivation were a major confounder, then correlations between IQ scores and real life scores would be low. However, IQ and real life are strongly correlated. For example, the largest recent study (Deary et al., 2007), of over 70,000 English children, found a correlation of r = 0.81 between general intelligence measured at 11 years of age and GCSE scores at age 16. This is extremely high predictive power (0.81 squared is about 0.66, so roughly 66% of the variance is accounted for). The colossal sample size gives us exceptional confidence in the robustness of the results. By way of comparison, most educational psychology publications have sample sizes of a few hundred, and are far less robust. As further proof of the common sense view that intelligence is involved in academic achievement, we can be even more precise about the impact of intelligence on different subjects. IQ scores on their own accounted for 58.6% of the variance in Mathematics results, 48% in English and down to 18.1% in Art and Design, that subject being the least intellectually demanding (Deary et al., 2007).

I. J. Deary, S. Strand, P. Smith and C. Fernandes (2007). Intelligence and educational achievement. Intelligence, 35(1), pp. 13-21. (For private study, email the author at the University of Edinburgh and ask for a copy.)
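
The subject percentages quoted above can be put on the same scale as the 0.81 correlation. A minimal sketch in Python, assuming each percentage is simply the squared correlation between a single predictor (IQ) and the subject score; the function name `implied_r` is illustrative and does not come from the paper:

```python
import math


def implied_r(variance_explained):
    """Correlation implied by a share of variance explained (assumes a single predictor)."""
    return math.sqrt(variance_explained)


if __name__ == "__main__":
    # Shares of variance quoted in the post for three GCSE subjects.
    subjects = {"Mathematics": 0.586, "English": 0.48, "Art and Design": 0.181}
    for name, share in subjects.items():
        print(f"{name}: r is roughly {implied_r(share):.2f}")
```

Under that assumption the implied correlations are about 0.77 for Mathematics, 0.69 for English and 0.43 for Art and Design.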

Problems of motivation and practice also apply to scholastic examinations and to any procedures followed in job interviews. Varying motivation applies not just to IQ tests but to all measures: intelligence tests, scholastic tests, and work assessments. Nobody gets round measurement error, not even the Spanish Inquisition.

In summary: Assume some people’s IQ scores are reduced by lack of motivation. That adds error to the scores, and error can only reduce the correlation between IQ and other real life measures. IQ at 11 correlates 0.81 with scholastic attainment at 16. If motivation is a problem, the true correlation is higher still.

If you prefer that as a Tweet:

If IQ scores are reduced by lack of motivation, but IQ at 11 correlates 0.81 with GCSEs at 16, then the real correlation is much higher.
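
The attenuation argument can be illustrated with a small simulation. This is a sketch under stated assumptions, not an analysis of the Deary data: ability and outcome are standard normal with a true correlation of 0.9, a fifth of test takers lose a random amount of score through low effort, and that loss is unrelated to both ability and outcome. All function names and parameter values are invented for illustration.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def simulate(n=20000, true_r=0.9, unmotivated_share=0.2, max_penalty=2.0, seed=1):
    """Return (true r, observed r, corrected r) for one simulated cohort.

    Assumption: the effort penalty is independent of ability and outcome.
    """
    rng = random.Random(seed)
    ability, observed, outcome = [], [], []
    for _ in range(n):
        a = rng.gauss(0.0, 1.0)
        # Outcome is built to correlate true_r with real ability.
        o = true_r * a + math.sqrt(1.0 - true_r ** 2) * rng.gauss(0.0, 1.0)
        # Unmotivated test takers score below their real ability.
        unmotivated = rng.random() < unmotivated_share
        penalty = rng.uniform(0.0, max_penalty) if unmotivated else 0.0
        ability.append(a)
        observed.append(a - penalty)
        outcome.append(o)

    r_true = pearson(ability, outcome)
    r_observed = pearson(observed, outcome)
    # Reliability of the test score: squared correlation with real ability.
    reliability = pearson(ability, observed) ** 2
    # Spearman's correction for attenuation (outcome treated as error-free).
    r_corrected = r_observed / math.sqrt(reliability)
    return r_true, r_observed, r_corrected


if __name__ == "__main__":
    r_true, r_observed, r_corrected = simulate()
    print(f"true {r_true:.2f}, observed {r_observed:.2f}, corrected {r_corrected:.2f}")
```

With these made-up settings the observed correlation comes out near 0.81 while the underlying one is about 0.9, and Spearman's correction recovers roughly the original value. The key assumption is that lost effort is unrelated to the outcome; if motivation itself also predicts the outcome, the direction of the bias is no longer guaranteed.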

14 comments:

Anonymous, 27 June 2014 at 14:30
If real IQ were completely uncorrelated with life outcomes but motivation were highly correlated with life outcomes, the IQ test would be more correlated with life outcomes than real IQ. That exact scenario is unlikely, but who knows whether real IQ or test IQ is more correlated with life outcomes.

Unknown, 27 June 2014 at 15:12
The hypothesis can be tested by comparing the predictions made by tests of intelligence and by tests of motivation. So far, tests of intelligence are the better predictors (and also better than self-rated intelligence).

Unknown, 27 June 2014 at 15:17
However, if IQ and personality are correlated, then motivation may be an aspect of intelligence: http://drjamesthompson.blogspot.co.uk/2013/07/intelligence-personality-and-self.html

Unknown, 27 June 2014 at 16:14
And a bit more on the confidence literature: http://drjamesthompson.blogspot.co.uk/2013/12/isir-confidence-and-achievement.html

Anonymous, 28 June 2014 at 19:53
Could differences in motivation play a significant role in the Flynn Effect? I understand that psychologists giving (individually administered) IQ tests pay close attention to motivation and consider the scores of unmotivated people to be suspect, but does this apply to the norming samples too? Are the scores of unmotivated people counted when tests like the Wechsler are being standardized? And if most people in the 1930s were unmotivated to try their best on these tests, would their lack of motivation have even been recognized if it was the norm for the time?

Unknown, 29 June 2014 at 00:25
With group tests we usually have no data other than item responses, and sometimes response latencies. Olev Must has good historical data which suggests that there may have been differences in guessing over the years, a hypothesis first proposed by Chris Brand. The picture on guessing rates is not consistent though. However, I doubt that effort is a major factor, though persistence on untimed tasks might be.

Steve Sailer, 30 June 2014 at 02:09
One question is whether motivation matters in PISA-type tests. For example, Finland routinely does better on PISA tests than it does on IQ standardizations. This could be because Finland's schools really are better than the rest of the Caucasian world's schools. Or maybe the Finnish school system is effective at giving pep talks to Finnish students to try really hard, get a good night's sleep beforehand, and otherwise treat this low stakes test like a high stakes test.

Beliavsky, 30 June 2014 at 15:39
Why not say that IQ scores predict real-life outcomes both because they measure "g" and because they measure motivation, both of which are important?