Discussion about this post

User's avatar
Emil O. W. Kirkegaard's avatar

Great write-up. There are other issues to consider:

1) Selection bias due to g-differentiated mortality in the older ages biases their performance upwards. This is visible in e.g. UKBB estimation of dysgenic effect using PGS. This will cause the longitudinal studies to show smaller declines.

2) We know that measurement invariance fails between age groups within tests. Your studies above seem to use MGCFA, but IRT-DIF studies have been done on items. This is particularly the case for crystalized tests, if you keep using the old versions of the test which test for increasingly out of date information and words. This bias is presumably smaller in longitudinal settings but substantial for quasi-longitudinal and cross-sectional. See e.g. https://doi.org/10.1037/a0038217 Their conclusion is that scores are not interpretable between birth cohort groups. I have a lot of cross-sectional mixed age data for various vocabulary tests which we can examine if you want to collaborate.

Tanj's avatar

I don't see any analysis of cultural fit varying by age. Culture differences have been a pernicious problem with IQ tests. Now, there are two aspects of cultural shift. One is that the culture 50 years ago was radically different even in the same place. This can be reasonably lumped in as one of the causes of the Flynn effect, which you do address.

However, another cultural shift is that a person aged 70 simply has very different interests, experiences, and methods than a person aged 30. Reading what younger people assume about aging is a bit like reading speculative fiction, and tests created by young people are likely to be oblivious to some of the things old people may invest intelligence in. Is that also in Flynn? Maybe.

No posts

Ready for more?