Resources for Understanding Psychometrics

I enjoy writing about psychometrics. I also enjoy people reading what I write. Unfortunately, I don’t think many people do, and a big reason is probably that few people actually understand psychometrics. To help with that, I’m sharing some resources that make the field more accessible.

The Personality Project has a surprisingly nice (and free) textbook which covers the basics.

Chapter 7 (“Reliability and Stability of Mental Measurements”) of Arthur Jensen’s Bias in Mental Testing also serves as a easy-to-understand introduction to classical test theory, along with showing how it is applied to real testing data.

Measurement invariance (often assessed via multi-group CFA) and differential item functioning (DIF, typically assessed at the item level in IRT or logistic frameworks) are important for comparing results across groups1. For measurement invariance, the most accessible explanation I’ve found is A casual but causal take on measurement invariance by The 100% CI. For differential item functioning, Meng Hu has a very comprehensive post explaining various approaches to detecting DIF.

Emil Kierkegaard has created a number of useful visualizations illustrating concepts such as range restriction, ceiling and floor effects, regression to the mean, discretization, and other statistical artifacts.

  1. The concept of comparing results across groups is very broad. It includes the obvious cases, such as comparing extroversion across countries (where the groups might be Americans and Germans). But it also includes questions like whether pre-treatment and post-treatment scores on a depression questionnaire are comparable (the ‘groups’ being the pre-treatment and post-treatment measurements, even if they come from the same individuals), or whether the population has become more intelligent over time (the groups being, for example, Americans in 1970 versus Americans in 2020).