The Basic Concept
A confidence interval (CI) is a statistical method for quantifying measurement uncertainty. Scores obtained from cognitive tests are observed values that combine true ability with measurement error. A 95% confidence interval means that if the same measurement were repeated 100 times under identical conditions, the true value would fall within this range in 95 of those repetitions. For example, if your reaction time is 200ms with a 95% CI of [185ms, 215ms], your true reaction time likely falls between 185 and 215ms. Narrower intervals indicate higher measurement precision, while wider intervals signal greater uncertainty.
Sources of Measurement Error in Cognitive Tests
Measurement error in cognitive tests arises from multiple sources. Trial-to-trial variability means reaction times fluctuate by tens of milliseconds even within a single session. Day-to-day variability reflects how sleep quality, caffeine intake, and stress levels produce different scores on different days. Environmental factors including lighting, noise, and device input lag also contribute. The confidence interval statistically aggregates these error sources, enabling you to view results as a range rather than a single number. This perspective prevents overreacting to minor score fluctuations and supports more balanced self-assessment.
Using CIs for Score Comparison and Improvement Detection
Confidence intervals play a crucial role in determining whether score differences are statistically meaningful. When confidence intervals from two measurements overlap, the score difference falls within measurement error and cannot be attributed to true ability change. When intervals do not overlap, a statistically significant change has likely occurred. Bench displays confidence intervals alongside each test result, enabling users to objectively judge whether improvement is genuine or within normal daily variation. Increasing the number of measurements narrows the confidence interval, allowing more precise evaluation of progress over time.