For every subject assessed, NAEP reports on how well students in different demographic groups, such as race and gender perform on the assessment. (NAEP does NOT report individual student scores.) How does NAEP summarize what students in these groups know and can do, and make comparisons among the achievement of these groups of students?
In U.S. history, NAEP creates a scale ranging from 0-500, based on statistical procedures called Item Response Theory (IRT). IRT is a set of statistical procedures useful in summarizing student performance across a collection of test exercises requiring similar knowledge and skills. All NAEP subject area scales are produced using these procedures.
To give meaning to the levels of the scale, it is useful to create an "item map." An item map is a representation of the skills and abilities demonstrated by students at various levels of the scale. The map indicates which kinds of questions students can likely answer correctly at each level on the scale. To get a more complete sense of the U.S. history scale, explore item maps for U.S. history.