Overview of the Assessment
Reporting the Assessment—Scale Scores and Achievement Levels
Description of U.S. History Performance by Item Maps for Each Grade
Results Are Estimates
NAEP Reporting Groups
Cautions in Interpretations
NAEP assesses student performance in U.S. history by administering assessments to samples that are representative of the nation's students. The content of the NAEP U.S. history assessment is determined by a framework developed with the help of researchers, policymakers, and the interested public, including those with expert perspectives about U.S. history and its measurement. Read more about what the assessment measures, how it was developed, who took the assessment, and how the assessment was administered.
The 2010 U.S. history results presented on the website are based on representative samples of students for the nation. Nationally representative samples of more than 7,000 fourth-graders; 11,000 eighth-graders; and 12,000 twelfth-graders participated in the assessment. The national results reflect the performance of students attending both public and nonpublic schools. Results from 2010 are compared to results from the previous U.S. history assessments administered in 1994, 2001, and 2006. In 1994, students were not offered accommodations as part of the NAEP assessments; however, in 2001, NAEP administered the assessment with both a sample who were allowed accommodations and those who were not allowed accommodations. Changes in student performance across years or differences between groups of students in 2010 are discussed only if they have been determined to be statistically significant.
The results of student performance on the NAEP U.S. history assessment are presented on this website in two ways: as average scores on the NAEP U.S. history scale and as the percentages of students attaining NAEP U.S. history achievement levels. The average scale scores represent how students performed on the assessment. The achievement levels represent how that performance measured up against set expectations for achievement. Thus, the average scale scores represent what students know and can do, while the achievement-level results indicate the degree to which student performance meets expectations of what they should know and be able to do. In addition to reporting an overall U.S. history score for each grade, scale scores are reported at five percentiles to show trends in results for students performing at lower (10th and 25th percentiles), middle (50th percentile), and higher (75th and 90th percentile) levels.
Average U.S. history scale score results are based on the NAEP U.S. history scale, which ranges from 0 to 500. The NAEP U.S. history assessment scale is a composite combining separate scales for each of the four U.S. history themes specified by the U.S. history framework. Average scale scores are computed for groups of students; NAEP does not produce individual student scores. The average scores are based on analyses of the percentages of students who answered each item successfully. While the score ranges at each grade in U.S. history are identical, the scale was derived independently at each grade. Therefore, average scale scores across grades cannot be compared. For example, equal scale scores on the grade 4 and grade 8 scales do not imply equal levels of U.S. history achievement.
Achievement-level results are presented in terms of U.S. history achievement levels adopted by the National Assessment Governing Board, and are intended to measure how well students' actual achievement matches the achievement desired of them. For each grade tested, the Governing Board has adopted three achievement levels: Basic, Proficient, and Advanced. For reporting purposes, the achievement-level cut scores are placed on the U.S. history scales, resulting in four ranges: below Basic, Basic, Proficient, and Advanced. As provided by law, the achievement levels are to be used on a trial basis and should be interpreted and used with caution.
Item maps illustrate the knowledge and skills demonstrated by students performing at different scale points on the 2010 NAEP U.S. history assessment. In order to provide additional context, the cut points for the three NAEP achievement levels are marked on the item maps. The map location for each question represents the probability that, for a given score point, 65 percent of the students for a constructed-response question and 74 percent of the students for a four-option multiple-choice question answered that question successfully. For constructed-response questions, responses may be completely or partially correct; therefore, different types of responses to the same question could map onto the scale at different score levels.
Approximately 20 to 30 U.S. history questions per grade have been selected and placed on an item map for each grade.
The average scores and percentages presented on this website are estimates because they are based on representative samples of students rather than on the entire population of students. Moreover, the collection of subject-area questions used at each grade level is but a sample of the many questions that could have been asked. As such, NAEP results are subject to a measure of uncertainty, reflected in the standard error of the estimates. The standard errors for the estimated scale scores and percentages in the figures and tables presented on this website are available through the NAEP Data Explorer.
Results are provided for groups of students defined by shared characteristics—gender, race/ethnicity, eligibility for free/reduced-price school lunch, students with disabilities, and students identified as English language learners. Based on participation rate criteria, results are reported for various student populations only when sufficient numbers of students and adequate school representation are present. The minimum requirement is at least 62 students in a particular group from at least five primary sampling units (PSUs). However, the data for all students, regardless of whether their group was reported separately, were included in computing overall results. Explanations of the reporting groups are presented below.
Results are reported separately for males and females.
In all NAEP assessments, data about student race/ethnicity is collected from two sources: school records and student self-reports. Before 2002, NAEP used students' self-reports of their race and ethnicity on a questionnaire as the source of race/ethnicity data. In 2002, it was decided to change the student race/ethnicity variable highlighted in NAEP reports. Starting in 2002, NAEP reports of students' race and ethnicity are based on the school records, with students' self-reports used only if school data are missing. Information based on student self-reported race/ethnicity will continue to be reported in the NAEP Data Explorer.
In order to allow comparisons across years, assessment results presented are based on school-reported information for six mutually exclusive racial/ethnic categories: White, Black, Hispanic, Asian/Pacific Islander, American Indian (including Alaska Native), and Other. Students who identified with more than one of the first five categories or had a background other than the ones listed were categorized as Unclassified.
NAEP collects data on student eligibility for the National School Lunch Program (NSLP) as an indicator of low family income. Under the guidelines of NSLP, children from families with incomes below 130 percent of the poverty level are eligible for free meals. Those from families with incomes between 130 and 185 percent of the poverty level are eligible for reduced-price meals. (For the period July 1, 2009, through June 30, 2010, for a family of four, 130 percent of the poverty level was $28,665, and 185 percent was $40,793.) Some schools provide free meals to all students irrespective of individual eligibility, using their own funds to cover the costs of noneligible students. Under special provisions of the National School Lunch Act intended to reduce the administrative burden of determining student eligibility every year, schools can be reimbursed based on eligibility data for a single base year. Participating schools might have high percentages of eligible students and report all students as eligible for free lunch.
Because of the improved quality of the data on students' eligibility for NSLP, the percentage of students for whom information was not available has decreased compared to the percentages reported prior to the 2006 assessment. While results by NSLP eligibility prior to 2006 are not shown in the print report, all results are available on the website beginning in 2001—the earliest year in which NAEP collected data by this variable as part of the U.S. history assessment.
Results are reported for students who were identified by school records as having a disability. A student with a disability may need specially designed instruction to meet his or her learning goals. A student with a disability will usually have an Individualized Education Program (IEP) which guides his or her special education instruction. Students with disabilities are often referred to as special education students and may be classified by their school as learning disabled (LD) or emotionally disturbed (ED).
Most figures in the U.S. history web results show two data points in 2001—one in which accommodations were permitted and one in which they were not. While results for both 2001 samples are presented, comparisons between 2001 and 2010 are based on the accommodated samples of both years. When results for students with disabilities are presented, only the accommodated sample data are shown. Accommodations were not offered prior to 2001.
Results are reported for students who were identified by school records as being English language learners. When results are presented for English language learners, only the accommodated sample is presented as the 2001 starting point in the trend line. (Note that English language learners were previously referred to as limited English proficient [LEP].)
The national results are based on a representative sample of students in both public schools and nonpublic schools. Nonpublic schools include private schools, Bureau of Indian Affairs schools, and Department of Defense schools. Private schools include Catholic, Conservative Christian, Lutheran, and other private schools. Results are reported for private schools overall, as well as disaggregated by Catholic and other private schools, in 2010 at grade 8 only due to low participation rates at other grade levels.
Parents' highest level of education is defined by the highest level reported by eighth-graders and twelfth-graders for either parent. Fourth-graders' replies to this question were not reported because their responses in previous studies were highly variable, and a large percentage of them chose the "I don't know" option.
NAEP has established policies and procedures to maximize the inclusion of all students in the assessment. Every effort is made to ensure that all selected students who are capable of participating meaningfully in the assessment are assessed. While some students with disabilities (SD) and/or English language learners (ELL) can be assessed without any special procedures, others require accommodations to participate in NAEP. Still other SD and/or ELL students selected by NAEP may not be able to participate. Local school authorities determine whether SD/ELL students require accommodations or should be excluded because they cannot be assessed. The percentage of SD and/or ELL students who are excluded from NAEP assessments varies from one jurisdiction to another and within a jurisdiction over time.
See additional information about the percentages of special-needs students
Differences between scale scores and between percentages that are discussed in the results on this website take into account the standard errors associated with the estimates. Comparisons are based on statistical tests that consider both the magnitude of the difference between the group average scores or percentages and the standard errors of those statistics. Throughout the results, differences between scores or between percentages are discussed only when they are significant from a statistical perspective.
All differences reported are significant at the 0.05 level with appropriate adjustments for multiple comparisons. The term "significant" is not intended to imply a judgment about the absolute magnitude or the educational relevance of the differences. It is intended to identify statistically dependable population differences to help inform dialogue among policymakers, educators, and the public.
Users of this website are cautioned against interpreting NAEP results as implying causal relations. Inferences related to student group performance or to the effectiveness of public and nonpublic schools, for example, should take into consideration the many socioeconomic and educational factors that may also have an impact on performance.
The NAEP scale makes it possible to examine relationships between students' performance and various factors measured by NAEP. However, a relationship that exists between achievement and another variable does not reveal its underlying cause, which may be influenced by a number of other variables. Similarly, the assessments do not reflect the influence of unmeasured variables. The results are most useful when they are considered in combination with other knowledge about the student population and the educational system, such as trends in instruction, changes in the school-age population, and societal demands and expectations.
Return to the U.S. history subject information.