NAEP Technical Documentation

Score range and percent agreement for the constructed-response items from previous years that were rescored in 2012, age 17 mathematics long-term trend assessment, by item and block: 2012
Block  Item     Range of response codes  Sample size  Percent exact agreement
MB     N302901  1-2                      800          100
MB     N308901  1-2                      800          100
MB     N309301  1-2                      800          100
MB     N309901  1-2                      800          99
MB     N310201  1-2                      800          99
MC     N325301  1-2                      800          100
MC     N325601  1-2                      800          99
MC     N325701  1-2                      800          99
MC     N326601  1-2                      800          98
MC     N326801  1-2                      800          96
MC     N327101  1-2                      800          98
MD     N315501  1-2                      800          99
MD     N321001  1-2                      800          99
MD     N321101  1-2                      800          99
MD     N321401  1-2                      800          99
MD     N321901  1-2                      800          98
ME     N303801  1-2                      800          99
ME     N304101  1-2                      800          100
ME     N304201  1-2                      800          99
ME     N304601  1-2                      800          99
ME     N304901  1-2                      800          99
MF     N299001  1-2                      800          99
MF     N305501  1-2                      800          98
MF     N305801  1-2                      800          100
MF     N306201  1-2                      800          99
MF     N306401  1-2                      800          99
NOTE: Special codes assigned to student responses including blank, off-task, and not-scorable were included in the calculation of the percent exact agreement measure.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2012 Mathematics Long-Term Trend Assessment.
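
As an illustration of the percent exact agreement measure described in the NOTE to this table, the following is a minimal Python sketch, not the NAEP scoring software. The function name, the sample codes, and the "blank" label are hypothetical; the only point taken from the NOTE is that special codes stay in the calculation rather than being dropped.

```python
def percent_exact_agreement(first_scores, second_scores):
    """Percent of responses that received the same code from both scorings."""
    if len(first_scores) != len(second_scores):
        raise ValueError("both scorings must cover the same responses")
    if not first_scores:
        raise ValueError("no responses to compare")
    # Special codes are compared like any other code, so a pair such as
    # ("blank", "blank") counts as an agreement.
    matches = sum(a == b for a, b in zip(first_scores, second_scores))
    return 100.0 * matches / len(first_scores)


# Hypothetical data: 1 and 2 are score levels, "blank" is a special code.
original = [1, 2, 2, "blank", 1, 2, 1, 1]
rescore = [1, 2, 1, "blank", 1, 2, 1, 1]
agreement = percent_exact_agreement(original, rescore)
print(agreement)
```

In this made-up example the two scorings disagree on one of eight responses, so seven of eight pairs match.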

Score range and Cohen's Kappa or intraclass correlation for the constructed-response items from previous years that were rescored in 2012, age 17 mathematics long-term trend assessment, by item and block: 2012
Block  Item     Range of response codes  Sample size  Cohen's Kappa  Intraclass correlation
MB     N302901  1-2                      800          0.99           †
MB     N308901  1-2                      800          0.99           †
MB     N309301  1-2                      800          0.99           †
MB     N309901  1-2                      700          0.99           †
MB     N310201  1-2                      700          0.98           †
MC     N325301  1-2                      800          0.99           †
MC     N325601  1-2                      700          0.96           †
MC     N325701  1-2                      700          0.98           †
MC     N326601  1-2                      800          0.99           †
MC     N326801  1-2                      700          0.99           †
MC     N327101  1-2                      700          0.99           †
MD     N315501  1-2                      700          0.99           †
MD     N321001  1-2                      800          0.98           †
MD     N321101  1-2                      700          0.99           †
MD     N321401  1-2                      700          0.99           †
MD     N321901  1-2                      700          0.98           †
ME     N303801  1-2                      700          1.00           †
ME     N304101  1-2                      800          0.99           †
ME     N304201  1-2                      800          0.98           †
ME     N304601  1-2                      700          0.99           †
ME     N304901  1-2                      700          1.00           †
MF     N299001  1-2                      700          0.99           †
MF     N305501  1-2                      700          0.98           †
MF     N305801  1-2                      800          0.99           †
MF     N306201  1-2                      700          1.00           †
MF     N306401  1-2                      700          1.00           †
† Not applicable. The intraclass correlation is not reported for dichotomously scored items; Cohen's Kappa is not reported for polytomously scored items.
NOTE: Cohen's Kappa is a measure of interrater reliability used for dichotomously scored items. The intraclass correlation is used for items with more than two score categories. Sample sizes can differ from those in the first table because percent agreement is calculated from the entire sample, whereas Cohen's Kappa and the intraclass correlation exclude students who omitted the item.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2012 Mathematics Long-Term Trend Assessment.
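
For readers who want to see how a kappa statistic of this kind is obtained, the sketch below applies the standard Cohen's Kappa formula, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of agreement and p_e is the agreement expected by chance from each scoring's code frequencies. It is an illustration only, not the NAEP procedure: the function name, the sample data, and the "omit" label are hypothetical, and dropping a response when either scoring marks it as omitted is an assumption about how the exclusion in the NOTE is applied.

```python
from collections import Counter

OMIT = "omit"  # hypothetical label for an omitted response


def cohens_kappa(first_scores, second_scores, omit_code=OMIT):
    """Cohen's Kappa for two scorings of the same responses, omits excluded."""
    if len(first_scores) != len(second_scores):
        raise ValueError("both scorings must cover the same responses")
    # Assumption: a response is dropped if either scoring marks it as omitted.
    pairs = [(a, b) for a, b in zip(first_scores, second_scores)
             if a != omit_code and b != omit_code]
    n = len(pairs)
    if n == 0:
        raise ValueError("no scored responses to compare")
    # Observed proportion of exact agreement.
    observed = sum(a == b for a, b in pairs) / n
    # Chance agreement from each scoring's marginal code frequencies.
    first_counts = Counter(a for a, _ in pairs)
    second_counts = Counter(b for _, b in pairs)
    expected = sum(first_counts[c] * second_counts[c] for c in first_counts) / n ** 2
    if expected == 1.0:
        raise ValueError("kappa is undefined when only one code is ever used")
    return (observed - expected) / (1 - expected)


# Hypothetical data for a dichotomous item with codes 1 and 2.
original = [1, 2, 2, "omit", 1, 2, 1, 1, 2, 2]
rescore = [1, 2, 1, "omit", 1, 2, 1, 1, 2, 2]
kappa = cohens_kappa(original, rescore)
print(round(kappa, 2))
```

Because the omitted response is removed before anything is counted, the kappa here rests on nine responses rather than ten, which mirrors why the sample sizes in this table can be smaller than those in the percent agreement table.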

Last updated 13 November 2013 (JL)