Page Title:
Keywords:
Description:
Skip to main content
Skip Navigation
NAEP Analysis and Scaling → Initial Activities → Constructed-Response Interrater Reliability → Range of Response Codes, Percent Agreement, and Cohen's Kappa or Intraclass Correlation for the Constructed-Response Items From Previous Years, Rescored in 2012, Age 9 Mathematics Long-Term Trend Assessment, by Item and Block: 2012
NAEP Technical DocumentationScore range and percent agreement for the constructed-response items from the previous year assessed that were rescored in 2012, age 9 mathematics long-term trend assessment, by item and block: 2012
Block Item Range of response codes Sample size Percent exact agreement
NOTE: Special codes assigned to student responses including blank, off-task, and not-scorable were included in the calculation of the percent exact agreement measure.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2012 Mathematics Long-Term Trend Assessment.
MB N294201 1-2 800 100
N294301 1-2 800 100
N295301 1-2 800 99
N295401 1-2 800 99
N295701 1-2 800 98
MC N322401 1-2 800 100
N323101 1-2 800 99
N323701 1-2 800 100
N323901 1-2 800 100
N324001 1-2 800 99
MD N312901 1-2 800 99
N313001 1-2 800 100
N313101 1-2 800 99
N313102 1-2 800 99
N313103 1-2 800 99
N313701 1-2 800 100
N313702 1-2 800 99
N313801 1-2 800 100
N313901 1-2 800 100
ME N287601 1-2 800 98
N288101 1-2 800 100
N288501 1-2 800 97
N289201 1-2 800 99
MF N289701 1-2 800 100
N289901 1-2 800 100
N290101 1-2 800 99
N290701 1-2 800 99
N291501 1-2 800 100

Score range and Cohen's Kappa or intraclass correlation for the constructed-response items from previous years that were rescored in 2012, age 9 mathematics long-term trend assessment, by item and block: 2012
Block Item Range of response codes Sample size Cohen’s Kappa Intraclass correlation
† Not applicable. The intraclass correlation is not reported for dichotomously scored items; Cohen’s Kappa is not reported for polytomously scored items.
NOTE: Cohen's Kappa  is a measure of reliability that is used for items that are dichotomously scored. The intraclass correlation is used for items with more than two categories. The discrepancy in sample sizes compared to the first table is due to the fact that percent agreement is calculated from the entire sample size, while Cohen's Kappa and intraclass correlation statistics exclude those who omit the item.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2012 Mathematics Long-Term Trend Assessment.
MB N294201 1-2 800 0.96
N294301 1-2 800 0.99
N295301 1-2 800 0.99
N295401 1-2 800 0.98
N295701 1-2 800 0.97
MC N322401 1-2 800 0.99
N323101 1-2 800 0.99
N323701 1-2 700 0.99
N323901 1-2 800 1.00
N324001 1-2 700 0.99
MD N312901 1-2 800 0.97
N313001 1-2 800 0.99
N313101 1-2 800 0.98
N313102 1-2 800 0.99
N313103 1-2 800 0.99
N313701 1-2 700 0.99
N313702 1-2 700 0.99
N313801 1-2 800 0.99
N313901 1-2 800 0.99
ME N287601 1-2 700 0.98
N288101 1-2 800 0.99
N288501 1-2 800 0.99
N289201 1-2 800 0.98
MF N289701 1-2 800 1.00
N289901 1-2 800 0.99
N290101 1-2 800 0.98
N290701 1-2 800 0.98
N291501 1-2 700 0.99

Last updated 13 November 2013 (JL)