Skip to main content
Skip Navigation
NAEP Analysis and Scaling → Initial Activities → Constructed-Response Interrater Reliability → Range of Response Codes, Percent Agreement, and Cohen's Kappa or Intraclass Correlation for the Constructed-Response Items Used in Scaling, Age 13 Reading Long-Term Trend Assessment, by Item and Block: 2008

NAEP Technical DocumentationRange of response codes, percent agreement, and Cohen's Kappa or intraclass correlation for the constructed-response items used in scaling, age 13 reading long-term trend assessment, by item and block: 2008
Block Item Range of response codes Sample size Percent agreement Cohen’s Kappa Intraclass correlation
— Not available. The intraclass correlation is not reported for dichotomously scored items; Cohen's Kappa is not reported for polytomously scored items.
NOTE: Cohen's Kappa is a measure of reliability that is appropriate for items that are dichotomously scored. The intraclass correlation coefficient is most appropriate for items with more than two categories. Item statistics in this table are described on the page Blocks of Items.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2008 Reading Long-Term Trend Assessment.
R21 N021203 1–5 600 72 0.83
R22 N023802 1–4 600 85 0.86
R23 N021803 1–4 600 94 0.95
R24 N022103 1–4 600 77 0.75
R25 N022405 1–4 600 92 0.90
R28 N027302 1–5 600 85 0.89
R29 N029002 1–4 600 83 0.86

Last updated 28 September 2011 (JL)