Skip to main content
Skip Navigation

Table of Contents  |  Search Technical Documentation  |  References

NAEP Analysis and Scaling → Initial Activities → Constructed-Response Interrater Reliability → Range of Response Codes, Percent Agreement, and Cohen's Kappa or Intraclass Correlation for the Constructed-Response Items Used in Scaling, Grade 12 U.S. History Assessment, by Item and Block: 2006
NAEP Technical DocumentationRange of response codes, percent exact agreement, and Cohen's Kappa or intraclass correlation for the constructed-response items used in scaling, grade 12 U.S. history assessment, by item and block: 2006
Block Item Range of response codes Sample size Percent exact agreement Cohen's Kappa Intraclass correlation
H3 H073501 1–3 500 82.00 0.85
H074001 1–3 500 94.00 0.93
H074401 1–3 500 78.00 0.76
H074801 1–3 300 90.00 0.88
H075101 1–4 400 83.00 0.87
H4 H045509 1–3 500 80.00 0.78
H045901 1–4 500 84.00 0.85
H046001 1–2 600 99.00 0.98
H046101 1–3 500 98.00 0.98
H046301 1–3 500 84.00 0.79
H5 H062001 1–3 500 79.00 0.72
H062201 1–3 500 83.00 0.81
H063101 1–3 500 89.00 0.88
H063401 1–4 500 87.00 0.91
H063601 1–3 500 81.00 0.79
H6 H048901 1–3 200 97.00 0.98
H049401 1–4 500 88.00 0.58
H049503 1–3 400 85.00 0.78
H049601 1–4 500 85.00 0.85
H049701 1–3 400 97.00 0.98
H7 H075301 1–3 500 76.00 0.70
H075701 1–3 500 93.00 0.86
H076401 1–3 500 73.00 0.76
H076501 1–3 500 91.00 0.87
H076801 1–3 500 83.00 0.85
H8 H051301 1–3 500 77.00 0.72
H052301 1–4 600 74.00 0.83
H052501 1–3 500 87.00 0.87
H052701 1–3 500 83.00 0.85
H9 H060701 1–3 500 92.00 0.94
H061501 1–3 600 86.00 0.86
H061601 1–3 600 92.00 0.88
H061801 1–3 500 92.00 0.89
H10 H042201 1–3 500 94.00 0.85
H042801 1–4 600 83.00 0.87
H042902 1–3 500 89.00 0.90
H043001 1–3 600 78.00 0.79
H043101 1–3 500 95.00 0.94
H11 H063801 1–3 600 78.00 0.70
H064001 1–3 500 78.00 0.74
H064101 1–3 500 81.00 0.83
H064401 1–4 500 69.00 0.79
H064901 1–3 500 85.00 0.85
H065101 1–3 500 80.00 0.78
H065201 1–3 800 78.00 0.80
H065301 1–3 500 81.00 0.81
H065401 1–4 500 77.00 0.85
† The intraclass correlation is not reported for dichotomously scored items; Cohen's Kappa is not reported for polytomously scored items.
NOTE: Cohen's Kappa is a measure of reliability that is appropriate for items that are dichotomously scored. The intraclass correlation coefficient is most appropriate for items with more than two categories.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2006 U.S. History Assessment.

Last updated 19 November 2009 (GF)

Printer-friendly Version