Range of response codes, percentage exact agreement, and Cohen's Kappa or intraclass correlation for the constructed-response items used in scaling, by block and item, grade 8 U.S. history assessment: 2001
Block  Item     Range of response codes  Sample size  Percentage exact agreement  Cohen's Kappa  Intraclass correlation
H3     H035801  1–3                      600          92                          †              0.88
       H035901  1–3                      600          93                          †              0.90
       H035902  1–3                      600          88                          †              0.89
       H036101  1–3                      600          88                          †              0.94
       H036402  1–4                      600          86                          †              0.95
H4     H059001  1–3                      600          88                          †              0.93
       H059201  1–3                      600          83                          †              0.90
       H059701  1–3                      600          94                          †              0.97
       H059801  1–3                      600          86                          †              0.92
       H060201  1–3                      600          88                          †              0.94
H5     H038103  1–4                      600          88                          †              0.86
       H038301  1–3                      600          91                          †              0.96
       H038601  1–3                      600          86                          †              0.79
       H038702  1–3                      600          88                          †              0.86
       H039001  1–3                      600          98                          †              0.99
H6     H039401  1–3                      600          89                          †              0.90
       H039901  1–4                      600          89                          †              0.93
       H040001  1–3                      600          97                          †              0.98
       H040103  1–3                      600          89                          †              0.93
       H040201  1–3                      600          95                          †              0.96
H7     H057501  1–3                      600          97                          †              0.98
       H057701  1–3                      600          84                          †              0.85
       H057801  1–3                      600          86                          †              0.92
       H058601  1–3                      600          88                          †              0.94
       H058701  1–3                      600          94                          †              0.97
H8     H034101  1–4                      700          90                          †              0.92
       H034401  1–3                      700          95                          †              0.96
       H034501  1–2                      700          100                         0.99           †
       H034702  1–3                      700          96                          †              0.96
       H035001  1–3                      700          87                          †              0.93
       H035101  1–3                      700          90                          †              0.93
H9     H060701  1–3                      600          96                          †              0.96
       H061501  1–3                      600          93                          †              0.90
       H061601  1–3                      600          92                          †              0.94
       H061801  1–3                      600          98                          †              0.98
H10    H042201  1–3                      600          98                          †              0.95
       H042801  1–4                      600          93                          †              0.94
       H042902  1–3                      600          95                          †              0.97
       H043001  1–3                      600          90                          †              0.89
       H043101  1–3                      600          95                          †              0.97
H11    H043201  1–3                      600          92                          †              0.94
       H043401  1–3                      600          86                          †              0.92
       H043501  1–4                      600          88                          †              0.94
       H043601  1–3                      600          85                          †              0.86
       H043701  1–3                      600          92                          †              0.96
       H043705  1–3                      600          86                          †              0.87
       H044001  1–4                      600          86                          †              0.90
† The intraclass correlation is not reported for dichotomously scored items; Cohen's Kappa is not reported for polytomously scored items.
NOTE: Cohen's Kappa is a measure of reliability that is appropriate for items that are dichotomously scored. The intraclass correlation coefficient is most appropriate for items with more than two categories.
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2001 U.S. History Assessment.
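
As an illustration of two of the statistics in the table, the following is a minimal Python sketch of how percentage exact agreement and Cohen's Kappa can be computed from two raters' scores on the same responses. The function names and the sample scores are hypothetical and are not taken from the NAEP documentation; the sketch uses the standard textbook definition of Cohen's Kappa (observed agreement corrected for chance agreement) and is not a reproduction of the NAEP scoring procedure.

```python
from collections import Counter


def percent_exact_agreement(first, second):
    """Percentage of responses given the same code by both raters."""
    if len(first) != len(second) or not first:
        raise ValueError("score lists must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(first, second))
    return 100.0 * matches / len(first)


def cohens_kappa(first, second):
    """Cohen's Kappa: (p_o - p_e) / (1 - p_e).

    p_o is the observed proportion of exact agreement; p_e is the
    agreement expected by chance from each rater's marginal code counts.
    """
    n = len(first)
    if n != len(second) or n == 0:
        raise ValueError("score lists must be non-empty and of equal length")
    p_o = sum(a == b for a, b in zip(first, second)) / n
    counts_first = Counter(first)
    counts_second = Counter(second)
    codes = counts_first.keys() | counts_second.keys()
    p_e = sum(counts_first[c] * counts_second[c] for c in codes) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_o - p_e) / (1.0 - p_e)


# Made-up scores for a dichotomously scored item (codes 1-2), for illustration only.
first_rater = [1, 1, 2, 2, 1, 2, 1, 1, 2, 2]
second_rater = [1, 1, 2, 2, 1, 2, 1, 2, 2, 2]

print(percent_exact_agreement(first_rater, second_rater))
print(round(cohens_kappa(first_rater, second_rater), 2))
```

Because Kappa discounts the agreement expected by chance, it is generally lower than the raw proportion of exact agreement for the same pair of raters.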

Last updated 15 July 2008 (KL)

National Center for Education Statistics - http://nces.ed.gov
U.S. Department of Education