NAEP assessments include cognitive items and contextual items. Cognitive items are designed to assess what students know and can do, and are based on the framework and specifications documents for each assessment subject. These types of items include multiple-choice items,
constructed-response items scored
dichotomously, constructed-response items scored
polytomously, and interactive scenario-based tasks (SBTs). Contextual items are administered to students, teachers, and school administrators via survey questionnaires that collect additional information about students' demographics and experiences in and out of school.
The item-development steps for each subject area are as follows:
The National Assessment Governing Board (the Governing Board) provides content frameworks and item specifications in each subject area.
The instrument development committee in each subject area provides guidance to NAEP staff about how the objectives described in the framework can be measured given the constraints of resources and the feasibility of measurement technology. The committee makes recommendations about priorities for the assessment (within the context of the assessment framework) and the types of items to be developed.
Concept sketches for SBTs are developed and submitted for NCES and NAGB approval prior to item development.
Specialists with subject-matter expertise and experience in creating items according to specifications develop and review the assessment questions.
For a new stimulus, the copyright permission process should begin at the beginning of the item development phase. If the stimulus (text, artwork, graphics, charts, diagrams, music, video, etc.) is new, a permission form will first need to be completed by the appropriate subject assessment specialist. The NAEP copyright coordinator will review the permission request form, along with the provided attachments, and submit to the appropriate office at a minimum of 12 weeks in advance of the date for receipt of permission.
For a stimulus that needs to be renewed, the process should also begin at the beginning of the item development phase. The assessment specialist will first identify the items and blocks to be used and inform the NAEP copyright coordinator. The copyright coordinator will review and confirm that the number of uses are not exhausted, and the expiration date is still current to use in the upcoming assessment.
NAEP test development staff and external test specialists review and revise the items and accompanying scoring guides.
Editorial and fairness reviews are conducted as required by NCES.
Pilot test materials are prepared, and those that require secure clearance are sent to the federal
Office of Management and Budget (OMB). Contextual items are submitted to the OMB for clearance, while cognitive items do not need to be approved. In addition, materials such as recruitment and communication documents that would be sent to the field (e.g., Facts for Teachers and Facts for Districts) may also be included in clearance packages.
A pilot test is conducted in many of the states and jurisdictions slated to participate in the next operational assessment.
Based on the pilot test analyses, items are selected for inclusion in the operational assessment.
Each subject-area instrument development committee approves the selection of items to include in the next operational assessment.
Each subject-area instrument is submitted to the Governing Board for approval.
Operational materials, namely the contextual items and documentation showing which contextual items were removed, added, or revised are sent to the OMB to secure clearance.
The NAEP translation process typically begins after the English language versions of items or tasks have been finalized. The translation team prepares the initial translations which then undergo review by bilingual panels of experts, including content experts, teachers in the content area at grades 4 and 8, and linguistic experts. In addition, an external vendor conducts a complete independent Translation Verification Review of all translated assessment materials. Thereafter, NCES reviews and approves the translations prior to certification.
There are two types of NAEP digitally based assessment bilingual accommodations. For grades 4 and 8 mathematics and science operational assessments, and for grade 8 civics and U.S. history operational assessments, NAEP offers an English-Spanish full bilingual version as an accommodation for English learners (mainland U.S. administration). Selected cognitive blocks and all directions screens, tutorial screens, help screens, toolbar and rollover text, dialog boxes, and student questionnaires are presented in English and Spanish. Students can toggle back and forth between languages. In addition, for grades 4 and 8 mathematics, reading, and science operational assessments, and for grade 8 civics and U.S. history operational assessments, NAEP offers an English-Spanish partial bilingual version as an accommodation for English learners (mainland U.S. administration). All directions screens, tutorial screens, help screens, toolbar and rollover text, dialog boxes, and student questionnaires are presented in English and Spanish. Students can toggle back and forth between languages, but cognitive content remains in English.
For NAEP paper-based assessments, NAEP offered English-Spanish full bilingual versions as an accommodation for English learners (mainland U.S. administration). Selected forms with cognitive blocks, directions, and student questionnaires were presented in English and Spanish on facing pages.
After a final review, the booklets are printed or packaged as digital test forms for computer delivery.
Each administration of the NAEP assessment requires a new configuration of the student booklets given to students and how they are distributed to schools. To allow for wide content coverage within the limited testing time for each student, the instrument configuration entails a three-step design process for the subject areas to be assessed:
In the first step, NAEP uses a focused balanced incomplete block (BIB) or partially balanced incomplete block (pBIB) design to assign blocks or groups of cognitive items to student booklets or test forms. The "focused" aspect of NAEP's booklet or test form design requires that each student answer questions from only one subject area. In a BIB design, the cognitive blocks are balanced; each cognitive block appears an equal number of times in every possible position. Each cognitive block is also paired with every other cognitive block in a test booklet or test form exactly the same number of times. In a pBIB design, cognitive blocks may not appear an equal number of times in each position, or may not be paired with every other cognitive block an equal number of times.
Second, the spiraling scheme is designed. Spiraling refers to interleaving booklets or test forms systematically so that when they are handed out in the specified order, any group of students will receive approximately the target proportions of different types of booklets or test forms. This same process is applied in the development of contextual questionnaires for NAEP's digitally based assessments.
The third aspect is the bundling design. In 2003, NAEP test developers introduced an enhanced bundling design, referred to as vertical bundling. Vertical bundling has flexibility with respect to bundle length and reduces the required number of different bundles, decreasing booklet wastage, and improving balance of within-session booklet or test form pairings.
|Note: Until the 1984 assessment, NAEP was administered using matrix sampling and tape recorders; that is, by administering booklets of exercises using paced audio tapes that walked groups of students through the individual assessment exercises in a common booklet. In the 1984 assessment, a balanced incomplete block booklet design, which does not include audio tape pacing, was introduced in place of taped matrix sampling.|