Explore NAEP results! By clicking "continue" you will be leaving the National Assessment of Educational Progress (NAEP) operational website and opening The Nation's Report Card (NRC) website. Explore NAEP results about students' performance, and access a suite of data tools.
NAEP Technical DocumentationNAEP Variance Estimation During Transition to Digitally Based Assessment
Suppose
common population linking is used to align the digitally based assessment (DBA) scores from future reporting samples to the existing trend scale used in reporting the paper-based assessment (PBA) results (referred to as trend PBA). In NAEP, two randomly equivalent samples from the same population, one administered with DBA, while the other one with PBA (referred to as bridge PBA), are used in estimating the common population linking function. An approach is described below for estimating error variance of the group statistics for the DBA assessments. The statistic of interest,
, can be a mean, a percentage, an achievement-level percentage, etc.
Procedures for estimating error variance for DBA group scale score statistics, when comparing groups between DBA and trend PBA—External Linking
Suppose that the common population linking functions are estimated using two randomly equivalent samples—the DBA sample and the bridge PBA sample, and then applied to link the results of a third independent DBA sample to the existing trend scale in reporting the trend PBA results. This is referred to as external linking.
Suppose the DBA results are linked to the trend scale in reporting PBA via external linking. When comparing the DBA result,
, to the trend PBA result,
, where PBA and DBA are two independent samples, the error variance of
is
where
and
are the
sampling and measurement variances for
. The term
is the linking error variance associated with the uncertainty in estimating the common population linking function. When external linking is used,
is estimated using a quadratic function which describes the relationship between the linking variance and
. Specifically, the quadratic function for any subgroup has the following general form:
, (4)
where
,
, and are coefficients estimated through a Monte Carlo based method. See
Mazzeo, Donoghue, Liu, and Xu (2018) for more discussion of the method of calculating the linking variance for external linking.
The approach described here is used in estimating the error variance of the statistics from Mathematics and Reading DBA at grades 4 and 8, when comparing the group scale score statistics from the DBA in 2017 and future years to those from PBA in 2015 and earlier years. Tables that provide the quadratic function coefficients for means and percentages, standard deviations, and achievement-level percentages can be accessed via links in the table below.
The linking error component derived from external linking does not apply when
comparing group scale score statistics from DBA; or
comparing group scale score statistics from PBA.
Procedures for estimating error variance for group scale score statistics from combined PBA/DBA sample—Internal Linking
Suppose that the common population linking functions are estimated using two randomly equivalent samples—the DBA sample and the bridge PBA sample, and then applied to link the results of the same DBA sample used in deriving the linking function to the existing trend scale in reporting the trend PBA results from previous years. This is referred to as internal linking. In addition, suppose that the DBA and bridge PBA samples used in deriving the linking function are combined in estimating group statistics of interest,
.
The estimate of the statistic of interest is computed as
.
Under internal linking, the error variance of
is estimated as
, (5)
where
is the sampling variance, considering the uncertainty due to sampling in deriving the linking functions as well as in the estimation of the group statistics;
is the measurement variance, considering the uncertainty due to latency in deriving the linking functions as well as in estimation of the group statistics.
Estimating error variance of group scale score statistics on subscale, combined PBA/DBA sample and internal linking
The following steps can be taken to estimate the error variance of statistics of interest
using the combined PBA/DBA sample, for a subscale
s.
To estimate the sampling error variance,
, a total of 62 pairs of transformation coefficients
are used. These transformation coefficients are derived from linking the DBA mean and standard deviation to the PBA mean and standard deviation for each replicate
i. Also, the first set of PVs of the combined sample
are used. Here,
is the DBA PVs, and
is the bridge PBA PVs.
For each pair of transformation coefficients
, do the following:
Apply
to transform
, in the following way:
. (6)
Combine the transformed sets of DBA PVs,
with the bridge PBA-part of the PVs,
to get the combined PV: .
Calculate the statistic of interest,
based on
and
where
denote the
replicate weight for the combined sample.
is then calculated as
(7)
where
.
To calculate the measurement error variance , 100 pairs of one set of PBA PVs and one set of DBA PVs are randomly chosen. This leads to a total of 100 pairs of transformation coefficients, which are grouped into 5 replications with 20 pairs of coefficients within each replication:
.
In each replication, transformation coefficients are calculated for each of the 20 pairs of a set of DBA PVs and a set of PBA PVs. Hence, for the
replication,
Apply the transformation coefficients to transform the DBA PVs in the following way:
For each set of , combine them with the bridge PBA PVs as
For a given
is a random permutation of the 20 sets of bridge PBA PVs, which is used in deriving the transformation coefficients .
using the
replication of coefficients is calculated as
(8)
where
is calculated based on , weighted by the student sampling weight. And
.
is then calculated as:
Note that for NAEP subjects that report on a univariate scale only (e.g., 2018 Civics), the error variance estimation for group statistics on the overall scale based on the combined PBA/DBA sample follows the procedure described for subscale.
Estimating error variance of group scale score statistics on composite scale, combined sample and internal linking
To calculate the sampling variance of statistics on composite scale, assume there are a total of
S subscales and the subscale weights are where
.
First calculate the DBA composite PVs in the following way:
.
Let
be the
set of PVs on composite scale where
is the first set of composite scale PVs for bridge PBA.
For statistic
on composite scale,
is then calculated as
(9)
where
is calculated based on
and
.
To calculate the measurement error variance of the group scale score statistics on composite scale,
Calculate the DBA and bridge PBA composite PVs in the following way:
and .
Let
be the set of PVs within replication
on composite scale; where
is the
set of composite scale PV for bridge PBA within replication
.
is calculated as
(10)
where
is calculated based on , weighted by the student sampling weight, and
.
The approaches described here are used in estimating the error variance of the statistics for the 2018 Civics, Geography, and U.S. History assessments, when
1) estimating error variance for the combined PBA/DBA sample group scale scores statistics themselves within 2018; or
2) comparing the group scale score statistics from the combined PBA/DBA sample in 2018 to those from trend PBA in 2014 and earlier years.
Tables that provide the transformation coefficients which are needed for calculating the sampling variances can be accessed via links in the table below.
Also included are tables that provide the transformation coefficients as well as the index of the set of PVs (out of the 20 sets of PBA PVs and 20 sets of DBA PVs) that are used for calculating the measurement variances. For example, (k=2, DBA PV=1, Bridge PBA PV=14) means for the 2nd replication, the first set of PVs from DBA and 14th set of PVs from PBA bridge were used in computing the transformation coefficients.
When comparing the statistics estimated from the combined PBA/DBA sample,
, to the one estimated from trend PBA in previous years,
, where the trend PBA sample and the combined sample are independent, the error variance of
is
. (11)
is estimated as
, (12)
where
and
are the sampling and measurement variances for
.
Links to coefficients used for variance calculation, by subject: 2017, 2018, and 2019
SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2017, 2018, and 2019 Assessments.