|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
West Virginia Department of Education SAT-9 and Testing Conditions: Research SummaryWest VirginiaPublished by the National Center on Educational Outcomes
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Test Takers |
1999 |
2000 |
2001 |
|||
|
Standard |
Non-Standard |
Standard |
Non-Standard |
Standard |
Non-Standard |
|
Non-Disabled Students (General
Education) |
159,631 |
4,452 |
155,446 |
4,734 |
153,496 |
4,837 |
Students with Disabilities
(Special Education) |
10,056 |
17,063 |
9,608 |
18,541 |
9,434 |
19,332 |
|
Total |
191,202 |
188,329 |
187,099 |
|||
Students with disabilities as a group made
gains in reading until the ninth grade, when reading achievement stabilized.
General education students, however, continued to make gains in reading, thus
further increasing the gap between students based on educational status (general
education versus special education).
Special education students (students with disabilities) had marked growth in
math from year to year. Throughout
the research documented in this study, special education students’ language arts
scores lagged behind reading and math scores, with the trend becoming most
pronounced in the middle school grades. Figures 1 through 3 illustrate the test
performance at different grades of students with disabilities and general
education students.



Students with behavior disorders and students with specific learning disabilities showed marked similarities in their reading, math, and language arts achievement. Students with speech/language impairments performed most similarly to general education students, and students with mild to moderate mental impairment performed at the lowest levels among these groups. When the scaled scores of students with different disabilities were disaggregated by testing conditions (standard versus non-standard), there was no difference (less than 2 points) based on testing conditions in the test scores of students with mental impairment. However, scaled scores of students with specific learning disabilities, behavior disorders, and with speech/language impairments showed large 22- and 23-point differences between those tested under standard and non-standard conditions. See Figure 4 for more detail.

In examining test scores based on levels of least restrictive environment, the
students with disabilities who were in regular education full time performed at
the highest levels. However, students in regular education
full time who received modifications (non-standard) performed similarly to those
in regular education part time who did not (standard conditions).
Furthermore, students in regular education part time who took the test under
non-standard conditions scored similarly to students who spent most of their
time in a separate special education class.
Districts have different policies, or at least different implementation
procedures for deciding which students with disabilities take the test under
standard versus non-standard conditions. An initial look at a sample of districts
chosen to represent the diversity in
Relationships among variables were examined by calculating the effect sizes for
scaled score comparisons (t-tests) between the two testing conditions.
Results showed that testing conditions make a more profound practical difference
in language arts than in either reading or math scores.
Although the differences due to testing conditions differ across subject areas,
they are stable across time.
Sequential regressions were also employed to determine the relative roles of
type of disability, LRE, and testing conditions in predicting special education
students’ SAT-9 test scores. The three variables did not account for
much of the variance in test scores (25-30%), and account for less variance as
grade level increases. Moreover, the predictive utility of the
testing conditions variable decreased as grade level increases, whereas the
opposite was true for both LRE and type of disability.
Therefore, students’ SAT-9 test scores appear to be largely and increasingly
dependent on LRE and type of disability with testing conditions affecting
performance less as grade level increases, but overall some 70 to 75% of the
variance in students’ test scores can be explained by other unknown factors.
A sample of eight districts representing different performance levels was drawn
from the population data and examined in depth to determine the extent to which
SAT-9 scores of students with disabilities varied as a function of district
performance levels, testing conditions, disability status, content area, and
student school level (elementary, middle, or high). Districts were identified by examining all 55
The sample was further limited to students who were continuously enrolled for
all three years in the same district, and to students whose disability status
and LRE had remained the same between 1999 and 2001.
In addition, the types of disabilities examined were limited to behavior
disorders, speech/language impairments, specific learning disabilities, and
other health impairments. LRE was
also limited to regular education full time, regular education part time, and
special education separate class.
Thus, the final sample consisted of 20,950 students, including 19,506 students
in general education and 1,444 in special education.
As shown in Table 2, in this sample, lower performing districts tested larger proportions of students in non-standard conditions than did high performing districts. Moreover, lower performing districts tested decreasing numbers of students in standard conditions from 1999 to 2001, whereas higher performing districts tended to test more of their students in special education under standard conditions during this period.
Table 2. Distribution of Students across Testing Conditions by Year and District Performance Level
District Performance Level |
District |
Testing Condition |
1999 |
2000 |
2001 |
|||
|
Number |
Percent |
Number |
Percent |
Number |
Percent |
|||
High Performing |
Mineral |
Standard |
100 |
62.1 |
112 |
69.6 |
127 |
78.9 |
|
Non-Standard |
61 |
37.9 |
49 |
30.4 |
34 |
21.1 |
||
|
Wood |
Standard |
219 |
73.0 |
225 |
75.0 |
235 |
78.3 |
|
|
Non-Standard |
81 |
27.0 |
75 |
25.0 |
65 |
21.7 |
||
|
Total |
Standard |
319 |
69.2 |
337 |
73.1 |
362 |
78.5 |
|
|
Non-Standard |
142 |
30.8 |
124 |
26.9 |
99 |
21.5 |
||
Average |
|
Standard |
74 |
54.0 |
72 |
52.6 |
77 |
56.2 |
|
Non-Standard |
63 |
46.0 |
65 |
47.4 |
60 |
43.8 |
||
|
|
Standard |
80 |
43.7 |
70 |
38.3 |
99 |
54.1 |
|
|
Non-Standard |
103 |
56.3 |
113 |
61.7 |
84 |
45.9 |
||
|
Total |
Standard |
154 |
48.1 |
142 |
44.4 |
176 |
55.0 |
|
|
Non-Standard |
166 |
51.9 |
178 |
55.6 |
144 |
45.0 |
||
Improving |
|
Standard |
56 |
36.1 |
47 |
30.3 |
36 |
23.2 |
|
Non-Standard |
99 |
63.9 |
108 |
69.7 |
119 |
76.8 |
||
|
Mingo |
Standard |
50 |
26.6 |
49 |
26.1 |
55 |
29.3 |
|
|
Non-Standard |
138 |
73.4 |
139 |
73.9 |
133 |
70.7 |
||
|
Total |
Standard |
106 |
30.9 |
96 |
28.0 |
91 |
26.5 |
|
|
Non-Standard |
237 |
69.1 |
247 |
72.0 |
252 |
73.5 |
||
Low Performing |
|
Standard |
38 |
21.1 |
32 |
17.8 |
29 |
16.1 |
|
Non-Standard |
142 |
78.9 |
148 |
82.2 |
151 |
83.9 |
||
|
McDowell |
Standard |
24 |
17.1 |
15 |
10.7 |
18 |
12.9 |
|
|
Non-Standard |
116 |
82.9 |
125 |
89.3 |
122 |
87.1 |
||
|
Total |
Standard |
62 |
19.4 |
47 |
14.7 |
47 |
14.7 |
|
|
Non-Standard |
258 |
80.6 |
273 |
85.3 |
273 |
85.3 |
||
Several series of sequential regressions were used to determine the extent to
which district performance levels accounted for variance in SAT-9 test scores,
and the extent to which they predicted the assignment to testing conditions.
In the first series of regressions, LRE, district performance level, school
level (elementary, middle, or high school), and four dichotomous disability
categories (behavior disorder, speech/language impairment, specific learning
disabilities, and other health impairments), were used to predict testing
conditions in each of the three years. The next series of regressions repeated
the first but at each of the three school levels—elementary school (grades 3-5),
middle school (grades 6-8), and high school (grades 9-11).
This process was repeated to predict special education students’ SAT-9 scores,
this time however testing conditions was included as a predictor.
Finally, the same process was used to examine the role of district performance
level in predicting general education students’ SAT-9 scores.
Results showed the
following:
Table 3. Proportion of Variance Accounted for in Assignment to Testing Conditions by School Level for Each Independent Variable (?) and the Combined Total (R2)
|
Year |
Independent Variables**
|
Variance Accounted for in Testing Conditions |
|||||
1999
|
2000 |
2001 |
|||||
|
? |
R2 |
? |
R2 |
? |
R2 |
||
|
Elementary School |
Speech/Language Impairments |
.454 |
.596 |
.544 |
.655 |
.464 |
.634 |
|
LRE |
.105 |
.072 |
.044 |
||||
District Performance Level |
.035 |
.036 |
.116 |
||||
Behavior Disorders |
.002 |
.003 |
* |
||||
Other Health Impairments |
* |
* |
.011 |
||||
|
Middle School |
LRE |
.235 |
.351 |
.277 |
.442 |
.315 |
.525 |
|
District Performance Level |
.073 |
.101 |
.095 |
||||
|
Speech/Language Impairment |
.044 |
.055 |
.110 |
||||
Behavior Disorders |
* |
.009 |
.005 |
||||
|
High School |
District Performance Level |
.449 |
.503 |
.471 |
.547 |
.439 |
.573 |
|
LRE |
.054 |
.069 |
.005 |
||||
|
Behavior Disorders |
* |
.006 |
* |
||||
Specific Learning Disabilities |
* |
* |
.005 |
||||
*Variable did not make a statistically significant
contribution to the regression equation.
**Variables not listed did not make a statistically significant contribution to
the regression equation.
These results taken together underscore the importance of appropriate assignment to LRE, and suggest that the influence of district performance level on SAT-9 test scores is indirect in that it affects assignment of students with disabilities to testing conditions, which in turn exert a modest effect on their test performance. It is apparent from the earlier findings that as district performance levels decrease, the percentage of students with disabilities who take the test under standard conditions also diminishes. Therefore, although testing conditions only appear to have a minimal effect on test performance, because of policies and/or practices in low-performing districts, student performance—especially that of students under non-standard testing conditions—may be somewhat artificially inflated by the unnecessary assignment of students with disabilities to non-standard testing conditions. This evidence may prove valuable in WVDE staff efforts to increase the standard condition participation rates of students with disabilities, and help persuade superintendents and school boards that excluding these students’ scores does little to enhance their overall test performance.
This study is significant because very little previous research has analyzed
data from statewide administrations of a norm-referenced assessment to determine
what effect testing conditions have on test scores of students with
disabilities. Implications for WVDE personnel and
policymakers are discussed with regard to improving data
collection/documentation associated with testing conditions of students with
disabilities, collecting data on the specific testing accommodations and
modifications that are used, and redefining accountability measures for schools
and counties using student test scores, particularly as