author:
|
Eric Banilower
|
published in:
|
Horizon Research
|
published:
|
02/14/2001
|
posted to site:
|
02/14/2001
|
Table 8
Scores on the NSRE by Class Type
Scale | IMP | Non- IMP | Difference | Effect Size |
NSRE 10th Grade Math Scale Score | 150.7 | 145.1 | 5.6* | .57 |
NSRE 10 th Grade SAT- 9 Scale Score | 738.8 | 718.2 | 20.6* | .54 |
Percent of Students Meeting or Exceeding Standard in Math Skills (NSRE) | 97.0 | 72.7 | 14.3* | .75 |
Percent of Students Meeting or Exceeding Standard in Math Problem Solving (NSRE) | 39.4 | 15.2 | 24.2* | .56 |
Percent of Students Meeting or Exceeding Standard in Math Concepts (NSRE) | 36.4 | 24.2 | 12.2 |
Project 4 (K- 12 Mathematics)
This study examined the results of the school district's implementation of a standards- based
educational system using student achievement scores. The district used two national assessments
for this purpose. The first was the New Standards Mathematics Reference Examination for
grade 4 that contains sub- scales for skills, concepts, and problem solving. The second
assessment used was the Iowa Test of Basic Skills (ITBS), chosen to verify that students using
the new reform- oriented curriculum did not suffer in basic skills.
The district first administered the New Standards exam in 1996. Comparing achievement scores
from 1996 to 1998, the district showed a significant increase in the percent of students meeting
or exceeding the standard in all three areas: skills, concepts, and problem solving. Further, data
from the ITBS show a small but significant increase in student achievement, lending evidence to
the claim that students experiencing a reform- oriented curriculum do not do worse in basic skills,
and may in fact do better than students experiencing a traditional curriculum.
Taking the analysis a step further, the project then compared schools based upon whether they
were rated as strong or weak implementers of the mathematics program. Ratings of individual
teachers were made by teacher leaders at each site and were based upon 1
st through 4th grade
teachers' use of the curriculum (Everyday Mathematics) as intended. The three sites classified
as weak schools were those where all but one or two teachers in grades 1 through 4 were rated as
weak implementers. In order to be classified as a strong implementing school, all 3rd and 4th
grade teachers had to be strong implementers (8 schools met this criterion). To protect against
initial differences between strong and weak schools, the strong schools were further split into
two groups. One contained three strong schools with similar demographics to the weak schools
(number of students, percent free/ reduced lunch, percent African- American, etc.) and the other
contained the remaining strong schools.
Students in both groups of strong implementing schools outperformed those from weak
implementing schools on all three sub- scales of the New Standards exam and on the ITBS.
Further, while a sizable achievement gap existed between white and African- American students
at all schools, both groups in strong implementing schools outperformed their counterparts at
weak implementing schools.
Project 5 (6- 12 Mathematics)
This project looked at the percent of students passing the state's 8th grade mathematics
assessment, comparing average passing rates for districts within the project to the state as a
whole. As can be seen in Table 9, districts in the LSC averaged a 10% increase in their pass
rates (from 36% to 46%) one year after the implementation of the LSC compared to a 7%
increase state- wide (from 59% in 1998 to 66% in 1999). Unfortunately, the study does not build
a case as to why or how the LSC is responsible for the larger gains in LSC district. As the LSC
districts started with a pass rate well below the state average, it could be argued that the increase
is to some extent due to regression to the mean.
Table 9
Percent of Students Passing the State Mathematics Assessment
| 1998 | 1999 | Difference |
LSC Districts | 36 | 46 | 10 |
State Average | 59 | 66 | 7 |
Science Studies
HRI was able to obtain the results of studies from four science projects. While there is quite a bit
of variability in the quality of the study designs, in general, the science studies did a better job of
controlling for threats to internal validity. As with the mathematics studies, the results of the
science studies are generally positive.
Project 6 (K- 8 Science)
With no state assessment in science, individual districts are given the prerogative to choose when
and how to measure student achievement in science. Although they use a variety of instruments,
most districts do administer a science assessment at the 4th and 8th grade level. These include the
ITBS, CTBS, and SAT/ OLSAT. Thus, the project was able to collect student achievement trend
data from sixteen districts it serves. However, as nine of the districts are very small, rural
districts, the project aggregated their data into one composite district, leaving eight districts
(seven of the original sixteen and the one composite district) to be analyzed. Of these, four have
baseline data (student test scores from the year prior to LSC implementation) giving them four
data points. Three districts have three years of data, and one district has two years of data.
From project records, the average number of professional development hours per school as well
as the level of kit- usage (low, medium, and high) for each district were computed. Then, districtwide
scores were examined (visually) to see if any patterns emerged. The data, shown in Table
10, reveal no clear trends in the relationship between PD hours and changes in student
achievement scores or between kit usage and increased achievement. However, given the
inadequacy of the data available (district means of student scores and measures of participation
in the LSC), it is highly unlikely that any changes, good or bad, could be detected, much less an
argument be made that the LSC was responsible for those changes.
Table 10
School Test Scores and Level of Participation in LSC
School | Grade | Number of Years in Project | Baseline Score | 1999 Score | Difference | Average Hours of LSC PD | Level of Kit Use |
G | 4 | 5 | 43.4 | 54.5 | 11.1 | 85 | High |
E | 4 | 5 | 67.0 | 78.0 | 11.0 | 107 | High |
A | 4 | 3 | 56.4 | 61.3 | 4.9 | 128 | High |
C | 4 | 5 | 70.0 | 74.0 | 4.0 | 129 | High |
D | 4 | 5 | 61.0 | 65.0 | 4.0 | 70 | Low |
H | 4 | 2 | 62.0 | 62.0 | 0.0 | 61 | Medium |
B | 4 | 3 | 60.0 | 59.0 | -1.0 | 125 | Medium |
F | 4 | 3 | 65.0 | 61.1 | -3.9 | 127 | Medium |
M | 8 | 5 | 73.0 | 87.0 | 14.0 | 81 | Low |
L | 8 | 5 | 57.0 | 67.0 | 10.0 | 73 | Low |
J | 8 | 3 | 61.0 | 62.0 | 1.0 | 93 | Low |
I | 8 | 3 | 70.1 | 70.6 | -0.5 | 128 | Low |
O | 8 | 5 | 52.4 | 51.4 | -1.0 | 69 | Medium |
K | 8 | 5 | 84.0 | 80.0 | -4.0 | 130 | Medium |
N | 8 | 3 | 62.1 | 56.9 | -5.2 | 16 | Low |
P | 8 | 2 | 68.0 | 60.0 | -8.0 | 53 | Low |
Project 7 (K- 8 Science/ Mathematics)
The study, one of the strongest submitted to HRI, made comparisons of students' test scores on
the SAT- 9 open- ended assessment, grouping students by the number of years they had a LSC
trained teacher. The analysis was done separately for two grade levels, 5th and 7th grades with
over 1000 students participating at each grade level. Additionally, students' reading test scores
(3rd and 4th grade respectively for the two analyses) were used to control for initial differences in student abilities.
As can be seen in Table 11, students at the 5th grade level who had a LSC trained teacher for one
or two years outperformed students who had never had a LSC trained teacher by about 3.5 points
(.17 standard deviations).
|