Study Offers Mixed Results On Impact of High-Stakes Tests

By Debra Viadero, January 28, 2004

Efforts in more than half the states to tie major consequences to students’ test scores are translating into academic gains, according to the latest in a series of studies on the policy approach known as high-stakes testing.

Or then again, maybe they’re not.

The report, “Reconsidering the Impact of High-Stakes Testing,” is available from the Education Policy Analysis Archives.

The study, published this month in the online journal Education Policy Analysis Archives, draws on eight years of national testing data to compare states with traditional “low stakes” testing policies against those with “high stakes” systems. Under high-stakes policies, students’ scores are used to decide which teachers or schools win cash bonuses, whether students graduate or move on to the next grade, or which schools are subject to takeover by their districts or states.

The report follows half a dozen other studies over the past year that have used similar techniques to evaluate the effects of such accountability systems. (“Study Finds Higher Gains in States With High-Stakes Tests,” April 16, 2003).

“I was intrigued by the fact that different researchers with different ideological stances were coming to different conclusions from the same data,” said Henry I. Braun, the author of the new study. “I was also motivated by a sense that the world of research is very complex and we are not, in our research worlds, respectful enough of that complexity.”

Sorting It Out

Mr. Braun, a statistician with the Princeton, N.J.-based Educational Testing Service, used four different methods to compare changes in states’ scores on National Assessment of Educational Progress mathematics tests between 1992 and 2000.

The comparisons pitted the 18 states that some previous researchers have identified as having high-stakes systems against 32 with lower-pressure accountability systems.

Looking first at overall changes in the states’ 4th and 8th grade test scores over that period, Mr. Braun, like most of his predecessors, found that students’ academic gains were greater in states, such as Texas and North Carolina, that had high-pressure testing systems.

What’s more, he said, the trend could not be explained by statistical errors or the fact that some of the states showing the biggest improvements had also been excluding growing percentages of special education students from the tests.

In the 4th grade, the difference in mean scores between the high-stakes and low-stakes states was 4.3 score points; in 8th grade, it was 3.99 score points.

The opposite occurred, though, when Mr. Braun took a look at how cohorts of students fared on the tests over time. (He compared 4th graders’ scores with the 8th grade scores in the same states four years later.)

That time around, the improvements in academic achievement were greater—albeit to a lesser degree—in the states with low-pressure testing systems.
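To see how the same scores can point in opposite directions, consider a minimal sketch of the two comparisons. Everything in it is an assumption made for illustration: the four states, their scores, the field names, and the single four-year window are invented and are not Mr. Braun's data or code. It also assumes that 4th and 8th grade scores sit on a common scale, so that subtracting one from the other is meaningful.

```python
from statistics import mean

# Hypothetical scale scores, invented for illustration only.
# g4_t0: 4th grade score in the starting year
# g4_t1: 4th grade score four years later (a different group of children)
# g8_t1: 8th grade score four years later (roughly the same children as g4_t0)
STATES = [
    {"name": "A", "high_stakes": True,  "g4_t0": 210, "g4_t1": 218, "g8_t1": 262},
    {"name": "B", "high_stakes": True,  "g4_t0": 214, "g4_t1": 220, "g8_t1": 268},
    {"name": "C", "high_stakes": False, "g4_t0": 220, "g4_t1": 223, "g8_t1": 275},
    {"name": "D", "high_stakes": False, "g4_t0": 224, "g4_t1": 227, "g8_t1": 279},
]


def cross_sectional_gain(state):
    """Change in the 4th grade score between two testing years."""
    return state["g4_t1"] - state["g4_t0"]


def cohort_gain(state):
    """Change from a 4th grade score to the 8th grade score four years later."""
    return state["g8_t1"] - state["g4_t0"]


def gap(states, gain_fn):
    """Mean gain in high-stakes states minus mean gain in low-stakes states."""
    high = [gain_fn(s) for s in states if s["high_stakes"]]
    low = [gain_fn(s) for s in states if not s["high_stakes"]]
    return mean(high) - mean(low)


if __name__ == "__main__":
    print(gap(STATES, cross_sectional_gain))  # 4: high-stakes states gain more
    print(gap(STATES, cohort_gain))           # -2: low-stakes states gain more
```

In this made-up example the grade-level comparison favors the high-stakes states by 4 points, while the cohort comparison favors the low-stakes states by 2 points. The pattern is chosen to mirror the direction of the findings described above, not their size.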

Mr. Braun said the differing results didn’t surprise him.

“You cannot look at high-stakes testing in isolation from other things going on in the state,” he said. “Many education reforms can be assisted or thwarted by other education reforms going on at the same time.”

In an effort to take a broader look, Mr. Braun reconfigured the data to factor in a measure that rated states on their education activism. It assigned states grades based on whether they had enacted—or were about to enact—22 school improvement efforts, such as professional standards for teachers or subject-matter standards.

But he found little correlation between the level of states’ education activism and their students’ test-score changes over time.

Mr. Braun also looked at changes in scores for the bottom 25 percent of students in each of the states. Gains were greater in the states that put more pressure on students or schools for test-score improvements.

“All of this is about which states you include and which states you do not include,” said David C. Berliner, an education professor at Arizona State University in Tempe whose own study on high-stakes testing helped spark the spate of research on the subject. (“Reports Find Fault With High-Stakes Testing,” Jan. 8, 2003).

For his study, co-written with Audrey L. Amrein, Mr. Berliner compared states’ academic gains against the national average. He and Ms. Amrein found that most of the high-pressure states saw decreases in 4th grade math scores after adopting their testing programs. At the 8th grade level, a majority of high-stakes states gained relative to the national average.

Referring to Mr. Braun, Mr. Berliner added: “He was able to find results, but my guess is that our study and everybody else’s is still going to be subject to criticism.”

A version of this article appeared in the January 28, 2004 edition of Education Week as Study Offers Mixed Results On Impact of High-Stakes Tests
