This summer, as students enjoyed their vacations, education officials in many states were busy handling sweeping investigations into teacher cheating. In just one school district in Atlanta, at least 178 teachers and principals were implicated in a wide-scale falsification of student test scores. They had taken students’ standardized test sheets, erased wrong answers, and replaced them with the right ones. One teacher told investigators that the district was “run like the mob” and that she was afraid of retaliation if she didn’t participate.
The cheating in Atlanta was uncovered in part thanks to two simple checks that states can do to look for suspicious test results.
The same machines that grade the penciled-in bubbles of standardized tests can also tally how many answers on the tests have been erased and changed from wrong to right. The technique, called “erasure analysis,” flags suspicious patterns of answers that may indicate teachers have tampered with answer sheets to inflate their students’ scores.
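As a rough illustration of how such flagging might work, the sketch below compares each classroom's average number of wrong-to-right erasures per student against all other classrooms and flags the outliers. The function name, the data layout, and the cutoff of three standard deviations are assumptions made for this example, not the method any state or testing vendor is known to use.

```python
from statistics import mean, pstdev

def flag_erasures(wtr_by_class, threshold=3.0):
    """Flag classrooms with unusually many wrong-to-right erasures.

    wtr_by_class maps a classroom name to a list of per-student
    wrong-to-right erasure counts (hypothetical layout).
    threshold is an assumed cutoff, in standard deviations.
    """
    # Average wrong-to-right erasures per student in each classroom.
    averages = {name: mean(counts) for name, counts in wtr_by_class.items()}
    values = list(averages.values())
    overall = mean(values)
    spread = pstdev(values)
    if spread == 0:
        return []  # every classroom looks the same; nothing stands out
    # Keep classrooms far above the overall average.
    return sorted(
        name for name, avg in averages.items()
        if (avg - overall) / spread > threshold
    )
```

A flag from a check like this would only mark a classroom for a closer look. With very few classrooms, a single outlier inflates the spread enough that nothing can exceed the cutoff, so a real screening would need a large comparison group.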
In Atlanta, students’ test scores also jumped or dropped from year to year in surprising and unlikely ways. Checking for such dramatic swings provides another red flag for potential teacher cheating.
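A check for dramatic swings could be sketched along these lines. The function name, the input format (one average score per school per year), and the 15-point cutoff are all hypothetical choices for illustration; an actual screening would set its cutoff from the historical distribution of score changes.

```python
def flag_score_swings(scores_by_school, max_change=15.0):
    """Return schools whose average score moved more than max_change
    points between consecutive years, with the offending changes.

    scores_by_school maps a school name to its yearly average scores,
    oldest first (hypothetical layout). max_change is an assumed cutoff.
    """
    flagged = {}
    for school, yearly in scores_by_school.items():
        # Change from each year to the next.
        changes = [later - earlier for earlier, later in zip(yearly, yearly[1:])]
        big = [change for change in changes if abs(change) > max_change]
        if big:
            flagged[school] = big
    return flagged
```

Both jumps and drops are counted, since a score that leaps one year and falls back the next is the pattern described above.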
Experts view these kinds of screenings as crucial. Teachers and principals are facing increasing incentives to cheat, since student scores are being used to determine whether schools get funding and how teachers and principals get paid. Secretary of Education Arne Duncan recently told USA Today that states should require such screening.
Some states are indeed becoming more aggressive. Florida and Illinois instituted new, more rigorous screenings for statewide tests this year.
Yet many of the largest states have lagged.
California stands out. The state’s Department of Education conducted erasure analysis on tests for several years but ended the program in 2009. John Boivin, an administrator in the Education Department, said the screening was stopped as part of massive cuts after a drastic budget shortfall two years ago. (Boivin also said the erasure analysis cost the state $105,000 per year.)
Deborah Sigman, California’s deputy superintendent of public instruction, said the state ultimately had to choose between eliminating some of the students’ tests themselves and scaling back test oversight.
Sigman says that the state has always uncovered more cheating via on-the-ground reports, which the state now relies on. According to Sigman, the number of reports of teacher cheating is growing, from 69 three years ago to 263 this past year.
But Boston College Professor Walt Haney, an expert on testing, said screening is critical.
“Any large districts or state that didn’t employ those techniques would have its head in the sand,” said Haney. “Given the number of large cheating scandals that have emerged over the last 20 years, any large institution would be derelict in not instituting some of the widely documented techniques for identifying cheating.”
Sigman says California plans to reinstate erasure analysis as soon as it has the money to do so—perhaps as early as this spring.
“This is a real priority,” she said. And, she noted, “It’s kind of a small investment.”
Other states have also been slow to act. In Pennsylvania and New Jersey, officials screened for suspicious levels of erasures, only to let results gather dust for years, until local journalists investigated and published the results themselves.
“People don’t want to know,” said Jennifer Jennings, a sociology professor at New York University who specializes in education. “People would rather hold their noses and hope the scores mean what they think they mean. At every level of the system there are adults who have an interest in the scores going up.”
Screenings such as erasure analysis don’t provide proof on their own that cheating has occurred. Because they depend on statistical analysis, they can only say what results are typical and what results are highly improbable.
When USA Today consulted statisticians about the 2009 test results from one elementary school in Washington, D.C., they were told that having that many erasures by chance was less likely than winning the Powerball grand prize with a $1 lottery ticket.
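To give a sense of how a result can be labeled "highly improbable," the sketch below computes a simple binomial tail probability: the chance that at least k of n erased answers end up correct if each erasure independently turns out right with probability p. This model and its parameter names are assumptions for illustration only. It is not the calculation the statisticians in the USA Today analysis performed, and real erasures are unlikely to be fully independent.

```python
from math import comb

def tail_probability(n, k, p):
    """Probability of at least k wrong-to-right changes among n erasures,
    assuming each erasure independently ends up right with probability p
    (a simplifying assumption for this sketch)."""
    return sum(
        comb(n, i) * p**i * (1 - p) ** (n - i)
        for i in range(k, n + 1)
    )
```

Under these assumptions, a very small result says only that chance is a poor explanation for the pattern. It does not say what the explanation is, which is why flagged schools still require investigation.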
Screening methods do flag schools that are later cleared on further investigation, but they also consistently identify cheating that might otherwise have gone undiscovered.
In New Jersey, the state Department of Education has conducted erasure analysis on state tests since 2008. But the department did not investigate any of the schools flagged in these reports until this summer, when the Asbury Park Press successfully sued the department to obtain a copy of the analysis and made it public. (Spokesman Justin Barra told ProPublica that while the department did not share the reports with districts, the state did use them to decide whether to send observers on test days.)
Following the Asbury Park Press exposé, 34 schools are now under investigation.
Pennsylvania’s Department of Education received an erasure analysis report in 2009 that flagged dozens of schools for potential cheating, but it left the results untouched for two years and did not notify school districts about the anomalies. After a reporter at an education blog obtained the report and made it public, the state ordered initial investigations at 89 schools.
In Texas, assessment officials said that the state’s education authority has done erasure analysis reports for at least 10 years. But up until this year, the state had not used them to investigate schools unless school members or test monitors on the ground also submitted cheating complaints.
Texas’ policy continued despite an update to the state’s education code in 2007 requiring the adoption of statistical measures to screen for cheating and a procedure for investigating schools with suspicious results. Four years later, Texas is finally putting those measures in place.
Criss Cloudt, Texas’ associate commissioner of assessment and accountability, said that Texas has relied on its rigorous test security measures, including seating charts, honor pledges, and legally binding oaths that test administrators must sign, to prevent cheating.
Testing experts say that test security, while important, is not a substitute for screening measures.
Other states are also beginning more rigorous screenings.
In a report released last week, New York state’s Department of Education recommended instituting a multi-part screening of state tests, looking for score jumps, unlikely patterns of answers, and high levels of erasures. North Carolina is also considering more regular erasure analysis.
Florida has screened a subset of high-stakes tests since 2004, and it implemented a state-of-the-art analysis of all statewide test results this year.
“Generally, from leadership we have gotten a good but nervous response. This obviously is not the kind of work that people relish doing,” said Kris Ellington, Florida’s deputy commissioner for accountability research and measurement.
“Other than that, the discomfort with it, there aren’t any drawbacks.”
Of course, instituting a more rigorous screening of test results means facing up to how widespread teacher cheating actually is.
Florida’s new analysis revealed higher numbers of suspicious results for both student cheating and teacher cheating than in previous years, said Ellington.
But Ellington said that the problematic tests represented a tiny fraction of the tests administered—and that the screening has given Florida confidence that the remainder of their scores are valid.
“We want to make sure that no corruption is part of this process,” she said. “We don’t believe that there are big pockets of problems. But we can’t just live in our happy place and believe it. We have to know it.”
Copyright 2011, ProPublica Inc. Republished with permission from ProPublica.