Teaching Profession

Research Detects Bias in Classroom Observations

By Stephen Sawchuk — May 13, 2014 6 min read
Principal Scott Steckler, rear, observes 4th grade teacher Lora Johnson as she works with her students at George Cox Elementary in Gretna, La., in 2012. Such classroom observations are a key component of many states' teacher-evaluation systems.
  • Save to favorites
  • Print

As the rubber hits the road in the implementation of states’ revamped teacher-evaluation systems, new research illuminates a troubling source of bias. School principals—when conducting classroom observations—appear to give some teachers an unfair boost based on the students they’re assigned to teach, rather than judging them solely on their instructional savvy.

Observers tended to give the best marks to teachers whose students already were high-performing, while those teachers working with academically struggling students were penalized, according to an analysis of thousands of observation scores.

The report, released today by the Brown Center on Education Policy at the Brookings Institution, a Washington think tank, raises a host of new concerns about the nation’s evolving systems for grading teachers. And it suggests that, in trying to manage the technical and political challenges posed by test-score-based approaches to evaluation, such as “value added” methods, policymakers may be missing problems in other features of the systems.

“It’s very worrisome. It’s a huge bias,” said Grover J. “Russ” Whitehurst, the director of the Brown Center. “The criticism about value-added is certainly something we need to attend to, but a lot of work has helped reduce or eliminate that bias. None of that’s being done for observation scores.”

The report recommends that districts try to level the playing field by adjusting teachers’ observation scores based on the demographics of the students they instruct.

Among other things, the report also recommends scrapping policies that permit teachers to be judged based on the progress of all students in the school.

District Data

Spurred largely by federal efforts, such as the Race to the Top competition, dozens of states rushed to introduce new teacher-evaluation systems, most based on a combination of test scores and classroom observations.

Both in news stories and in research annals, most of the ink spilled on teacher evaluation has focused on value-added approaches, which estimate each teacher’s ability to boost his or her students’ standardized-test progress. Observations, in which administrators visit classrooms and rate the quality of a teacher’s instruction against a framework, are comparatively understudied.

For their analysis, Mr. Whitehurst and two colleagues examined teacher-evaluation data from four urban school districts ranging in size from 25,000 to 110,000 students. They looked at one to three years of data, analyzing the relationships among the various evaluation components, teachers’ overall scores, and the demographics of the students they taught.

The specific weights assigned to each component varied across the districts, but classroom observations counted for at least 40 percent of each teacher’s overall score. In most cases, it was more heavily weighed: More than three-quarters of the teacher sample taught in grades or subjects not assessed with standardized tests, and for such teachers, observations are typically weighted more heavily.

Overall, the researchers found that the components’ technical properties were consistent with those studied in the massive Measures of Effective Teaching study sponsored by the Seattle-based Bill & Melinda Gates Foundation. But they also discovered some troubling patterns.

For one, the researchers found a strong statistical link between teachers’ observation scores and the achievement levels of the students they instructed.

Just 9 percent of teachers of the lowest-achieving students received a top observation score, for example, while 29 percent of such teachers received a ranking in the bottom 20 percent. By contrast, 37 percent of teachers of the highest performing students got a top observation score, and only 11 percent received the lowest score.

News reports indicate that teachers in some districts, including the school system for the District of Columbia, have fretted about similar patterns.

Fix Proffered

The cause of the bias isn’t examined in the report, but the authors surmised that some often-measured teaching skills—such as leading a discussion with lots of questioning—may be more difficult with students who are underprepared or not fluent in English.

And the study suggests that the problem may be fixable. The authors applied a handicap of sorts based on student demographics, giving a boost to teachers with many lower-performing students and depressing the scores of those with students who tend to score well. That method would more evenly distribute observation scores across teachers of different groups of students, the paper shows.

Such an idea could be controversial for states and districts to implement because of its assumptions about how much various subgroups of students can progress. But without it, Mr. Whitehurst contends, teacher-evaluation systems may have unintended consequences—such as making working with the neediest students less attractive for teachers.

“Either we have to have observations designed to be immune from this kind of bias, or we have to adjust for it,” Mr. Whitehurst said. “I don’t see any other way out, if we want teachers to teach where we need them to teach, and to be valued for what they do.”

Other observers said that the new research adds yet another question mark to the contested policy push for revamped teacher evaluations.

“I think this is going to be another bad-news story for the supporters of teacher evaluation, and for the [Obama] administration,” said Michael Petrilli, the executive vice president of the Thomas B. Fordham Institute, a Washington think tank. “Rather than trying to find some kind of technocratic solution, we need to get back to common sense—trusting principals to make judgments. If we don’t do that, none of our school-reform efforts are going to work.”

Teachers’ unions have tended to be far more critical of the value-added approach than classroom observations.

Segun Eubanks, the director of teacher quality for the 3 million-member National Education Association, said that, on the one hand, the new analysis from Brookings confirms a general sense among teachers that they’re put at a disadvantage by choosing to work with the most at-risk students.

On the other hand, the notion of adjusting observation scores seems premature without further research, he said.

“My first instinct would be to help to put observational data into a context-specific realm. You need to train folks to see what teaching performance looks like when you’re teaching students who have low achievement,” Mr. Eubanks said. “The look-fors are different; the way the standards are applied are different. We have to find ways to do that before we start going for handicapping.”

Schoolwide Gauge Questioned

The Brookings report also examines a host of other aspects of the newly designed teacher-evaluation systems.

Consistent with the Gates research findings, it suggests that more observations improve the accuracy of the systems, and that outside observers tend to give ratings that are more predictive of teaching quality than principals.

Also, the report takes aim at evaluation systems that use a “schoolwide” value-added measure, in which all teachers are judged in part on the progress of the school as a whole. Such a policy, the report notes, tends to bring down the scores of even good teachers in schools with lots of low-achieving students—and to inflate the scores of weaker teachers who were in high-performing schools.

“It creates a system that is demonstrably and palpably unfair to teachers, given that they have little control over the performance of the whole school,” the report states.

A version of this article appeared in the May 21, 2014 edition of Education Week as Bias Detected in Classroom Observations

Events

This content is provided by our sponsor. It is not written by and does not necessarily reflect the views of Education Week's editorial staff.
Sponsor
School Climate & Safety Webinar
Belonging as a Leadership Strategy for Today’s Schools
Belonging isn’t a slogan—it’s a leadership strategy. Learn what research shows actually works to improve attendance, culture, and learning.
Content provided by Harmony Academy
This content is provided by our sponsor. It is not written by and does not necessarily reflect the views of Education Week's editorial staff.
Sponsor
School & District Management Webinar
Too Many Initiatives, Not Enough Alignment: A Change Management Playbook for Leaders
Learn how leadership teams can increase alignment and evaluate every program, practice, and purchase against a clear strategic plan.
Content provided by Otus
This content is provided by our sponsor. It is not written by and does not necessarily reflect the views of Education Week's editorial staff.
Sponsor
Artificial Intelligence Webinar
Beyond Teacher Tools: Exploring AI for Student Success
Teacher AI tools only show assigned work. See how TrekAi's student-facing approach reveals authentic learning needs and drives real success.
Content provided by TrekAi

EdWeek Top School Jobs

Teacher Jobs
Search over ten thousand teaching jobs nationwide — elementary, middle, high school and more.
View Jobs
Principal Jobs
Find hundreds of jobs for principals, assistant principals, and other school leadership roles.
View Jobs
Administrator Jobs
Over a thousand district-level jobs: superintendents, directors, more.
View Jobs
Support Staff Jobs
Search thousands of jobs, from paraprofessionals to counselors and more.
View Jobs

Read Next

Teaching Profession Quiz Teachers, How Does Your Morale Compare With Your Colleagues'? Take Our Quiz
Take our online quiz and compare your morale score with that of teachers nationwide.
Education Week Staff
1 min read
New Teacher Support Coaches engross in a discussion during New Teacher Support Coaches Professional Learning session on November 7, 2025 at Center for Professional Development in Fresno.
Coaches who support new teachers meet on November 7, 2025, at the Fresno, Calif., school district's Center for Professional Development. Nurturing the morale of new teachers is a big challenge for schools across the country.
Andri Tambunan for Education Week
Teaching Profession Gen Z Teachers Grew Up With Tech. Now They're Seeking Better Boundaries for Students
Gen Z teachers grew up in an era of unbridled tech. It shapes how they approach classroom technology.
4 min read
Katrina tk
Katrina Sacurom, a 5th grade teacher, huddles with the Shawnee Trail Elementary School journalism crew to go over how their projects are progressing on Feb. 3, 2026 in Frisco, Texas. She says she wants her students to learn to use technology thoughtfully and has looked for ways to tailor it to be meaningful, not mindless.
Kaylee Domzalski/Education Week
Teaching Profession Why Are Teachers in This Region So Miserable?
It's not clear why New England and Mid-Atlantic teachers feel so burned out. But some fixes could help.
9 min read
Winter in Lowville, N.Y. on Nov. 29, 2025. “There’s a lot of things here in our area that would certainly impact teacher morale if you let it,” said Zippel Principal Christopher Hallett. “We are very conscious of it here in our region. We are isolated in many, many ways: It’s a low-income population in a very rural area, so as you can imagine, there’s not a lot to do. Getting people to think outside the box about their own mental health and self-care is pretty important up here.”
Winter in Lowville, N.Y. on Nov. 29, 2025. For the past three years, teachers in the Northeast—including New York state—have reported significantly poorer morale than teachers in the West, Midwest, and South, according to the EdWeek Research Center’s annual survey. Said one Maine principal, Christopher Hallett: “There’s a lot of things here in our area that would certainly impact teacher morale if you let it."
Cara Anna/AP
Teaching Profession Download Insights for School Leaders: How to Better Support Teachers
EdWeek's downloadable guide offers tips to principals on how to improve the morale and working conditions of educators.
1 min read