Artificial Intelligence What the Research Says

AI Changes Its Feedback on Students’ Writing When It Knows Their Race, Gender

By Sarah D. Sparks — June 18, 2026 6 min read
A silhouette of a girl's profile has the quote "I love your confidence in expressing your opinion!" on top of it on torn pieces of paper. She is facing a silhouette of a boy's profile that has the quote "Try providing additional evidence or examples from the article to support this claim." on top of it, also on torn pieces of paper.
  • Save to favorites
  • Print

Artificial intelligence-based writing coaches have gained popularity as a way to give teachers quick feedback on hundreds of student drafts, reducing one of their most time-intensive tasks.

But efforts to make that feedback more personalized may backfire, new research on the tools shows.

Consider this example:

See also

Female teacher helps young boy to do his school work.
iStock/Getty

“You raise a compelling argument about the potential benefits of discovering alien life,” one AI writing tutor tells a high-achieving student. “Consider expanding on this thought to address potential counterarguments.”

By contrast, a struggling writer gets feedback limited to pointing out spelling mistakes and a suggested rewrite for an unclear sentence. They don’t get prompting on how to strengthen their writing by analyzing counterfactuals.

The problem? Both writing samples are identical. The only difference was the description of the students themselves entered in with the sample in a new study of AI writing feedback by researchers at Stanford University’s Institute for Human-Centered AI.

Feedback for English learner

LLM biases tearsheets ell

Feedback for student with learning disabilities

LLM biases tearsheets learningdisabilities

Feedback for student with unspecified attributes

LLM biases tearsheets unspecified

Providing information about a student’s background—their race, gender, language, disability status, achievement level, or even motivation—can significantly change the kind and quality of writing feedback the AI tool provides, according to the study by doctoral researchers Mei Tan and Lena Phalen. It was first presented at the International Learning Analytics and Knowledge conference in Bergen, Norway, in May.

“Teachers have discernment that LLMs simply don’t,” Phalen said. “So it is really important that we know, in this pursuit of ‘personalization’ for students, what personalization looks like according to large language models, and whether or not that is really aligning with the kind of personalization we want to see in pedagogy.”

Tan, an education data scientist, and Phalen, a curriculum and teacher-training researcher, asked AI models to provide writing feedback for 600 8th grade persuasive essays from a nationally representative data set. The study looked at GPT-4o, GPT-3.5-Turbo, and Meta’s Llama-3.3 70B, and Llama-3.1 8B, commonly used to undergird educational tools like MagicSchool and School AI. It did not include the newest AI models, like ChatGPT 5.5, Meta’s Muse Spark, or Anthropic’s Claude Opus 4.8.

At first, the researchers didn’t include any of the students’ background characteristics in their prompts requesting feedback. Later, they randomly assigned the essays to specific descriptions, including gender, race, academic achievement level, disability, and level of English fluency. And in a third round, the researchers deleted the descriptive characteristics but assigned names associated with specific genders or races.

Writers described as students of color in the study were more likely to be exhorted to “polish” their writing, researchers found, and the feedback seemed shot through with cultural stereotypes—such as framing critiques for Asian students around academic responsibility and respect, while assuming limited English ability for Latino students and orienting feedback around family and culture.

Feedback for white student

LLM biases tearsheets white

Feedback for Latino student

LLM biases tearsheets hispanic

And while the models focused on direct critiques and to-do tasks for students described as male, they were more likely to use emotional language like “love” or “wonderful” for writers described as female.

For “unmotivated” students, the AI coaches used more praise and affirmations in feedback—but also focused more on basic edits like spelling or grammar. Feedback for “motivated” writers more often pushed students to improve their arguments or structure.

Feedback for male student

AI LLM biases tearsheets gender male

Feedback for female student

AI LLM biases tearsheets gender female

The study comes amid rising concern from educators and lawmakers about students’ use of AI tools in the classroom and their effect on students’ critical thinking skills and the potential for the technology to facilitate cheating.

“AI is both fascinating and powerful in the kind of feedback that it can give,” said Larry Berger, the chief executive officer of the education technology and curriculum provider Amplify, “but also it can make really fundamental pedagogical mistakes, and I think it can also be culturally insensitive in all kinds of ways.” (Amplify’s tools and materials were not reviewed in the study.)

This is far from the first time that AI models have shown what’s known as algorithmic biases. Prior studies repeatedly have found that generative AI tools can replicate or magnify stereotypes because they have been trained on skewed historic data.

“These technologies are essentially black boxes,” Phalen said, “so we don’t know how biases may be reproduced and what access to information about students [AI tools] have as a result of integration into school settings,” like learning-management systems or other interfaces that might include background characteristics.

One factor is that LLMs undergirding AI tools don’t filter information in the same way as humans do, so an AI system won’t ignore irrelevant data—like a student’s Spanish-sounding name—in the way a teacher can. All of the context included in a prompt is considered relevant to the task, even if a student’s race or gender, for example, has no bearing on their writing skills.

Relationships crucial to student writing

Writing can be especially vulnerable to biased AI feedback, Berger said, because it can be more subjective and relationship-driven than learning in other content areas: Teachers’ or classmates’ responses can drive students’ motivation to write.

“If I’m putting myself out there and sharing my ideas—even if it’s just two paragraphs about photosynthesis—I’m also asking, ‘Did the teacher think this communicated what I wanted it to? Am I good at writing? What do my classmates think of this?’” Berger said. “If [AI feedback] bias comes along at that moment of vulnerability and it is ungenerous to a kid, or gets that feedback wrong, it’s potentially offering a troubling answer to those questions.”

See Also

Katrina tk
Katrina Sacurom, a 5th grade teacher, huddles with the Shawnee Trail Elementary School journalism crew to go over how their projects are progressing on Feb. 3, 2026 in Frisco, Texas. She says she wants her students to learn to use technology thoughtfully and has looked for ways to tailor it to be meaningful, not mindless.
Kaylee Domzalski/Education Week

Katrina Sacurom, a 5th grade teacher at Shawnee Trail Elementary School in Frisco, Texas, has developed and regularly uses her own AI-based writing coach in classes. She said the tools can help a teacher develop a norm for her own students’ writing skills, but they shouldn’t replace teacher judgment.

“Teachers have the wherewithal and the intimate knowledge of students and how to deem what is and isn’t appropriate as it relates to feedback,” she said.

For example, Sacurom said she would never use student motivation as a gauge for writing proficiency, as it might change significantly depending on the writing task or topic. Improving student motivation involves conversations with the student and nuance that AI probably can’t provide.

She avoids referencing context about specific students, instead prompting the program to give feedback based on her own writing rubric and specific skill-related goals for a given assignment.

“I might specify, ‘hey, this student typically generates one to three sentences for short constructed responses. Our goal is to grow the student’s output up to four to five sentences.’ Things like that, I would believe, are useful,” Sacorum said.

Events

This content is provided by our sponsor. It is not written by and does not necessarily reflect the views of Education Week's editorial staff.
Sponsor
College & Workforce Readiness Webinar
The Road to Opportunity: Making CTE Accessible for All
The most valuable CTE happens off campus. For too many students, transportation is the barrier that keeps opportunity out of reach.
Content provided by HopSkipDrive
This content is provided by our sponsor. It is not written by and does not necessarily reflect the views of Education Week's editorial staff.
Sponsor
Recruitment & Retention Webinar
New Hire, No Laptop, No Login: Preventing Day-One Disruption
What happens before day one matters. Discover how districts are improving the new hire experience.
Content provided by Frontline Education
Teaching Profession K-12 Essentials Forum Supporting the New K-12 Workforce: What Teachers Need to Stay at School
 Join this free virtual event to discover what teachers say they need to feel supported to stay in classrooms for the long haul.

EdWeek Top School Jobs

Teacher Jobs
Search over ten thousand teaching jobs nationwide — elementary, middle, high school and more.
View Jobs
Principal Jobs
Find hundreds of jobs for principals, assistant principals, and other school leadership roles.
View Jobs
Administrator Jobs
Over a thousand district-level jobs: superintendents, directors, more.
View Jobs
Support Staff Jobs
Search thousands of jobs, from paraprofessionals to counselors and more.
View Jobs

Read Next

Artificial Intelligence At U.S. Senate Hearing, a Call for AI That Protects 'Human Judgment' in Schools
State and company officials want meaningful guardrails around AI use in schools.
4 min read
Delaware Secretary of Education Cynthia Marten meeting with a student while visiting a classroom at the Adams Campus of Oyster-Adams Bilingual School in Washington, Thursday, May 25, 2023. At a U.S. Senate hearing this week, Marten said policymakers should recognize AI's potential while preserving the importance of "human relationships and human judgment."
Delaware Secretary of Education Cynthia Marten visits a classroom at Oyster-Adams Bilingual School in Washington on May 25, 2023. Marten testified on Tuesday before a U.S. Senate subcommittee exploring the role of AI in schools.
Pablo Martinez Monsivais/AP
Artificial Intelligence White House Honors AI Challenge Winners as Tech Backlash Grows
Students and teachers submitted projects that use AI to solve problems in their schools and communities.
First Lady Melania Trump leaves the East Room after hosting the Inaugural Presidential AI Challenge National Champion Awards Ceremony at the White House on Tuesday, June 9, 2026, in Washington.
First lady Melania Trump leaves the East Room after hosting the inaugural Presidential AI Challenge National Champion Awards Ceremony at the White House on June 9, 2026, in Washington.
John McDonnell/AP
Artificial Intelligence Why This Large District Used AI to Simplify Its Org Chart
The school system's organizational chart evolved organically for decades. Can AI bring cohesion?
3 min read
Image of an organizational chart with "prompt."
Getty + Canva
Artificial Intelligence Kids Are Turning to AI Before Adults for Homework Help
Students increasingly turn to AI first for school help, raising trust and safety concerns.
5 min read
Student doing homework at home
Laura Baker/Education Week + Getty