Futurism: No more favoritism! Teachers Using AI to Grade Their Students’ Work Sends a Clear Message: They Don’t Matter, and Will Soon Be Obsolete

School’s Out

May 10, 6:30 AM EDTbyJoe Wilkins

Teachers Using AI to Grade Their Students’ Work Sends a Clear Message: They Don’t Matter, and Will Soon Be Obsolete

The results are dismal.

/ Artificial Intelligence / Ai / Education / Mixtral

Image by Getty / Futurism

Talk to a teacher lately, and you’ll probably get an earful about AI’s effects on student attention spans, reading comprehension, and cheating.

As AI becomes ubiquitous in everyday life — thanks to tech companies forcing it down our throats — it’s probably no shocker that students are using software like ChatGPT at a nearly unprecedented scale. One study by the Digital Education Council found that nearly 86 percent of university students use some type of AI in their work.

That’s causing some fed-up teachers to fight fire with fire, using AI chatbots to score their students’ work. As one teacher mused on Reddit: “You are welcome to use AI. Just let me know. If you do, the AI will also grade you. You don’t write it, I don’t read it.”

While AI might save educators some time and precious brainpower — which arguably make up the bulk of the gig — the tech isn’t even close to cut out for the job, according to researchers at the University of Georgia. While we should probably all know it’s a bad idea to grade papers with AI, a new study by the School of Computing at UG gathered data on just how bad it is.

The research tasked the Large Language Model (LLM) Mixtral with grading written responses to middle school homework. Rather than feeding the LLM a human-created rubric, as is usually done in these studies, the UG team tasked Mixtral with creating its own grading system. The results were abysmal.

Compared to a human grader, the LLM accurately graded student work just 33.5 percent of the time. Even when supplied with a human rubric, the model had an accuracy rate of just over 50 percent.

Though the LLM “graded” quickly, its scores were frequently based on flawed logic inherent to LLMs.

“While LLMs can adapt quickly to scoring tasks, they often resort to shortcuts, bypassing deeper logical reasoning expected in human grading,” wrote the researchers.

“Students could mention a temperature increase, and the large language model interprets that all students understand the particles are moving faster when temperatures rise,” said Xiaoming Zhai, one of the UG researchers. “But based upon the student writing, as a human, we’re not able to infer whether the students know whether the particles will move faster or not.”

Though the UG researchers wrote that “incorporating high-quality analytical rubrics designed to reflect human grading logic can mitigate [the] gap and enhance LLMs’ scoring accuracy,” a boost from 33.5 to 50 percent accuracy is laughable. Remember, this is the technology that’s supposed to bring about a “new epoch” — a technology we’ve poured more seed money into than any in human history.

If there were a 50 percent chance your car would fail catastrophically on the highway, none of us would be driving. So why is it okay for teachers to take the same gamble with students?

It’s just further confirmation that AI is no substitute for a living, breathing teacher, and that isn’t likely to change anytime soon. In fact, there’s mounting evidence that AI’s comprehension abilities are getting worse as time goes on and original data becomes scarce. Recent reporting by the New York Times found that the latest generation of AI models hallucinate as much as 79 percent of the time — way up from past numbers.

When teachers choose to embrace AI, this is the technology they’re shoving off onto their kids: notoriously inaccurate, overly eager to please, and prone to spewing outright lies. That’s before we even get into the cognitive decline that comes with regular AI use. If this is the answer to the AI cheating crisis, then maybe it’d make more sense to cut out the middle man: close the schools and let the kids go one-on-one with their artificial buddies.

More on AI: People With This Level of Education Use AI the Most at Work

About michelleclarke2015

Life event that changes all: Horse riding accident in Zimbabwe in 1993, a fractured skull et al including bipolar anxiety, chronic fatigue …. co-morbidities (Nietzche 'He who has the reason why can deal with any how' details my health history from 1993 to date). 17th 2017 August operation for breast cancer (no indications just an appointment came from BreastCheck through the Post). Trinity College Dublin Business Economics and Social Studies (but no degree) 1997-2003; UCD 1997/1998 night classes) essays, projects, writings. Trinity Horizon Programme 1997/98 (Centre for Women Studies Trinity College Dublin/St. Patrick's Foundation (Professor McKeon) EU Horizon funded: research study of 15 women (I was one of this group and it became the cornerstone of my journey to now 2017) over 9 mth period diagnosed with depression and their reintegration into society, with special emphasis on work, arts, further education; Notes from time at Trinity Horizon Project 1997/98; Articles written for Irishhealth.com 2003/2004; St Patricks Foundation monthly lecture notes for a specific period in time; Selection of Poetry including poems written by people I know; Quotations 1998-2017; other writings mainly with theme of social justice under the heading Citizen Journalism Ireland. Letters written to friends about life in Zimbabwe; Family history including Michael Comyn KC, my grandfather, my grandmother's family, the O'Donnellan ffrench Blake-Forsters; Moral wrong: An acrimonious divorce but the real injustice was the Catholic Church granting an annulment – you can read it and make your own judgment, I have mine. Topics I have written about include annual Brain Awareness week, Mashonaland Irish Associataion in Zimbabwe, Suicide (a life sentence to those left behind); Nostalgia: Tara Hill, Co. Meath.

View all posts by michelleclarke2015 →

Futurism: No more favoritism! Teachers Using AI to Grade Their Students’ Work Sends a Clear Message: They Don’t Matter, and Will Soon Be Obsolete

Teachers Using AI to Grade Their Students’ Work Sends a Clear Message: They Don’t Matter, and Will Soon Be Obsolete

The results are dismal.

About michelleclarke2015

Leave a comment Cancel reply

Archives

Meta

Futurism: No more favoritism! Teachers Using AI to Grade Their Students’ Work Sends a Clear Message: They Don’t Matter, and Will Soon Be Obsolete

Teachers Using AI to Grade Their Students’ Work Sends a Clear Message: They Don’t Matter, and Will Soon Be Obsolete

The results are dismal.

Share this:

Related

About michelleclarke2015

Leave a comment Cancel reply

Archives

Meta