Medical AI falters when assessing patients it hasn’t seen. Source: Nature Briefing. “Physicians rely on algorithms for personalized medicine — but an analysis of schizophrenia trials shows the tools fail to adapt to new data sets.”

  • NEWS
  • 11 January 2024

Medical AI falters when assessing patients it hasn’t seen

Physicians rely on algorithms for personalized medicine — but an analysis of schizophrenia trials shows the tools fail to adapt to new data sets.

Coloured Positron Emission Tomography brain scans of a schizophrenic shown at bottom versus normal patient at top
Scans showing brain activity during speech for a person with schizophrenia (bottom) and one without (top).Credit: Wellcome Centre Human Neuroimaging/Science Photo Library

Computer algorithms that are designed to help doctors treat people with schizophrenia do not adapt well to fresh, unseen data, a study has found.

Such tools — which use artificial intelligence (AI) to spot patterns in large data sets and predict how individuals will respond to a particular treatment — are central to precision medicine, in which health-care professionals try to tailor treatment to each person. In work published on 11 January in Science1, researchers showed that AI models can predict treatment outcomes with high accuracy for people in a sample that they were trained on. But their performance drops to little better than chance when applied to subsets of the initial sample, or to different data sets.

To be effective, prediction models need to be consistently accurate across different cases, with minimal bias or random outcomes.

“It’s a huge problem that people have not woken up to,” says study co-author Adam Chekroud, a psychiatrist at Yale University in New Haven, Connecticut. “This study basically gives the proof that algorithms need to be tested on multiple samples.”

Algorithm accuracy

The researchers assessed an algorithm that is commonly used in psychiatric-prediction models. They used data from five clinical trials of antipsychotic drugs, involving 1,513 participants across North America, Asia, Europe and Africa, who had been diagnosed with schizophrenia. The trials, which were carried out between 2004 and 2009, measured participants’ symptoms before and four weeks after taking one of three antipsychotic drugs (or compared the effects of different doses of the same drug).

The team trained the algorithm to predict improvements in symptoms over four weeks of antipsychotic treatment. First, the researchers tested the algorithm’s accuracy in the trials in which it had been developed — comparing its predictions with the actual outcomes recorded in the trials — and found that the accuracy was high.

Then they used several approaches to evaluate how well the model generalizes to new data. The researchers trained it on a subset of data from one clinical trial and then applied it to another subset from the same trial. They also trained the algorithm on all the data from one trial — or a group of trials — and then measured its performance on a separate trial.

The model performed poorly in these tests, generating seemingly almost random predictions when applied to a data set that it had not been trained on. The team repeated the experiment using a different prediction algorithm, but got similar results.

Better testing

The study’s authors say that their findings highlight how clinical prediction models should be tested rigorously on large data sets to ensure that they are reliable. A systematic review2 of 308 clinical-prediction models for psychiatric outcomes found that only about 20% of models underwent validation on samples other than the ones on which they were developed.

“We should think about it much more like drug development,” says Chekroud. Many drugs show promise in early clinical trials, but falter in the later stages, he explains. “We do have to be really disciplined about how we build these algorithms and how we test them. We can’t just do it once and think it’s real.”

doi: https://doi.org/10.1038/d41586-024-00094-9

References

  1. Chekroud, A. L. et al. Science 383, 164 (2024).Article Google Scholar 
  2. Meehan, A. J. et al. Mol. Psychiatry 27, 2700–2708 (2022).Article Google Scholar 

Download references

Reprints and permissions

Latest on:

NeuroscienceMachine learningPersonalized medicineProtracted neuronal recruitment in the temporal lobe of young childrenARTICLE 2D materials ratchet up biorealism in computingNEWS & VIEWS FORUM Minute-scale oscillatory sequences in medial entorhinal cortexARTICLE 

Nature Careers

Jobs

Related Articles

Subjects

Sign up to Nature Briefing

An essential round-up of science news, opinion and analysis, delivered to your inbox every weekday.Email addressYes! Sign me up to receive the daily Nature Briefing email. I agree my information will be processed in accordance with the Nature and Springer Nature Limited Privacy Policy.Sign up

Unknown's avatar

About michelleclarke2015

Life event that changes all: Horse riding accident in Zimbabwe in 1993, a fractured skull et al including bipolar anxiety, chronic fatigue …. co-morbidities (Nietzche 'He who has the reason why can deal with any how' details my health history from 1993 to date). 17th 2017 August operation for breast cancer (no indications just an appointment came from BreastCheck through the Post). Trinity College Dublin Business Economics and Social Studies (but no degree) 1997-2003; UCD 1997/1998 night classes) essays, projects, writings. Trinity Horizon Programme 1997/98 (Centre for Women Studies Trinity College Dublin/St. Patrick's Foundation (Professor McKeon) EU Horizon funded: research study of 15 women (I was one of this group and it became the cornerstone of my journey to now 2017) over 9 mth period diagnosed with depression and their reintegration into society, with special emphasis on work, arts, further education; Notes from time at Trinity Horizon Project 1997/98; Articles written for Irishhealth.com 2003/2004; St Patricks Foundation monthly lecture notes for a specific period in time; Selection of Poetry including poems written by people I know; Quotations 1998-2017; other writings mainly with theme of social justice under the heading Citizen Journalism Ireland. Letters written to friends about life in Zimbabwe; Family history including Michael Comyn KC, my grandfather, my grandmother's family, the O'Donnellan ffrench Blake-Forsters; Moral wrong: An acrimonious divorce but the real injustice was the Catholic Church granting an annulment – you can read it and make your own judgment, I have mine. Topics I have written about include annual Brain Awareness week, Mashonaland Irish Associataion in Zimbabwe, Suicide (a life sentence to those left behind); Nostalgia: Tara Hill, Co. Meath.
This entry was posted in Uncategorized and tagged , , , , . Bookmark the permalink.

Leave a comment