At a glance:
A new AI model called popEVE can predict how likely each variant in a patient’s genome is to cause disease.
The team is testing popEVE in clinical settings to see if it can speed accurate diagnoses of rare, single-variant genetic diseases.
The tool could also help researchers identify new drug targets for the treatment of genetic conditions.
Every human has tens of thousands of tiny genetic alterations in their DNA, also known as variants, that affect how cells build proteins.
Yet in a given human genome, only a few of these changes are likely to modify proteins in ways that cause disease, which raises a key question: How can scientists find the disease-causing needles in the vast haystack of genetic variants?
For years, scientists have been working on genome-wide association studies and artificial intelligence tools to tackle this question. Now, a new AI model developed by Harvard Medical School researchers and colleagues has pushed forward these efforts. The model, called popEVE, produces a score for each variant in a patient’s genome indicating its likelihood of causing disease and places variants on a continuous spectrum.
In a paper published Nov. 24 in Nature Genetics, the scientists show that popEVE can predict whether variants are benign or pathogenic (disease-causing) and which variants lead to death in childhood versus adulthood.
The model was able to identify more than 100 novel alterations responsible for undiagnosed, rare genetic diseases.
Authorship, funding, disclosures
Mafalda Dias and Jonathan Frazer were co-senior authors on the paper. Additional authors include Courtney A. Shearer, Aaron W. Kollasch, Aviv D. Spinner, Thomas Hopf, Lood van Niekerk, and Dinko Franceschi.
Funding for the work was provided by a Chan Zuckerberg Initiative Award (Neurodegeneration Challenge Network, CZI2018-191853), a National Institutes of Health Transformational Research Award (TR01CA260415), a National Science Foundation Graduate Research Fellowship, the Spanish Ministry of Science and Innovation (PID2022-140793NA-I00; CEX2020-001049-S; MCIN/AEI/10.13039/501100011033, MCIN/AEI/10.13039/501100011033/FEDER, UE), and the Generalitat de Catalunya (Government of Catalonia) through the CERCA program.