
Illumina unveiled AI software to predict disease-causing genetic mutations in patients
On Jun. 1, 2023, Illumina announced the new PrimateAI-3D, an artificial intelligence (AI) algorithm that predicts with unprecedented accuracy disease-causing genetic mutations in patients. The results were published in two papers in the Jun. 2 issue of Science, detailing the training of the algorithm and its application to half a million genomes in the UK Biobank cohort. Two accompanying papers on the primate evolution research that informed the development of PrimateAI-3D also published in Science.
According to the National Institutes of Health, the amount of genomic data being generated is approaching 40 billion gigabytes each year. The ability to share, analyze, and interpret genomic data is critical to unlocking discoveries that will advance understanding of human health and improve precision medicine.
An AI algorithm trained by natural selection. Each person carries millions of genetic variants that underlie individual differences in health and disease risk, but most of these variants are presently of unknown function. By highlighting disease-causing variants with unparalleled accuracy, PrimateAI-3D addresses a critical challenge facing the successful implementation of personalized genomic medicine.
To achieve its state-of-the-art performance, PrimateAI-3D utilizes deep neural network architectures similar to ChatGPT and AlphaFold, but is trained on genome sequences rather than human language. However, unlike generative language models such as ChatGPT, where existing texts can be used to inform training, the genetic variants that cause disease in the human genome are largely unknown.
To overcome this, PrimateAI-3D effectively uses natural selection to train the parameters of the deep neural network, using millions of benign genetic variants identified through the sequencing of 233 diverse primate species, the largest such sequencing effort of nonhuman primate species to date. Sequencing nonhuman primates can help scientists infer the pathogenicity of human genetic variants, and thus improve clinical variant interpretation on a genome-wide scale.
The result is a deep neural network that has been shown to identify disease-causing variants with superior accuracy in all six clinical cohorts that were tested, and provide individualized predictions of genetic disease risk that have been validated in a cohort of nearly half a million people.
As described in the accompanying paper published in Science, Illumina scientists, along with academic collaborators, next applied the PrimateAI-3D algorithm to identify rare pathogenic mutations in nearly half a million individuals in the UK Biobank. They found that the genomes of 97% of otherwise healthy members of the general population harbor highly actionable variants for at least one of 90 different clinical conditions that they surveyed. PrimateAI-3D also greatly improved the accuracy of genetic risk prediction, enabling the first demonstration of polygenic risk scores that were largely unaffected by ancestry bias, a key step toward the equitable implementation of genetic-based precision medicine for diverse, non-European populations.
Tags:
Source: Illumina
Credit:
