sciencehabit shares a report from Science Magazine: What if, rather than scouring the internet, ChatGPT could search all of the DNA on Earth? That future just got a bit closer with Evo, an AI model reported today in Science. The program — trained on billions of lines of genetic sequences — can design new proteins and even whole genomes. Previous AIs could only interpret and predict relatively short sections of DNA, and they could only work with groups of nucleotides — the A, C, G, T alphabet of DNA — not individual nucleotides. To take things to the next level, researchers trained Evo on 300 billion nucleotides of sequence information.
In a first test, Evo bested other AI models on predicting the impact of mutations on protein performance. The team then had Evo design new versions of the CRISPR genome editor; the best designs were as good at cutting DNA as a commercial version. And in what study author Brian Hie, a computational biologist at Stanford University, calls the “most futuristic and crazy” part of the study, the researchers asked Evo to generate DNA sequences that are long enough to serve as genomes for bacteria — a step toward AI-designed synthetic genomes.
Much of the work on AI occurs in secret at companies. But the researchers have released Evo publicly so that other researchers can use it, and Hie says the team has no plans to commercialize its creation. “For now, I see this as a research project.”