10 years of genomics research at Google
For decades, scientists have been working to understand the genome, the operating manual for all of life on Earth. This manual lives in unending variations inside the DNA in each organism and affects everything from an organism's development to the rules of life more broadly, and yet it is one of the least understood aspects of biology.
Our journey into developing technologies for geneticists to study the genome of billions of humans, plants and animals began 10 years ago. A small team of researchers at Google decided to apply deep learning to various genome sequencing challenges, making it faster, more accurate and more efficient. What began in 2015 as a foundational research effort has now evolved into a global initiative, including partnerships with scientists and institutions across the globe to accelerate scientific discovery, advance healthcare and preserve biodiversity.
This work is already showing promising breakthroughs on behalf of human health — including today’s announcement of DeepSomatic, a new tool to identify cancer variants with more accuracy. But our work is not yet done, and more tools are coming. Today, we're taking a moment to look back at some of the key milestones and groundbreaking tools from our first 10 years in genomics, powered by AI.
Reading the genome
AI has proven instrumental in overcoming the most fundamental challenge in genomics: reading the code of life accurately and efficiently.
- 2015 - Research begins: Google applied cutting-edge deep learning techniques to the field of genomics for the first time, winning the 2016 PrecisionFDA Truth Challenge and kicking off a decade of exploration into the field.
- 2018 - Accurate identification of genetic variations: We openly released our state-of-the-art variant caller, DeepVariant. This deep learning-based tool accurately identifies genetic variants in DNA sequencing data and has become widely used, directly contributing to landmark milestones like the first truly complete human genome sequence.
- 2022 - More accurate genetic sequencing: We introduced DeepConsensus, a deep learning model that improves the accuracy of long-read sequencing data and reduces errors in raw genomic data. This has since allowed for a 250% increase in the throughput of the highest-quality sequence reads for researchers.
- 2022 - Completing the human genome: With help from DeepVariant, the final 8% of the human genome was fully decoded (the NIH Telomere-to-Telomere (T2T) project). This effort filled in the last missing pieces of our genetic blueprint, providing a more complete resource for researchers everywhere.
- 2023 - The human pangenome reference: Engineers used DeepConsensus and DeepVariant to help create the first draft of the human pangenome. This new reference genome includes sequences from multiple individuals of diverse ancestries, a critical step toward ensuring that all people benefit from genomic medicine.
- 2025 - More accurate gene assemblies: We introduced DeepPolisher, a deep learning tool that enhances the accuracy of genome assemblies for human, animal and plant genomes, improving the quality of reference genomes used in research.
Understanding the genome
Once the genome can be read accurately, the next challenge is creating a more comprehensive understanding of what function specific parts of the sequence encode and identifying which tiny variations cause disease.
- 2021 - Predicting gene expression: Enformer, a model that predicts gene expression from a genome sequence, was released to better understand non-coding genetic variants and design cell type specific DNA sequences, advancing basic biological understanding.
- 2023 - Assessing the disease-causing potential of coding genetic variants: We developed AlphaMissense, a model that predicts which genetic variants in the protein-coding regions of the genome are more likely to cause disease, providing a powerful new tool for geneticists.
- 2025 - Unifying model of non-coding variant effects: Our new AI model, AlphaGenome, predicts how single DNA variants affect biological processes, helping researchers understand the "dark matter" of the genome.

Accelerating discovery for people and the planet
Our AI tools are helping researchers solve major biological grand challenges, leading to breakthroughs in health and enabling conservation efforts for endangered species.
- 2021 - Phenotyping eye diseases: Our approach to training AI on large-scale biobank data of retinal images and genomics led to the discovery of new genetic links to eye diseases, identifying potential targets for drug development against conditions like glaucoma.
- 2022 - Record-breaking genetic diagnosis: A collaboration led by Stanford Medicine set a Guinness World Record for the fastest genetic diagnosis, identifying a disease-causing variant in under 8 hours with Google’s AI.
- 2023 - Sequencing DNA for conservation: Google provides technical and computational support for an ambitious initiative to sequence all eukaryotic life. Our collaboration has already aided genome projects for 17 critically endangered species, showcasing AI's role in conservation.
- 2024 - AI for common health measures: Google developed AI methods that use common medical tests to discover how our genes contribute to heart and lung diseases. Our methods improve disease prediction and genetic discovery by better understanding how genetics influences the body's fundamental processes.
- 2025 - Identifying variants in cancer cells: We developed an AI model, DeepSomatic, that more accurately identifies cancer-related genetic mutations. When tested on multiple cancer types, our method identified key variants missed by prior state-of-the-art tools, offering the potential to improve cancer diagnosis and treatment.

The advancement of genomics over the past decade comes down to our approach to creating an innovation flywheel: we are driven by a real-world problem, like understanding the human genome to deliver better healthcare, to conduct fundamental computer science research. That research leads to a solution, which in turn empowers us to uncover even more interesting problems to solve, such as applying this work to the preservation of our planet’s species and protection of biodiversity.
After a decade of driving breakthrough discoveries from research to reality for advancing genetics, there is now an extraordinary research community that collaborates on this powerful suite of scientific tools to improve the lives of every being on this planet.
We can't wait to see what the next 10 years of research together will bring.