- About
- News & Communications
- Programs & Events
- Get in Touch
Back to Top Nav
Back to Top Nav
Back to Top Nav
Back to Top Nav
"Deep-learning discovery of variants that alter 3D genome folding"
Katherine S. Pollard
Director, Gladstone Institute of Data Science & Biotechnology
Professor, University of California San Francisco
"Deep-learning discovery of variants that alter 3D genome folding"
The human genome sequence folds in three dimensions (3D) into a rich variety of locus-specific contact patterns. Despite growing appreciation for the importance of 3D genome folding in evolution and disease, we lack models for relating mutations in genome sequences to changes in genome structure and function. Towards that goal, we developed a computational model (a deep convolutional neural network called Akita) that accurately predicts genome folding from DNA sequence alone. Representations learned by Akita underscore the importance of the structural protein CTCF but also reveal a complex grammar beyond CTCF binding sites that underlies genome folding, including an unexpectedly large contribution of repetitive elements. Akita enables rapid in silico predictions for effects of sequence variants on the 3D genome, including differences in genome folding across species and in disease cohorts. We validated high-scoring causal variants from patients with autism, developmental delay, and congenital heart defects using CRISPR-edited genomes. This prediction-first strategy exemplifies my vision for a more proactive, rather than reactive, role for data science in biomedical research.
Events are free and open to the public unless otherwise noted.