Deep learning in population genetics

Kevin Korfmann, Oscar E Gaggiotti, Matteo Fumagalli*

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

8 Downloads (Pure)

Abstract

Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine learning, and in particular deep learning, algorithms are emerging as popular techniques for population genetic inferences. These approaches rely on algorithms that learn non-linear relationships between the input data and the model parameters being estimated through representation learning from training data sets. Deep learning algorithms currently employed in the field comprise discriminative and generative models with fully connected, convolutional, or recurrent layers. Additionally, a wide range of powerful simulators to generate training data under complex scenarios are now available. The application of deep learning to empirical data sets mostly replicates previous findings of demography reconstruction and signals of natural selection in model organisms. To showcase the feasibility of deep learning to tackle new challenges, we designed a branched architecture to detect signals of recent balancing selection from temporal haplotypic data, which exhibited good predictive performance on simulated data. Investigations on the interpretability of neural networks, their robustness to uncertain training data, and creative representation of population genetic data, will provide further opportunities for technological advancements in the field.
Original languageEnglish
Article numberevad008
Number of pages20
JournalGenome Biology and Evolution
Volume15
Issue number2
Early online date23 Jan 2023
DOIs
Publication statusPublished - 2 Feb 2023

Keywords

  • Population genetics
  • Machine learning
  • Artifical neural networks
  • Simulations
  • Balancing selection

Fingerprint

Dive into the research topics of 'Deep learning in population genetics'. Together they form a unique fingerprint.

Cite this