How local reference panels improve imputation in French populations - Université de Bretagne Occidentale
Article Dans Une Revue Scientific Reports Année : 2024

How local reference panels improve imputation in French populations

Camille Charbonnier-Le Clézio
  • Fonction : Auteur
Chantal Adjou
  • Fonction : Auteur
Philippe-Antoine Halbout
  • Fonction : Auteur
David L’helgouach
  • Fonction : Auteur
Anne Vogelsperger
  • Fonction : Auteur
Pierre Lindenbaum
  • Fonction : Auteur
Vincent Meyer
  • Fonction : Auteur
Florian Sandron
  • Fonction : Auteur
Damien Delafoy
  • Fonction : Auteur
Isabel Alves
  • Fonction : Auteur
Matilde Karakachoff
  • Fonction : Auteur
Christian Dina
  • Fonction : Auteur
Richard Redon
  • Fonction : Auteur
Jean-François Deleuze
  • Fonction : Auteur

Résumé

Imputation servers offer the exclusive possibility to harness the largest public reference panels which have been shown to deliver very high precision in the imputation of European genomes. Many studies have nonetheless stressed the importance of ‘study specific panels’ (SSPs) as an alternative and have shown the benefits of combining public reference panels with SSPs. But such combined approaches are not attainable when using external imputation servers. To investigate how to confront this challenge, we imputed 550 French individuals using either the University of Michigan imputation server with the Haplotype Reference Consortium (HRC) panel or an in‑house SSP of 850 whole‑genome sequenced French individuals. With approximate geo‑localization of both our target and SSP individuals we are able to pinpoint different scenarios where SSP‑based imputation would be preferred over server‑based imputation or vice‑versa. This is achieved by showing to a high degree of resolution the importance of the proximity of the reference panel to target individuals; with a focus on the clear added value of SSPs for estimating haplotype phase and for the imputation of rare variants (minor allele‑frequency below 0.01). Such benefits were most evident for individuals from the same geographical regions in France as the SSP individuals. Overall, only 42.3% of all 125,442 variants evaluated were better imputed with an SSP from France compared to an external reference panel, however this rises to 58.1% for individuals from geographic regions well covered by the SSP. By investigating haplotype sharing and population fine‑structure in France, we show the importance of including SSP haplotypes for imputation but also that they should ideally be combined with large public panels. In the absence of the unattainable results from a combined panel of the HRC and our French SSP, we put forward a pragmatic solution where server‑based and SSP‑based imputation outcomes can be combined based on comparing posterior genotype probabilities. We show that such an approach can give a level of imputation accuracy in excess of what could be achieved with either strategy alone. The results presented provide detailed insights into the accuracy of imputation that should be expected from different strategies for European populations.
Fichier principal
Vignette du fichier
How local reference panels.pdf (1.72 Mo) Télécharger le fichier
Origine Fichiers éditeurs autorisés sur une archive ouverte
Licence

Dates et versions

hal-04501605 , version 1 (12-03-2024)

Licence

Identifiants

Citer

Anthony Herzig, Lourdes Velo‐suárez, Dominique Campion, Jean-François Dartigues, Jean-Charles Lambert, et al.. How local reference panels improve imputation in French populations. Scientific Reports, 2024, 14 (1), pp.370. ⟨10.1038/s41598-023-49931-3⟩. ⟨hal-04501605⟩
76 Consultations
63 Téléchargements

Altmetric

Partager

More