American University
Browse

Priors for genotyping polyploids

Download (733.24 kB)
journal contribution
posted on 2023-08-05, 12:59 authored by David GerardDavid Gerard, Luís Felipe Ventorim Ferrão

Motivation: Empirical Bayes techniques to genotype polyploid organisms usually either (i) assume technical artifacts are known a priori or (ii) estimate technical artifacts simultaneously with the prior genotype distribution. Case (i) is unappealing as it places the onus on the researcher to estimate these artifacts, or to ensure that there are no systematic biases in the data. However, as we demonstrate with a few empirical examples, case (ii) makes choosing the class of prior genotype distributions extremely important. Choosing a class is either too flexible or too restrictive results in poor genotyping performance. Results: We propose two classes of prior genotype distributions that are of intermediate levels of flexibility: the class of proportional normal distributions and the class of unimodal distributions. We provide a complete characterization of and optimization details for the class of unimodal distributions. We demonstrate, using both simulated and real data that using these classes results in superior genotyping performance. Availability and implementation: Genotyping methods that use these priors are implemented in the updog R package available on the Comprehensive R Archive Network: https://cran.r-project.org/package¼updog. All code needed to reproduce the results of this article is available on GitHub: https://github.com/dcgerard/reproduce_prior_sims.

History

Publisher

Bioinformatics

Notes

Bioinformatics, Volume 36, Issue 6, 15 March 2020, Pages 1795–1800.

Handle

http://hdl.handle.net/1961/auislandora:88400

Usage metrics

    Mathematics & Statistics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC