American University
Browse

Scalable bias-corrected linkage disequilibrium estimation under genotype uncertainty

Download (620.2 kB)
preprint
posted on 2023-08-04, 11:35 authored by David GerardDavid Gerard

Linkage disequilibrium (LD) estimates are often calculated genome-wide for use in many tasks, such as SNP pruning and LD decay estimation. However, in the presence of genotype uncertainty, naive approaches to calculating LD have extreme attenuation biases, incorrectly suggesting that SNPs are less dependent than in reality. These biases are particularly strong in polyploid organisms, which often exhibit greater levels of genotype uncertainty than diploids. A principled approach using maximum likelihood estimation with genotype likelihoods can reduce this bias, but is prohibitively slow for genome-wide applications. Here, we present scalable moment-based adjustments to LD estimates based on the marginal posterior distributions of the genotypes. We demonstrate, on both simulated and real data, that these moment-based estimators are as accurate as maximum likelihood estimators, but are almost as fast as naive approaches based only on posterior mean genotypes. This opens up bias-corrected LD estimation to genome-wide applications. In addition, we provide standard errors for these moment-based estimators. All methods discussed in this manuscript are implemented in the ldsep package, available on the Comprehensive R Archive Network (https://cran.r-project.org/package=ldsep).

History

Publisher

Springer Nature

Notes

Heredity, 2021.

Handle

http://hdl.handle.net/1961/auislandora:95326

Usage metrics

    Mathematics & Statistics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC