Publication Type: | Journal Article |
Year of Publication: | 2005 |
Authors: | M. Stephens, Scheet P. |
Journal: | American journal of human genetics |
Volume: | 76 |
Pagination: | 449–62 |
ISSN: | 0002-9297 |
Keywords: | Algorithms, Alleles, Biometry, Chromosomes, Data Interpretation, Genetic, Genomics, Genomics: statistics & numerical data, Genotype, Haplotypes, Haplotypes: genetics, Human, Humans, Linkage Disequilibrium, Male, Models, Software, Statistical, X, X: genetics |
Abstract: | Although many algorithms exist for estimating haplotypes from genotype data, none of them take full account of both the decay of linkage disequilibrium (LD) with distance and the order and spacing of genotyped markers. Here, we describe an algorithm that does take these factors into account, using a flexible model for the decay of LD with distance that can handle both "blocklike" and "nonblocklike" patterns of LD. We compare the accuracy of this approach with a range of other available algorithms in three ways: for reconstruction of randomly paired, molecularly determined male X chromosome haplotypes; for reconstruction of haplotypes obtained from trios in an autosomal region; and for estimation of missing genotypes in 50 autosomal genes that have been completely resequenced in 24 African Americans and 23 individuals of European descent. For the autosomal data sets, our new approach clearly outperforms the best available methods, whereas its accuracy in inferring the X chromosome haplotypes is only slightly superior. For estimation of missing genotypes, our method performed slightly better when the two subsamples were combined than when they were analyzed separately, which illustrates its robustness to population stratification. Our method is implemented in the software package PHASE (v2.1.1), available from the Stephens Lab Web site. |
URL: | http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1196397&tool=pmcentrez&rendertype=abstract |
DOI: | 10.1086/428594 |