On coding genotypes for genetic markers with multiple alleles in genetic association study of quantitative traits. BMC Genet 2011 Sep 21;12:82
Date
09/23/2011Pubmed ID
21936918Pubmed Central ID
PMC3224146DOI
10.1186/1471-2156-12-82Scopus ID
2-s2.0-80053175923 (requires institutional sign-in at Scopus site) 8 CitationsAbstract
BACKGROUND: In genetic association study of quantitative traits using F(∞) models, how to code the marker genotypes and interpret the model parameters appropriately is important for constructing hypothesis tests and making statistical inferences. Currently, the coding of marker genotypes in building F(∞) models has mainly focused on the biallelic case. A thorough work on the coding of marker genotypes and interpretation of model parameters for F(∞) models is needed especially for genetic markers with multiple alleles.
RESULTS: In this study, we will formulate F(∞) genetic models under various regression model frameworks and introduce three genotype coding schemes for genetic markers with multiple alleles. Starting from an allele-based modeling strategy, we first describe a regression framework to model the expected genotypic values at given markers. Then, as extension from the biallelic case, we introduce three coding schemes for constructing fully parameterized one-locus F(∞) models and discuss the relationships between the model parameters and the expected genotypic values. Next, under a simplified modeling framework for the expected genotypic values, we consider several reduced one-locus F(∞) models from the three coding schemes on the estimability and interpretation of their model parameters. Finally, we explore some extensions of the one-locus F(∞) models to two loci. Several fully parameterized as well as reduced two-locus F(∞) models are addressed.
CONCLUSIONS: The genotype coding schemes provide different ways to construct F(∞) models for association testing of multi-allele genetic markers with quantitative traits. Which coding scheme should be applied depends on how convenient it can provide the statistical inferences on the parameters of our research interests. Based on these F(∞) models, the standard regression model fitting tools can be used to estimate and test for various genetic effects through statistical contrasts with the adjustment for environmental factors.
Author List
Wang TAuthor
Tao Wang PhD Professor in the Data Science Institute department at Medical College of WisconsinMESH terms used to index this publication - Major topics in bold
AllelesChromosome Mapping
Gene Frequency
Genetic Association Studies
Genetic Markers
Genotype
Models, Genetic
Models, Statistical
Quantitative Trait Loci
Quantitative Trait, Heritable