Back to Search
Start Over
Gene- or region-based association study via kernel principal component analysis
- Source :
- BMC Genetics, BMC Genetics, Vol 12, Iss 1, p 75 (2011)
- Publication Year :
- 2011
- Publisher :
- BioMed Central, 2011.
-
Abstract
- Background In genetic association study, especially in GWAS, gene- or region-based methods have been more popular to detect the association between multiple SNPs and diseases (or traits). Kernel principal component analysis combined with logistic regression test (KPCA-LRT) has been successfully used in classifying gene expression data. Nevertheless, the purpose of association study is to detect the correlation between genetic variations and disease rather than to classify the sample, and the genomic data is categorical rather than numerical. Recently, although the kernel-based logistic regression model in association study has been proposed by projecting the nonlinear original SNPs data into a linear feature space, it is still impacted by multicolinearity between the projections, which may lead to loss of power. We, therefore, proposed a KPCA-LRT model to avoid the multicolinearity. Results Simulation results showed that KPCA-LRT was always more powerful than principal component analysis combined with logistic regression test (PCA-LRT) at different sample sizes, different significant levels and different relative risks, especially at the genewide level (1E-5) and lower relative risks (RR = 1.2, 1.3). Application to the four gene regions of rheumatoid arthritis (RA) data from Genetic Analysis Workshop16 (GAW16) indicated that KPCA-LRT had better performance than single-locus test and PCA-LRT. Conclusions KPCA-LRT is a valid and powerful gene- or region-based method for the analysis of GWAS data set, especially under lower relative risks and lower significant levels.
- Subjects :
- Genetics
Candidate gene
Principal Component Analysis
Models, Statistical
lcsh:QH426-470
Genome-wide association study
Biology
Logistic regression
Polymorphism, Single Nucleotide
Kernel principal component analysis
lcsh:Genetics
Logistic Models
Multicollinearity
Sample size determination
Principal component analysis
Humans
Genetics(clinical)
Computer Simulation
Genetic Predisposition to Disease
Categorical variable
Genetics (clinical)
Algorithms
Genetic Association Studies
Research Article
Subjects
Details
- Language :
- English
- ISSN :
- 14712156
- Volume :
- 12
- Database :
- OpenAIRE
- Journal :
- BMC Genetics
- Accession number :
- edsair.doi.dedup.....20d75a0ab07b1efd8ddc4f4cb27c2435