Back to Search
Start Over
KNOWLEDGE DRIVEN BINNING AND PHEWAS ANALYSIS IN MARSHFIELD PERSONALIZED MEDICINE RESEARCH PROJECT USING BIOBIN.
- Source :
-
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing [Pac Symp Biocomput] 2016; Vol. 21, pp. 249-60. - Publication Year :
- 2016
-
Abstract
- Next-generation sequencing technology has presented an opportunity for rare variant discovery and association of these variants with disease. To address the challenges of rare variant analysis, multiple statistical methods have been developed for combining rare variants to increase statistical power for detecting associations. BioBin is an automated tool that expands on collapsing/binning methods by performing multi-level variant aggregation with a flexible, biologically informed binning strategy using an internal biorepository, the Library of Knowledge (LOKI). The databases within LOKI provide variant details, regional annotations and pathway interactions which can be used to generate bins of biologically-related variants, thereby increasing the power of any subsequent statistical test. In this study, we expand the framework of BioBin to incorporate statistical tests, including a dispersion-based test, SKAT, thereby providing the option of performing a unified collapsing and statistical rare variant analysis in one tool. Extensive simulation studies performed on gene-coding regions showed a Bin-KAT analysis to have greater power than BioBin-regression in all simulated conditions, including variants influencing the phenotype in the same direction, a scenario where burden tests often retain greater power. The use of Madsen- Browning variant weighting increased power in the burden analysis to that equitable with Bin-KAT; but overall Bin-KAT retained equivalent or higher power under all conditions. Bin-KAT was applied to a study of 82 pharmacogenes sequenced in the Marshfield Personalized Medicine Research Project (PMRP). We looked for association of these genes with 9 different phenotypes extracted from the electronic health record. This study demonstrates that Bin-KAT is a powerful tool for the identification of genes harboring low frequency variants for complex phenotypes.
- Subjects :
- Computational Biology methods
Computational Biology statistics & numerical data
Computer Simulation
Databases, Genetic statistics & numerical data
Genetic Variation
High-Throughput Nucleotide Sequencing statistics & numerical data
Humans
Knowledge Bases
Models, Genetic
Models, Statistical
Pharmacogenetics statistics & numerical data
Precision Medicine statistics & numerical data
Genome-Wide Association Study statistics & numerical data
Phenotype
Software
Subjects
Details
- Language :
- English
- ISSN :
- 2335-6936
- Volume :
- 21
- Database :
- MEDLINE
- Journal :
- Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
- Publication Type :
- Academic Journal
- Accession number :
- 26776191