Journal: briefings in bioinformatics / Topic: dna methylation - Searchworks@Jio Institute Digital Library Search Results

Showing total 4 results

1. Evaluation of variable selection methods for random forests and omics data sets

Author: Stephan Seifert, Silke Szymczak, and Frauke Degenhardt
Subjects: Paper; Clustering high-dimensional data; Computer science; Medical engineering; High dimensional data; Stability (learning theory); Breast Neoplasms; Feature selection; Engineering and technology; Machine Learning; Set (abstract data type); Medical and health sciences; Relevant variables; Biomarkers, Tumor; Feature (machine learning); Humans; Computer Simulation; Molecular Biology; Developmental biology; Parametric statistics; Health sciences; Gene Expression Profiling; Computational Biology; DNA Methylation; Random forest; Variable (computer science); Female; Data mining; Algorithms; Bioinformatics; Information Systems
Abstract: Machine learning methods and in particular random forests are promising approaches for prediction based on high dimensional omics data sets. They provide variable importance measures to rank predictors according to their predictive power. If building a prediction model is the main goal of a study, often a minimal set of variables with good prediction performance is selected. However, if the objective is the identification of involved variables to find active networks and pathways, approaches that aim to select all relevant variables should be preferred. We evaluated several variable selection procedures based on simulated data as well as publicly available experimental methylation and gene expression data. Our comparison included the Boruta algorithm, the Vita method, recurrent relative variable importance, a permutation approach and its parametric variant (Altmann) as well as recursive feature elimination (RFE). In our simulation studies, Boruta was the most powerful approach, followed closely by the Vita method. Both approaches demonstrated similar stability in variable selection, while Vita was the most robust approach under a pure null model without any predictor variables related to the outcome. In the analysis of the different experimental data sets, Vita demonstrated slightly better stability in variable selection and was less computationally intensive than Boruta. In conclusion, we recommend the Boruta and Vita approaches for the analysis of high-dimensional data sets. Vita is considerably faster than Boruta and thus more suitable for large data sets, but only Boruta can also be applied in low-dimensional settings.
Published: 2017

2. ‘Multi-omic’ data analysis using O-miner

Author: Ajanthah Sangaralingam, Abu Z Dayem Ullah, Jacek Marzec, Emanuela Gadaleta, Ai Nagano, Helen Ross-Adams, Jun Wang, Nicholas R Lemoine, and Claude Chelala
Subjects: Paper; Data Analysis; O-miner; Internet; Whole Genome Sequencing; Sequence Analysis, RNA; Gene Expression Profiling; Gene Dosage; Computational Biology; Sequencing; Genomics; Multi-omics; DNA Methylation; Software Design; Neoplasms; Databases, Genetic; Humans; Data integration; Software
Abstract: Innovations in -omics technologies have driven advances in biomedical research. However, integrating and analysing the large volumes of data generated from different high-throughput -omics technologies remain a significant challenge to basic and clinical scientists without bioinformatics skills or access to bioinformatics support. To address this demand, we have significantly updated our previous O-miner analytical suite, to incorporate several new features and data types to provide an efficient and easy-to-use Web tool for the automated analysis of data from ‘-omics’ technologies. Created from a biologist’s perspective, this tool allows for the automated analysis of large and complex transcriptomic, genomic and methylomic data sets, together with biological/clinical information, to identify significantly altered pathways and prioritize novel biomarkers/targets for biological validation. Our resource can be used to analyse both in-house data and the huge amount of publicly available information from array and sequencing platforms. Multiple data sets can be easily combined, allowing for meta-analyses. Here, we describe the analytical pipelines currently available in O-miner and present examples of use to demonstrate its utility and relevance in maximizing research output. O-miner Web server is free to use and is available at http://www.o-miner.org.
Published: 2017

3. Detecting differential DNA methylation from sequencing of bisulfite converted DNA of diverse species

Author: Taesung Park, Iksoo Huh, Soojin V. Yi, and Xin Wu
Subjects: Paper; Basic medicine; Bisulfite sequencing; Computational biology; Biology; Medical and health sciences; Clinical medicine; Species Specificity; Animals; Humans; Sulfites; Computer Simulation; Methylated DNA immunoprecipitation; Epigenetics; Insects; Molecular Biology; Illumina dye sequencing; Genetics; DNA methylation; Models, Statistical; Models, Genetic; Genome, Human; High-Throughput Nucleotide Sequencing; Sequence Analysis, DNA; Bees; Differentially methylated regions; Genomic DNA; Developmental biology; Illumina Methylation Assay; CpG Islands; Neurology & neurosurgery; Information Systems
Abstract: DNA methylation is one of the most extensively studied epigenetic modifications of genomic DNA. In recent years, sequencing of bisulfite-converted DNA, particularly via next-generation sequencing technologies, has become a widely popular method to study DNA methylation. This method can be readily applied to a variety of species, dramatically expanding the scope of DNA methylation studies beyond the traditionally studied human and mouse systems. In parallel to the increasing wealth of genomic methylation profiles, many statistical tools have been developed to detect differentially methylated loci (DMLs) or differentially methylated regions (DMRs) between biological conditions. We discuss and summarize several key properties of currently available tools to detect DMLs and DMRs from sequencing of bisulfite-converted DNA. However, the majority of the statistical tools developed for DML/DMR analyses have been validated using only mammalian data sets, and less priority has been placed on the analyses of invertebrate or plant DNA methylation data. We demonstrate that genomic methylation profiles of non-mammalian species are often highly distinct from those of mammalian species using examples of honey bees and humans. We then discuss how such differences in data properties may affect statistical analyses. Based on these differences, we provide three specific recommendations to improve the power and accuracy of DML and DMR analyses of invertebrate data when using currently available statistical tools. These considerations should facilitate systematic and robust analyses of DNA methylation from diverse species, thus advancing our understanding of DNA methylation.
Published: 2017

4. A survey of the approaches for identifying differential methylation using bisulfite sequencing data

Author: Tin Nguyen, Sorin Draghici, Adib Shafi, and Cristina Mitrea
Subjects: Paper; Basic medicine; Computer science; Bisulfite sequencing; Computational biology; Epigenesis, Genetic; Medical and health sciences; Clinical medicine; Humans; Sulfites; Molecular Biology; Extramural; High-Throughput Nucleotide Sequencing; Sequence Analysis, DNA; Methylation; DNA Methylation; Epigenetic Mechanism; Markov Chains; Logistic Models; Developmental biology; Differential Methylation; CpG Islands; Neurology & neurosurgery; Information Systems
Abstract: DNA methylation is an important epigenetic mechanism that plays a crucial role in cellular regulatory systems. Recent advancements in sequencing technologies now enable us to generate high-throughput methylation data and to measure methylation up to single-base resolution. This wealth of data does not come without challenges, and one of the key challenges in DNA methylation studies is to identify the significant differences in the methylation levels of the base pairs across distinct biological conditions. Several computational methods have been developed to identify differential methylation using bisulfite sequencing data; however, there is no clear consensus among existing approaches. A comprehensive survey of these approaches would be of great benefit to potential users and researchers to get a complete picture of the available resources. In this article, we present a detailed survey of 22 such approaches focusing on their underlying statistical models, primary features, key advantages and major limitations. Importantly, the intrinsic drawbacks of the approaches pointed out in this survey could potentially be addressed by future research.
Published: 2017
