1. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis
- Author
-
Arek Kasprzyk, Steffen Durinck, Wolfgang Huber, Yves Moreau, Sean Davis, Alvis Brazma, and Bart De Moor
- Subjects
Statistics and Probability ,SQL ,Information Storage and Retrieval ,Biological database ,Computational biology ,Ontology (information science) ,Biology ,computer.software_genre ,Biochemistry ,Bioconductor ,Annotation ,Data retrieval ,Ensembl ,Databases, Protein ,Molecular Biology ,Oligonucleotide Array Sequence Analysis ,computer.programming_language ,Microarray analysis techniques ,Gene Expression Profiling ,Computer Science Applications ,Systems Integration ,Computational Mathematics ,Computational Theory and Mathematics ,Database Management Systems ,Data mining ,computer ,Algorithms ,Software - Abstract
Summary:biomaRt is a new Bioconductor package that integrates BioMart data resources with data analysis software in Bioconductor. It can annotate a wide range of gene or gene product identifiers (e.g. Entrez-Gene and Affymetrix probe identifiers) with information such as gene symbol, chromosomal coordinates, Gene Ontology and OMIM annotation. Furthermore biomaRt enables retrieval of genomic sequences and single nucleotide polymorphism information, which can be used in data analysis. Fast and up-to-date data retrieval is possible as the package executes direct SQL queries to the BioMart databases (e.g. Ensembl). The biomaRt package provides a tight integration of large, public or locally installed BioMart databases with data analysis in Bioconductor creating a powerful environment for biological data mining. Availability:http://www.bioconductor.org. LGPL Contact: steffen.durinck@esat.kuleuven.ac.be
- Published
- 2005
- Full Text
- View/download PDF