Back to Search
Start Over
Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
- Source :
- BMC Bioinformatics
- Publication Year :
- 2014
- Publisher :
- Springer Science and Business Media LLC, 2014.
-
Abstract
- Background Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem. Results Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented. Conclusions Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14–0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented.
- Subjects :
- Current (mathematics)
Databases, Factual
Computer science
Ontology (information science)
computer.software_genre
Biochemistry
Concept recognition
Open Biomedical Ontologies
Structural Biology
Data Mining
Molecular Biology
business.industry
Applied Mathematics
Scale (chemistry)
Reproducibility of Results
Biological Ontologies
Computer Science Applications
Range (mathematics)
Ontology
Artificial intelligence
Data mining
business
computer
Natural language processing
Research Article
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 15
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....2f7f5ae68827e993091e546e3dd12d6d
- Full Text :
- https://doi.org/10.1186/1471-2105-15-59