Back to Search
Start Over
Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
- Source :
- Journal of Biomedical Semantics, Journal of Biomedical Semantics, Vol 2, Iss Suppl 5, p S7 (2011)
- Publisher :
- Springer Nature
-
Abstract
- Background This paper presents a novel approach to the problem of hedge detection, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical domain. We here propose to view hedge detection as a simple disambiguation problem, restricted to words that have previously been observed as hedge cues. As the feature space for the classifier is still very large, we also perform experiments with dimensionality reduction using the method of random indexing. Results The SVM-based classifiers developed in this paper achieves the best published results so far for sentence-level uncertainty prediction on the CoNLL-2010 Shared Task test data. We also show that the technique of random indexing can be successfully applied for reducing the dimensionality of the original feature space by several orders of magnitude, without sacrificing classifier performance. Conclusions This paper introduces a simplified approach to detecting speculation or uncertainty in text, focusing on the biomedical domain. Evaluated at the sentence-level, our SVM-based classifiers achieve the best published results so far. We also show that the feature space can be aggressively compressed using random indexing while still maintaining comparable classifier performance.
- Subjects :
- Computer Networks and Communications
Computer science
Feature vector
Dimensionality reduction
Research
Health Informatics
computer.software_genre
lcsh:Computer applications to medicine. Medical informatics
Computer Science Applications
Task (project management)
Domain (software engineering)
Random indexing
Classifier (linguistics)
lcsh:R858-859.7
Data mining
Hedge (finance)
Speculation
computer
Information Systems
Subjects
Details
- Language :
- English
- ISSN :
- 20411480
- Volume :
- 2
- Issue :
- Suppl 5
- Database :
- OpenAIRE
- Journal :
- Journal of Biomedical Semantics
- Accession number :
- edsair.doi.dedup.....d6ba84c3cba53b5d42fdafe82db7a36f
- Full Text :
- https://doi.org/10.1186/2041-1480-2-s5-s7