Back to Search Start Over

EXTENDED DISTRIBUTED PROTOTYPICAL FOR BIOMEDICAL NAMED ENTITY RECOGNITION

Authors :
Maan Tareq Abd
Masnizah Mohd
Source :
Asia-Pacific Journal of Information Technology and Multimedia, Vol 6, Iss 02, Pp 1-11 (2017)
Publication Year :
2017
Publisher :
UKM Press, 2017.

Abstract

Biomedical Named Entity Recognition (Bio-NER) is an essential step of biomedical information extraction and biomedical text mining. Although, a lot of researches have been made in the design of rule-based and supervised tools for general NER, Bio-NER still remains a challenge and an area of active research, as still there is huge difference in F-score of 10 points between general newswire NER and Bio-NER. The complex structures of the biomedical entities pose a huge challenge for their recognition. To handle this, this paper explores different effective word representations with Support Vector Machine (SVM) to deal with the complex structures of biomedical named entities. First, this paper identifies and evaluates a set of morphological and contextual features with SVM learning method for Bio-NER. This paper also presents an extended distributed representation word embedding technique (EDRWE) for Bio-NER. These models are evaluated on widely used standard Bio-NER dataset namely GENIA corpus. Experimental results show that EDRWE technique improves the overall performance of the Bio-NER and outperforms all other representation methods. Results analysis shows that the new EDRWE is satisfactory and effective for Bio-NER especially when only a small-sized data set is available.

Details

Language :
English, Malay
ISSN :
20170602 and 22892192
Volume :
6
Issue :
02
Database :
Directory of Open Access Journals
Journal :
Asia-Pacific Journal of Information Technology and Multimedia
Publication Type :
Academic Journal
Accession number :
edsdoj.13883ab6ac784966bc32c416045689e9
Document Type :
article
Full Text :
https://doi.org/10.17576/apjitm-2017-0602-01