Back to Search Start Over

Distributed Representation of Healthcare Text Through Qualitative and Quantitative Analysis

Authors :
H B Barathi Ganesh
J. R. Naveen
M. Anand Kumar
K. P. Soman
Source :
Computer Aided Intervention and Diagnostics in Clinical and Medical Images ISBN: 9783030040604
Publication Year :
2019
Publisher :
Springer International Publishing, 2019.

Abstract

Many healthcare-related applications use pretrained embeddings, but these are often trained over general corpus which is mostly downstreamed to certain particular application. One problem noticed among such embeddings is that these are not efficient across various health text applications and even less number of research describe evaluation of these embedding for health domain. In this paper, distributional embedding model is performed to acquire a word representation on data crawled from Journal of Medical Case Reports. This distributed embedding model is analyzed qualitatively and quantitatively over crawled corpus. Qualitative evaluation is employed by cosine similarity on different categories and is visually represented. Quantitative evaluation performed with parts of speech tagging and entity recognition. The embedding model attained a cross-validation accuracy of 91.70% in parts of speech tagging for GENIA corpus and ensured 83% accuracy in the entity recognition of i2b2 clinical data.

Details

ISBN :
978-3-030-04060-4
ISBNs :
9783030040604
Database :
OpenAIRE
Journal :
Computer Aided Intervention and Diagnostics in Clinical and Medical Images ISBN: 9783030040604
Accession number :
edsair.doi...........b89f4629dcf8c6dea9eda27df77aa6c4