Back to Search Start Over

REDBot: Natural language process methods for clinical copy number variation reporting in prenatal and products of conception diagnosis

Authors :
Mengmeng Liu
Yunshan Zhong
Hongqian Liu
Desheng Liang
Erhong Liu
Yu Zhang
Feng Tian
Qiaowei Liang
David S. Cram
Hua Wang
Lingqian Wu
Fuli Yu
Source :
Molecular Genetics & Genomic Medicine, Vol 8, Iss 11, Pp n/a-n/a (2020)
Publication Year :
2020
Publisher :
Wiley, 2020.

Abstract

Abstract Background Current copy number variation (CNV) identification methods have rapidly become mature. However, the postdetection processes such as variant interpretation or reporting are inefficient. To overcome this situation, we developed REDBot as an automated software package for accurate and direct generation of clinical diagnostic reports for prenatal and products of conception (POC) samples. Methods We applied natural language process (NLP) methods for analyzing 30,235 in‐house historical clinical reports through active learning, and then, developed clinical knowledge bases, evidence‐based interpretation methods and reporting criteria to support the whole postdetection pipeline. Results Of the 30,235 reports, we obtained 37,175 CNV‐paragraph pairs. For these pairs, the active learning approaches achieved a 0.9466 average F1‐score in sentence classification. The overall accuracy for variant classification was 95.7%, 95.2%, and 100.0% in retrospective, prospective, and clinical utility experiments, respectively. Conclusion By integrating NLP methods in CNVs postdetection pipeline, REDBot is a robust and rapid tool with clinical utility for prenatal and POC diagnosis.

Subjects

Subjects :
Genetics
QH426-470

Details

Language :
English
ISSN :
23249269
Volume :
8
Issue :
11
Database :
Directory of Open Access Journals
Journal :
Molecular Genetics & Genomic Medicine
Publication Type :
Academic Journal
Accession number :
edsdoj.647d7d95d724eb3baab590366c0d572
Document Type :
article
Full Text :
https://doi.org/10.1002/mgg3.1488