Back to Search Start Over

Toward Data-Model-Agnostic Autonomous Machine-Generated Data Labeling and Annotation Platform: COVID-19 Autoannotation Use Case

Authors :
Vinayakumar Ravi
Ajay Arunachalam
Vasundhara Acharya
Tuan D. Pham
Source :
IEEE Transactions on Engineering Management. :1-12
Publication Year :
2022
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2022.

Abstract

Quick, early, and precise detection is important for diagnosis to control the spread of COVID-19 infection. Artificial Intelligence (AI) technology could certainly be used as a modulating tool to ease the detection, and help with the preventive steps further. Convolutional neural networks (CNNs) have achieved state-of-the-art performance in many visual recognition tasks. Nevertheless, most of these state-of-the-art networks highly rely on the availability of a high amount of labeled data, being an essential step in supervised machine learning tasks. Conventionally, this manual, mundane, and time-consuming process of annotating images is done by humans. Learning to localize or detect COVID-19 infection masks in our specific case study typically requires the collection of CT scan data that has been labeled with bounding boxes or similar annotations, which generally is limited. A technique that could perform such learning with much less annotations, and transfer the learned proposals that are algorithm-driven to generate more synthetic annotated samples would be helpful & quite valuable. We present such a technique inspired by weakly trained mask region based convolutional neural networks (R-CNN) architecture for localization, in which the number of images with their pixel-level masks can be a small proportion of the total dataset, and then further improvise CNNs by inversely generating dense annotations on-the-go using an algorithmic-based computational approach. We focus on alleviating the bottleneck associated with deep learning models needing annotated data for training in an intuitive reverse engineering fashion through this work. Our proposed solution can certainly provide the prospect of automated labeling on-the-fly, thereby reducing much of the manual work. As a result, one can quickly train a precise COVID-19 infection detector with the leverage of autonomous frame-by-frame machine generated annotations. The model achieved mean precision accuracy (%) of 0.99, 0.931, and 0.8 for train, validation, and test set, respectively. The results demonstrate that the proposed method can be adopted in a clinical setting for assisting radiologists, and also our fully autonomous approach can be generalized to any detection/recognition tasks at ease.

Details

ISSN :
15580040 and 00189391
Database :
OpenAIRE
Journal :
IEEE Transactions on Engineering Management
Accession number :
edsair.doi...........1a6b639a26384897e119463b9b65197e