Author: "Jae Ho Sohn" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Jae Ho Sohn"' showing total 80 results

Start Over Author "Jae Ho Sohn"

80 results on '"Jae Ho Sohn"'

1. An open-source fine-tuned large language model for radiological impression generation: a multi-reader performance study

Author: Adrian Serapio, Gunvant Chaudhari, Cody Savage, Yoo Jin Lee, Maya Vella, Shravan Sridhar, Jamie Lee Schroeder, Jonathan Liu, Adam Yala, and Jae Ho Sohn
Subjects: Natural language processing, Large language model, Open-source, Summarization, Impressions, Medical technology, R855-855.5
Abstract: Abstract Background The impression section integrates key findings of a radiology report but can be subjective and variable. We sought to fine-tune and evaluate an open-source Large Language Model (LLM) in automatically generating impressions from the remainder of a radiology report across different imaging modalities and hospitals. Methods In this institutional review board-approved retrospective study, we collated a dataset of CT, US, and MRI radiology reports from the University of California San Francisco Medical Center (UCSFMC) (n = 372,716) and the Zuckerberg San Francisco General (ZSFG) Hospital and Trauma Center (n = 60,049), both under a single institution. The Recall-Oriented Understudy for Gisting Evaluation (ROUGE) score, an automatic natural language evaluation metric that measures word overlap, was used for automatic natural language evaluation. A reader study with five cardiothoracic radiologists was performed to more strictly evaluate the model’s performance on a specific modality (CT chest exams) with a radiologist subspecialist baseline. We stratified the results of the reader performance study based on the diagnosis category and the original impression length to gauge case complexity. Results The LLM achieved ROUGE-L scores of 46.51, 44.2, and 50.96 on UCSFMC and upon external validation, ROUGE-L scores of 40.74, 37.89, and 24.61 on ZSFG across the CT, US, and MRI modalities respectively, implying a substantial degree of overlap between the model-generated impressions and impressions written by the subspecialist attending radiologists, but with a degree of degradation upon external validation. In our reader study, the model-generated impressions achieved overall mean scores of 3.56/4, 3.92/4, 3.37/4, 18.29 s,12.32 words, and 84 while the original impression written by a subspecialist radiologist achieved overall mean scores of 3.75/4, 3.87/4, 3.54/4, 12.2 s, 5.74 words, and 89 for clinical accuracy, grammatical accuracy, stylistic quality, edit time, edit distance, and ROUGE-L score respectively. The LLM achieved the highest clinical accuracy ratings for acute/emergent findings and on shorter impressions. Conclusions An open-source fine-tuned LLM can generate impressions to a satisfactory level of clinical accuracy, grammatical accuracy, and stylistic quality. Our reader performance study demonstrates the potential of large language models in drafting radiology report impressions that can aid in streamlining radiologists’ workflows.
Published: 2024
Full Text: View/download PDF

2. Mitral Annular Disjunction

Author: Fahad A. Alfares, MD, Jae Ho Sohn, MD, MS, Yoo Jin Lee, MD, Rachel Farrell, MS, Francesca N. Delling, MD, MPH, Kishor Avasarala, MD, Anita J. Moon-Grady, MD, Shafkat Anwar, MD, and Karyn M. Austin, MD, PhD
Subjects: mitral annular disjunction, mitral valve prolapse, pediatrics, sudden cardiac arrest, ventricular arrhythmia, Diseases of the circulatory (Cardiovascular) system, RC666-701
Abstract: Mitral annular disjunction (MAD) is a rare and under-recognized entity in the pediatric population. We present 2 cases of MAD in previously healthy pediatric patients and highlight clinical scenarios where MAD should be suspected.
Published: 2024
Full Text: View/download PDF

3. Fast Screening Cardiothoracic MRI Protocol in 0.55T Low-field Strength MRI: Initial Experience in Healthy Volunteers

Author: Yoo Jin Lee, MD, Jae Ho Sohn, MD, Dante Capaldi, and Yang Yang, PhD
Subjects: Diseases of the circulatory (Cardiovascular) system, RC666-701
Published: 2024
Full Text: View/download PDF

4. Prediction of future healthcare expenses of patients from chest radiographs using deep learning: a pilot study

Author: Jae Ho Sohn, Yixin Chen, Dmytro Lituiev, Jaewon Yang, Karen Ordovas, Dexter Hadley, Thienkhai H. Vu, Benjamin L. Franc, and Youngho Seo
Subjects: Medicine, Science
Abstract: Abstract Our objective was to develop deep learning models with chest radiograph data to predict healthcare costs and classify top-50% spenders. 21,872 frontal chest radiographs were retrospectively collected from 19,524 patients with at least 1-year spending data. Among the patients, 11,003 patients had 3 years of cost data, and 1678 patients had 5 years of cost data. Model performances were measured with area under the receiver operating characteristic curve (ROC-AUC) for classification of top-50% spenders and Spearman ρ for prediction of healthcare cost. The best model predicting 1-year (N = 21,872) expenditure achieved ROC-AUC of 0.806 [95% CI 0.793–0.819] for top-50% spender classification and ρ of 0.561 [0.536–0.586] for regression. Similarly, for predicting 3-year (N = 12,395) expenditure, ROC-AUC of 0.771 [0.750–0.794] and ρ of 0.524 [0.489–0.559]; for predicting 5-year (N = 1779) expenditure ROC-AUC of 0.729 [0.667–0.729] and ρ of 0.424 [0.324–0.529]. Our deep learning model demonstrated the feasibility of predicting health care expenditure as well as classifying top 50% healthcare spenders at 1, 3, and 5 year(s), implying the feasibility of combining deep learning with information-rich imaging data to uncover hidden associations that may allude to physicians. Such a model can be a starting point of making an accurate budget in reimbursement models in healthcare industries.
Published: 2022
Full Text: View/download PDF

5. Clinical language search algorithm from free-text: facilitating appropriate imaging

Author: Gunvant R. Chaudhari, Yeshwant R. Chillakuru, Timothy L. Chen, Valentina Pedoia, Thienkhai H. Vu, Christopher P. Hess, Youngho Seo, and Jae Ho Sohn
Subjects: Natural language processing, Information retrieval, Appropriateness criteria, Term frequency-inverse document frequency, Medical technology, R855-855.5
Abstract: Abstract Background The comprehensiveness and maintenance of the American College of Radiology (ACR) Appropriateness Criteria (AC) makes it a unique resource for evidence-based clinical imaging decision support, but it is underutilized by clinicians. To facilitate the use of imaging recommendations, we develop a natural language processing (NLP) search algorithm that automatically matches clinical indications that physicians write into imaging orders to appropriate AC imaging recommendations. Methods We apply a hybrid model of semantic similarity from a sent2vec model trained on 223 million scientific sentences, combined with term frequency inverse document frequency features. AC documents are ranked based on their embeddings’ cosine distance to query. For model testing, we compiled a dataset of simulated simple and complex indications for each AC document (n = 410) and another with clinical indications from randomly sampled radiology reports (n = 100). We compare our algorithm to a custom google search engine. Results On the simulated indications, our algorithm ranked ground truth documents as top 3 for 98% of simple queries and 85% of complex queries. Similarly, on the randomly sampled radiology report dataset, the algorithm ranked 86% of indications with a single match as top 3. Vague and distracting phrases present in the free-text indications were main sources of errors. Our algorithm provides more relevant results than a custom Google search engine, especially for complex queries. Conclusions We have developed and evaluated an NLP algorithm that matches clinical indications to appropriate AC guidelines. This approach can be integrated into imaging ordering systems for automated access to guidelines.
Published: 2022
Full Text: View/download PDF

6. Development and web deployment of an automated neuroradiology MRI protocoling tool with natural language processing

Author: Yeshwant Reddy Chillakuru, Shourya Munjal, Benjamin Laguna, Timothy L. Chen, Gunvant R. Chaudhari, Thienkhai Vu, Youngho Seo, Jared Narvid, and Jae Ho Sohn
Subjects: Natural language processing, Protocol, Automation, Neuroimaging, Computer applications to medicine. Medical informatics, R858-859.7
Abstract: Abstract Background A systematic approach to MRI protocol assignment is essential for the efficient delivery of safe patient care. Advances in natural language processing (NLP) allow for the development of accurate automated protocol assignment. We aim to develop, evaluate, and deploy an NLP model that automates protocol assignment, given the clinician indication text. Methods We collected 7139 spine MRI protocols (routine or contrast) and 990 head MRI protocols (routine brain, contrast brain, or other) from a single institution. Protocols were split into training (n = 4997 for spine MRI; n = 839 for head MRI), validation (n = 1071 for spine MRI, fivefold cross-validation used for head MRI), and test (n = 1071 for spine MRI; n = 151 for head MRI) sets. fastText and XGBoost were used to develop 2 NLP models to classify spine and head MRI protocols, respectively. A Flask-based web app was developed to be deployed via Heroku. Results The spine MRI model had an accuracy of 83.38% and a receiver operator characteristic area under the curve (ROC-AUC) of 0.8873. The head MRI model had an accuracy of 85.43% with a routine brain protocol ROC-AUC of 0.9463 and contrast brain protocol ROC-AUC of 0.9284. Cancer, infectious, and inflammatory related keywords were associated with contrast administration. Structural anatomic abnormalities and stroke/altered mental status were indicative of routine spine and brain MRI, respectively. Error analysis revealed increasing the sample size may improve performance for head MRI protocols. A web version of the model is provided for demonstration and deployment. Conclusion We developed and web-deployed two NLP models that accurately predict spine and head MRI protocol assignment, which could improve radiology workflow efficiency.
Published: 2021
Full Text: View/download PDF

7. High precision localization of pulmonary nodules on chest CT utilizing axial slice number labels

Author: Yeshwant Reddy Chillakuru, Kyle Kranen, Vishnu Doppalapudi, Zhangyuan Xiong, Letian Fu, Aarash Heydari, Aditya Sheth, Youngho Seo, Thienkhai Vu, and Jae Ho Sohn
Subjects: Lung nodule, Lung cancer, Nodule detection, Deep learning, Machine learning, Medical technology, R855-855.5
Abstract: Abstract Background Reidentification of prior nodules for temporal comparison is an important but time-consuming step in lung cancer screening. We develop and evaluate an automated nodule detector that utilizes the axial-slice number of nodules found in radiology reports to generate high precision nodule predictions. Methods 888 CTs from Lung Nodule Analysis were used to train a 2-dimensional (2D) object detection neural network. A pipeline of 2D object detection, 3D unsupervised clustering, false positive reduction, and axial-slice numbers were used to generate nodule candidates. 47 CTs from the National Lung Cancer Screening Trial (NLST) were used for model evaluation. Results Our nodule detector achieved a precision of 0.962 at a recall of 0.573 on the NLST test set for any nodule. When adjusting for unintended nodule predictions, we achieved a precision of 0.931 at a recall 0.561, which corresponds to 0.06 false positives per CT. Error analysis revealed better detection of nodules with soft tissue attenuation compared to ground glass and undeterminable attenuation. Nodule margins, size, location, and patient demographics did not differ between correct and incorrect predictions. Conclusions Utilization of axial-slice numbers from radiology reports allowed for development of a lung nodule detector with a low false positive rate compared to prior feature-engineering and machine learning approaches. This high precision nodule detector can reduce time spent on reidentification of prior nodules during lung cancer screening and can rapidly develop new institutional datasets to explore novel applications of computer vision in lung cancer imaging.
Published: 2021
Full Text: View/download PDF

8. Longitudinal Analysis of the Effect of Repeated Transarterial Chemoembolization for Liver Cancer on Portal Venous Pressure

Author: Constantine Frangakis, Jae Ho Sohn, Ahmet Bas, Julius Chapiro, Ruediger E. Schernthaner, MingDe Lin, James P. Hamilton, Timothy M. Pawlik, Kelvin Hong, and Rafael Duran
Subjects: portal hypertension, longitudinal data analysis, transarterial chemoembolization, TACE, hepatocellular carcinoma, HCC, Neoplasms. Tumors. Oncology. Including cancer and carcinogens, RC254-282
Abstract: ObjectivesInvestigate long-term effects of repeated transarterial chemoembolization (TACE) on portal venous pressure (PVP) using non-invasive surrogate markers of portal hypertension.MethodsRetrospective, Institutional Review Board-approved study. 99 patients [hepatocellular carcinoma (HCC) group (n=57); liver metastasis group (n=42)] who underwent 279TACEs and had longitudinal pre-/post-therapy contrast-enhanced-MRI (n=388) and complete blood work were included. Outcomes of interest were platelet count (PC), spleen volume, ascites and portosystemic collaterals. Variables included TACE type/number, tumor type, microcatheter location, Child-Pugh, baseline tumor burden (tumor number/total/largest size), vessel invasion, alpha-fetoprotein, Eastern Cooperative Oncology Group (ECOG) performance status, and Model for End-Stage Liver Disease (MELD) score. Generalized Estimating Equations assessed the associations between TACE and outcomes. Power analysis determined the sample size was sufficient.ResultsNo significant change in PC over time was observed in either groups, regardless of liver function (P>0.05). Baseline spleen volume was 226 cm3 for metastatic group, and was larger by 204 cm3 for HCC group (P
Published: 2021
Full Text: View/download PDF

9. Renal Cell Carcinoma Metastatic to the Liver: Early Response Assessment after Intraarterial Therapy Using 3D Quantitative Tumor Enhancement Analysis

Author: Florian Nima Fleckenstein, Rüdiger Egbert Schernthaner, Rafael Duran, Jae Ho Sohn, Sonia Sahu, Karen Marshall, MingDe Lin, Bernhard Gebauer, Julius Chapiro, Riad Salem, and Jean-François Geschwind
Subjects: Neoplasms. Tumors. Oncology. Including cancer and carcinogens, RC254-282
Abstract: PURPOSE: Liver metastases from renal cell carcinoma (RCC) are not uncommon in the course of disease. However, data about tumor response to intraarterial therapy (IAT) are scarce. This study assessed whether changes of enhancing tumor volume using quantitative European Association for the Study of the Liver (qEASL) on magnetic resonance imaging (MRI) and computed tomography (CT) can evaluate tumor response and predict overall survival (OS) early after therapy. METHODS AND MATERIALS: Fourteen patients with liver metastatic RCC treated with IAT (transarterial chemoembolization: n= 9 and yttrium-90: n= 5) were retrospectively included. All patients underwent contrast-enhanced imaging (MRI: n= 10 and CT: n= 4) 3 to 4 weeks pre- and posttreatment. Response to treatment was evaluated on the arterial phase using Response Evaluation Criteria in Solid Tumors (RECIST), World Health Organization, modified RECIST, EASL, tumor volume, and qEASL. Paired t test was used to compare measurements pre- and post-IAT. Patients were stratified into responders (≥65% decrease in qEASL) and nonresponders (
Published: 2016
Full Text: View/download PDF

10. Class 3 semaphorin mediates dendrite growth in adult newborn neurons through Cdk5/FAK pathway.

Author: Teclise Ng, Jae Ryun Ryu, Jae Ho Sohn, Terence Tan, Hongjun Song, Guo-Li Ming, and Eyleen L K Goh
Subjects: Medicine, Science
Abstract: Class 3 semaphorins are well-known axonal guidance cues during the embryonic development of mammalian nervous system. However, their activity on postnatally differentiated neurons in neurogenic regions of adult brains has not been characterized. We found that silencing of semaphorin receptors neuropilins (NRP) 1 or 2 in neural progenitors at the adult mouse dentate gyrus resulted in newly differentiated neurons with shorter dendrites and simpler branching in vivo. Tyrosine phosphorylation (Tyr 397) and serine phosphorylation (Ser 732) of FAK were essential for these effects. Semaphorin 3A and 3F mediate serine phosphorylation of FAK through the activation of Cdk5. Silencing of either Cdk5 or FAK in newborn neurons phenocopied the defects in dendritic development seen upon silencing of NRP1 or NRP2. Furthermore, in vivo overexpression of Cdk5 or FAK rescued the dendritic phenotypes seen in NRP1 and NRP2 deficient neurons. These results point to a novel role for class 3 semaphorins in promoting dendritic growth and branching during adult hippocampal neurogenesis through the activation of Cdk5-FAK signaling pathway.
Published: 2013
Full Text: View/download PDF

11. Conformal Language Modeling.

Author: Victor Quach, Adam Fisch, Tal Schuster, Adam Yala, Jae Ho Sohn, Tommi S. Jaakkola, and Regina Barzilay
Published: 2024

12. Machine learning to predict incident radiographic knee osteoarthritis over 8 Years using combined MR imaging features, demographics, and clinical factors: data from the Osteoarthritis Initiative

Author: Thomas M. Link, Michael C. Nevitt, Charles E. McCulloch, Jae Ho Sohn, and Gabby B. Joseph
Subjects: Male, Time Factors, Knee Joint, Demographics, Radiography, Biomedical Engineering, Osteoarthritis, Meniscus (anatomy), Machine learning, computer.software_genre, Article, Machine Learning, Rheumatology, Predictive Value of Tests, medicine, Humans, Orthopedics and Sports Medicine, Clinical significance, Aged, Retrospective Studies, medicine.diagnostic_test, business.industry, Cartilage, Magnetic resonance imaging, Middle Aged, Osteoarthritis, Knee, medicine.disease, Magnetic Resonance Imaging, Mr imaging, medicine.anatomical_structure, Female, Artificial intelligence, business, computer
Abstract: Summary Objective To develop a machine learning-based prediction model for incident radiographic osteoarthritis (OA) of the knee over 8 years using MRI-based cartilage biochemical composition and knee joint structure, demographics, and clinical predictors including muscle strength and symptoms. Design Individuals (n=1044) with baseline Kellgren Lawrence (KL) grade 0-1 in the right knee from the Osteoarthritis Initiative database were analyzed. 3T MRI at baseline was used to quantify knee cartilage T2, and Whole-Organ Magnetic Resonance Imaging Scores (WORMS) were obtained for cartilage, meniscus, and bone marrow. The outcome was set as true if a subject developed KL grade 2-4 OA in the right knee over 8 years (n=183) and false if the subject remained at KL 0-1 over 8 years (n=861). We developed and compared three models: Model 1: 112 predictors based on OA risk factors; Model 2: top ten predictors based on feature importance score from Model 1 and clinical relevance; Model 3: Model 2 without the imaging predictors. We compared the models using the area under the R OC curve derived from hold-out data. Results The 10-predictor model (Model 2, that includes cartilage and meniscus WORMS scores and cartilage T2) had a slightly lower AUC (0.772) compared to the model with 112 predictors (Model 1: AUC=0.792, p=0.739); and had a significantly higher AUC compared to the model without MR imaging predictors (Model 3, AUC=0.669, p=0.011). Conclusions A 10-predictor model including MRI parameters coupled with demographics, symptoms, muscle, and physical activity scores provides good prediction of incident radiographic OA over 8 years.
Published: 2022

13. Semantic Retrieval of Similar Radiological Images using Vision Transformers

Author: Anjali Thakrar, Michael Jayasuriya, Adrian Serapio, Xiao Wu, Eric Davis, Jamie Schroeder, Maya Vella, and Jae Ho Sohn
Abstract: BackgroundIdentifying visually and semantically similar radiological images in a database can facilitate the creation of decision support tools, teaching files, and research cohorts. Existing content-based image retrieval tools are often limited to searching by pixel-wise difference or vector distance of model predictions. Vision transformers (ViT) use attention to simultaneously take into account radiological diagnosis and visual appearance.PurposeWe aim to develop a ViT-based image retrieval framework and evaluate the algorithm on NIH Chest Radiographs (CXR) and NLST Chest CTs.Materials and MethodsThe model was trained on 112,120 CXR and 111,955 CT images. For CXR, a ViT binary classifier was trained on 4 ground truth labels (Cardiomegaly, Opacity, Emphysema, No Finding) and ensembled to produce multilabel classifications for each CXR. For CT, a regression model was trained to minimize L1 loss on the continuous ground truth labels of patient weight. The ViT image embedding layer was treated as a global image descriptor, using the L2 distance between descriptors as a similarity measure. To qualitatively evaluate the model, five radiologists performed a reader performance study with random query images (25 CT, 25 CXR). For each image, they chose the 5 most similar images from a set of 10 images (the 5 closest and 5 furthest images from the query in model space). Inter-radiologist and radiologist-model agreement statistics were calculated.ResultsThe CXR model achieved nDCG@5 of 0.73 (pConclusionOur ViT architecture retrieved visually and semantically similar radiological images.Summary StatementThis study evaluates the efficacy of using ViT based image embeddings for CBIR tasks for CXR and CT images, finding that it performs well on visual and semantic recognition tasks.Key ResultsThe CXR model achieved nDCG@5 of 0.73 (pThe CT model achieved nDCG of 16.85 (pInter-radiologist Fleiss Kappa of 0.51 and radiologist consensus to model Cohen’s Kappa of 0.65 were observed.
Published: 2023

14. Underrecognition and Suboptimal Quality of Care for Nonalcoholic Fatty Liver Disease Cirrhosis in Primary Care Patients with Diabetes Mellitus

Author: Janet N. Chu, Max L. Goldman, Danielle Brandman, Jae Ho Sohn, Kendall B. Islam, Lauren Ross, and Rena K. Fox
Published: 2023

15. Federated learning for predicting clinical outcomes in patients with COVID-19

Author: Jiahui Guan, Krishna Juluru, Yothin Rakvongthai, Benjamin S. Glicksberg, Watsamon Jantarabenjakul, Li-Chen Fu, Mike Fralick, Anthony Costa, Quanzheng Li, Andrew Feng, Eric K. Oermann, Joshua D. Kaggie, Xihong Lin, Pedro Mário Cruz e Silva, Deepeksha Bhatia, Byung Seok Kim, Hitoshi Mori, Pablo F. Damasceno, Peiying Ruan, Yuhong Wen, Hao-Hsin Shin, Amilcare Gentili, Weichung Wang, Chiu-Ling Lai, Jason C. Crane, Andrew N. Priest, Soo-Young Park, Peerapon Vateekul, Matheus Ribeiro Furtado de Mendonça, Gustavo César de Antônio Corradi, Griffin Lacey, Meena AbdelMaseeh, Yu Rim Lee, Tatsuya Kodama, Pierre Elnajjar, Krishna Nand Keshava Murthy, Xiang Li, Evan Leibovitz, Vitor Lavor, Christopher P. Hess, Colin B. Compas, Stefan Gräf, Masoom A. Haider, Daguang Xu, Nicola Rieke, Thanyawee Puthanakit, Sarah E Hickman, Hui Ren, Marcio Aloisio Bezerra Cavalcanti Rockenbach, Jung Gil Park, Jesse Tetreault, Hisashi Sasaki, Min Kyu Kang, Won Young Tak, Chun-Nan Hsu, Fiona J. Gilbert, Chin Lin, Varun Buch, Felipe Kitamura, Tony Mazzulli, Eddie Huang, Abood Quraini, Shelley McLeod, Young Joon Kwon, Gustavo Nino, Dufan Wu, Chien-Sung Tsai, Mona Flores, Baris Turkbey, Sira Sriswasdi, Pochuan Wang, Mohammad Adil, Aoxiao Zhong, Chih-Hung Wang, Sheng Xu, C. K. Lee, Isaac Yang, Marius George Linguraru, Holger R. Roth, Chia-Jung Hsu, Anas Z. Abidin, Thomas M. Grist, Hirofumi Obinata, Sheridan Reed, Andrew Liu, Ahmed Harouni, Natalie Gangai, Ittai Dayan, Kristopher Kersten, Stephanie Harmon, Jae Ho Sohn, John Garrett, Bradford J. Wood, Sharmila Majumdar, Bernardo Bizzo, Shuichi Kawano, Keith J. Dreyer, Carlos Tor-Díez, and Chia-Cheng Lee
Subjects: medicine.medical_specialty, Information privacy, Coronavirus disease 2019 (COVID-19), Computer science, business.industry, Vital signs, General Medicine, General Biochemistry, Genetics and Molecular Biology, Data sharing, Data exchange, Health care, medicine, Generalizability theory, Medical physics, In patient, business
Abstract: Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC) >0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site’s data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare. Federated learning, a method for training artificial intelligence algorithms that protects data privacy, was used to predict future oxygen requirements of symptomatic patients with COVID-19 using data from 20 different institutes across the globe.
Published: 2021

16. Emphysema Quantification and Severity Classification with 3-Dimensional Averaging Kernel and Airways Removal

Author: Jianxiang Zhang, Gunvant R. Chaudhari, Masha Bonderenko, and Jae Ho Sohn
Abstract: BackgroundEmphysema is a common pulmonary pathology known to be associated with increased risk of lung cancer and lung biopsy complications. Prevailing quantitation method of calculating voxel-wise percentage of low attenuation area (LAA) of lung tissue from CT scans is prone to noise and error due overcounting of single voxel LAA and incomplete segmentation of airways.PurposeWe aim to develop an accurate algorithm to quantitatively measure emphysema and classify its severity..Methods and MaterialsTwo chest CT datasets were obtained from two tertiary hospitals as training and external validation datasets. Exclusion criteria included any patients whose emphysema extent was not specified by the accompanying report. The training dataset included 722 patients, and the validation dataset included 1006 patients. Following lung segmentation and airways removal, we applied convolution of the segmented lung with averaging kernels of different sizes in 2D and 3D. Cutoffs between “none,” “mild to moderate,” and “severe” emphysema were determined via weighted logistic regression on the training dataset, and the categorical emphysema extent was obtained for each patient. The main measure for evaluating model performance was area under the curve (AUC) of the receiver operating characteristic (ROC) on the training dataset and accuracy of classification on both the training and the validation dataset. The 1×1×1 kernel, which is equivalent to the traditional LAA score, was used for comparison to other kernels for performance evaluation.ResultsThe best model used a 3D 3×3×3 kernel for average filtering with airways post processing and achieved a mean AUC of 0.782 and 0.985 for “none”-versus-rest and “severe”-versus-rest classifications respectively. It achieved a 0.676 and 0.757 multiclass classification accuracy on the training and validation dataset respectively.Conclusions and RelevanceWe present an automated pipeline that can achieve accurate emphysema quantification and severity classification. We showed that convolving the segmented lung with a 3D 3×3×3 kernel and post-processing to remove airways can reliably quantify emphysema.
Published: 2022

17. Development and web deployment of an automated neuroradiology MRI protocoling tool with natural language processing

Author: Benjamin Laguna, Youngho Seo, Thienkhai Vu, Shourya Munjal, Jae Ho Sohn, Gunvant R. Chaudhari, Jared Narvid, Timothy L. Chen, and Yeshwant Chillakuru
Subjects: Computer science, Radiology workflow, Clinical Sciences, Computer applications to medicine. Medical informatics, R858-859.7, Health Informatics, Neuroimaging, computer.software_genre, Mri model, Workflow, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, Automation, 0302 clinical medicine, medicine, Protocol, Humans, 030212 general & internal medicine, Single institution, Stroke, Neuroradiology, Protocol (science), Receiver operating characteristic, business.industry, Research, Health Policy, Natural language processing, Neurosciences, medicine.disease, Magnetic Resonance Imaging, Brain Disorders, Computer Science Applications, Radiography, Networking and Information Technology R&D, Good Health and Well Being, Biomedical Imaging, Artificial intelligence, Radiology, business, computer, Medical Informatics, Information Systems
Abstract: Background A systematic approach to MRI protocol assignment is essential for the efficient delivery of safe patient care. Advances in natural language processing (NLP) allow for the development of accurate automated protocol assignment. We aim to develop, evaluate, and deploy an NLP model that automates protocol assignment, given the clinician indication text. Methods We collected 7139 spine MRI protocols (routine or contrast) and 990 head MRI protocols (routine brain, contrast brain, or other) from a single institution. Protocols were split into training (n = 4997 for spine MRI; n = 839 for head MRI), validation (n = 1071 for spine MRI, fivefold cross-validation used for head MRI), and test (n = 1071 for spine MRI; n = 151 for head MRI) sets. fastText and XGBoost were used to develop 2 NLP models to classify spine and head MRI protocols, respectively. A Flask-based web app was developed to be deployed via Heroku. Results The spine MRI model had an accuracy of 83.38% and a receiver operator characteristic area under the curve (ROC-AUC) of 0.8873. The head MRI model had an accuracy of 85.43% with a routine brain protocol ROC-AUC of 0.9463 and contrast brain protocol ROC-AUC of 0.9284. Cancer, infectious, and inflammatory related keywords were associated with contrast administration. Structural anatomic abnormalities and stroke/altered mental status were indicative of routine spine and brain MRI, respectively. Error analysis revealed increasing the sample size may improve performance for head MRI protocols. A web version of the model is provided for demonstration and deployment. Conclusion We developed and web-deployed two NLP models that accurately predict spine and head MRI protocol assignment, which could improve radiology workflow efficiency.
Published: 2021

18. Application of a Domain-specific BERT for Detection of Speech Recognition Errors in Radiology Reports

Author: Gunvant R. Chaudhari, Tengxiao Liu, Timothy L. Chen, Gabby B. Joseph, Maya Vella, Yoo Jin Lee, Thienkhai H. Vu, Youngho Seo, Andreas M. Rauschecker, Charles E. McCulloch, and Jae Ho Sohn
Subjects: Radiological and Ultrasound Technology, Artificial Intelligence, Technology Assessment, Radiology, Nuclear Medicine and imaging, Computer Applications, Original Research
Abstract: PURPOSE: To develop radiology domain–specific bidirectional encoder representations from transformers (BERT) models that can identify speech recognition (SR) errors and suggest corrections in radiology reports. MATERIALS AND METHODS: A pretrained BERT model, Clinical BioBERT, was further pretrained on a corpus of 114 008 radiology reports between April 2016 and August 2019 that were retrospectively collected from two hospitals. Next, the model was fine-tuned on a training dataset of generated insertion, deletion, and substitution errors, creating Radiology BERT. This model was retrospectively evaluated on an independent dataset of radiology reports with generated errors (n = 18 885) and on unaltered report sentences (n = 2000) and prospectively evaluated on true clinical SR errors (n = 92). Correction Radiology BERT was separately trained to suggest corrections for detected deletion and substitution errors. Area under the receiver operating characteristic curve (AUC) and bootstrapped 95% CIs were calculated for each evaluation dataset. RESULTS: Radiology-specific BERT had AUC values of >.99 (95% CI: >0.99, >0.99), 0.94 (95% CI: 0.93, 0.94), 0.98 (95% CI: 0.98, 0.98), and 0.97 (95% CI: 0.97, 0.97) for detecting insertion, deletion, substitution, and all errors, respectively, on the independently generated test set. Testing on unaltered report impressions revealed a sensitivity of 82% (28 of 34; 95% CI: 70%, 93%) and specificity of 88% (1521 of 1728; 95% CI: 87%, 90%). Testing on prospective SR errors showed an accuracy of 75% (69 of 92; 95% CI: 65%, 83%). Finally, the correct word was the top suggestion for 45.6% (475 of 1041; 95% CI: 42.5%, 49.3%) of errors. CONCLUSION: Radiology-specific BERT models fine-tuned on generated errors were able to identify SR errors in radiology reports and suggest corrections. Keywords: Computer Applications, Technology Assessment Supplemental material is available for this article. © RSNA, 2022 See also the commentary by Abajian and Cheung in this issue.
Published: 2022

19. CT Features of Benign Intrapulmonary Lymph Nodes in Pediatric Patients With Known Extrapulmonary Solid Malignancy

Author: Joo Y Cho, Jae Ho Sohn, Abbey J. Winant, Sara O. Vargas, and Edward Y. Lee
Subjects: Male, medicine.medical_specialty, Lung Neoplasms, Adolescent, Intrapulmonary lymph nodes, Malignancy, 030218 nuclear medicine & medical imaging, Surgical pathology, 03 medical and health sciences, 0302 clinical medicine, medicine, Humans, Radiology, Nuclear Medicine and imaging, Child, Lung, Lymph node, Retrospective Studies, Kappa value, business.industry, Infant, General Medicine, medicine.disease, medicine.anatomical_structure, Child, Preschool, Lymphatic Metastasis, 030220 oncology & carcinogenesis, Female, Lymph Nodes, Radiology, Lymph, Tomography, X-Ray Computed, business, Calcification
Abstract: OBJECTIVE. The purpose of our study was to determine the CT features of benign intrapulmonary lymph nodes in pediatric patients with known extrapulmonary solid malignancy. MATERIALS AND METHODS. A retrospective review of surgical pathology archives was performed to identify consecutive chest CT studies of pediatric patients (≤ 18 years) with extrapulmonary solid malignancy and histologically confirmed benign intrapulmonary lymph nodes between January 1, 2004, and March 15, 2020. CT features of intrapulmonary lymph nodes-including size, shape, margin, type, associated calcification or fat, and location-were independently evaluated by two pediatric radiologist reviewers. The CT features of benign intrapulmonary lymph nodes in pediatric patients were analyzed using summary statistics. Interobserver agreement was measured with the kappa coefficient. RESULTS. There were 36 pathology-confirmed benign intrapulmonary lymph nodes in 27 pediatric patients (18 boys and nine girls; mean age, 12 years; age range, 1-18.2 years). Twenty-three (63.9%) of the benign intrapulmonary lymph nodes were biopsied from the right lung and 13 (36.1%) from the left lung (p = .03). The mean size, determined from CT studies, of benign intrapulmonary lymph nodes was 3.6 mm (SD, 1.4 mm; range, 1.3-7.8 mm). Triangular shape (25/36, 69.4%) was the most common shape of the benign intrapulmonary lymph nodes. Less commonly seen shapes of benign intrapulmonary lymph nodes were oval (6/36, 16.7%), round (3/36, 8.3%), and trapezoidal (2/36, 5.6%). All benign intrapulmonary lymph nodes were smoothly marginated and solid without associated calcification or fat. Of the 36 benign intrapulmonary lymph nodes, 15 (41.7%) were pleura-based; 11 (30.6%), perifissural; and 10 (27.8%), parenchymal. The kappa value for interobserver agreement between the two reviewers was 0.917 (95% CI, 0.825-1.000; standard error, 0.047), which corresponds to near-perfect agreement. CONCLUSION. In pediatric patients with known extrapulmonary solid malignancy, benign intrapulmonary lymph nodes are subcentimeter (mean size, 3.6 mm), smoothly marginated, and solid without containing calcification or fat on CT. In particular, triangular shape was the most commonly encountered shape of a benign intrapulmonary lymph node.
Published: 2021

20. Prediction of future healthcare expenses of patients from chest radiographs using deep learning: a pilot study

Author: Benjamin L. Franc, Thienkhai Vu, Jaewon Yang, Yixin Chen, Jae Ho Sohn, Dexter Hadley, Youngho Seo, Dmytro Lituiev, and Karen G. Ordovas
Subjects: medicine.medical_specialty, Multidisciplinary, business.industry, Radiography, Deep learning, Pilot Projects, Deep Learning, Good Health and Well Being, ROC Curve, Clinical Research, Health care, medicine, Humans, Medical physics, Artificial intelligence, business, Delivery of Health Care, Retrospective Studies
Abstract: Our objective was to develop deep learning models with chest radiograph data to predict healthcare costs and classify top-50% spenders. 21,872 frontal chest radiographs were retrospectively collected from 19,524 patients with at least 1-year spending data. Among the patients, 11,003 patients had 3 years of cost data, and 1678 patients had 5 years of cost data. Model performances were measured with area under the receiver operating characteristic curve (ROC-AUC) for classification of top-50% spenders and Spearman ρ for prediction of healthcare cost. The best model predicting 1-year (N=21,872) expenditure achieved ROC-AUC of 0.806 [95% CI, 0.793-0.819] for top-50% spender classification and ρ of 0.561 [0.536-0.586] for regression. Similarly, for predicting 3-year (N=12,395) expenditure, ROC-AUC of 0.771 [0.750-0.794] and ρ of 0.524 [0.489-0.559]; for predicting 5-year (N=1,779) expenditure ROC-AUC of 0.729 [0.667-0.729] and ρ of 0.424 [0.324-0.529]. Our deep learning model demonstrated the feasibility of predicting health care expenditure as well as classifying top 50% healthcare spenders at 1, 3, and 5 year(s), implying the feasibility of combining deep learning with information-rich imaging data to uncover hidden associations that may allude physicians. Such a model can be a starting point of making an accurate budget in reimbursement models in healthcare industries.
Published: 2022

21. Using Radiomics for Risk Stratification: Where We Need to Go

Author: Jae Ho Sohn and William Hsu
Subjects: Male, Lung Neoplasms, business.industry, Middle Aged, Data science, Risk Assessment, Tumor Burden, Survival Rate, Reviews and Commentary, Radiomics, Carcinoma, Non-Small-Cell Lung, Risk stratification, Biomarkers, Tumor, Medicine, Humans, Radiology, Nuclear Medicine and imaging, Female, business, Tomography, X-Ray Computed, Aged, Neoplasm Staging
Abstract: Background Radiomics-based biomarkers enable the prognostication of resected non-small cell lung cancer (NSCLC), but their effectiveness in clinical stage and pathologic stage IA pure-solid tumors requires further determination. Purpose To construct an efficient radiomics signature for survival risk stratification personalized for patients with clinical stage and pathologic stage IA pure-solid NSCLC. Materials and Methods In this retrospective study, six radiomics signatures were constructed for patients with stage IA pure-solid NSCLC who underwent resection between January 2011 and December 2013 at authors' institution and were tested in the radiogenomics data set. The radiomics features were extracted from the intratumoral two-dimensional region, three-dimensional volume, and peritumoral area using PyRadiomics. The discriminative abilities of the signatures were quantified using the area under the time-dependent receiver operating characteristic curve (AUC), and the optimal signature was selected for patient stratification. Results The study included 592 patients with stage IA pure-solid NSCLC (median age, 61 years; interquartile range, 55-66 years; 269 women) for radiomics analysis: 381 patients for training, 163 for internal validation, and 48 for external validation. The radiomics signature combining three-region features yielded the highest 3- and 5-year AUCs of 0.77 and 0.78, respectively, in the internal validation set and 0.76 and 0.75, respectively, in the external validation set. Multivariable analysis suggested that the radiomics signature remained an independent prognostic factor (hazard ratio, 6.2; 95% CI: 3.5, 11.0
Published: 2021

22. AI MSK clinical applications: cartilage and osteoarthritis

Author: Gabby B. Joseph, Thomas M. Link, Sharmila Majumdar, Valentina Pedoia, Jae Ho Sohn, and Charles E. McCulloch
Subjects: Cartilage, Articular, Knee Joint, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Osteoarthritis, Machine learning, computer.software_genre, Artificial Intelligence, Medicine, Humans, Radiology, Nuclear Medicine and imaging, Segmentation, Contextual image classification, business.industry, Cartilage, Deep learning, Reproducibility of Results, Osteoarthritis, Knee, Precision medicine, medicine.disease, Magnetic Resonance Imaging, medicine.anatomical_structure, Workflow, Applications of artificial intelligence, Artificial intelligence, business, computer
Abstract: The advancements of artificial intelligence (AI) for osteoarthritis (OA) applications have been rapid in recent years, particularly innovations of deep learning for image classification, lesion detection, cartilage segmentation, and prediction modeling of future knee OA development. This review article focuses on AI applications in OA research, first describing machine learning (ML) techniques and workflow, followed by how these algorithms are used for OA classification tasks through imaging and non-imaging-based ML models. Deep learning applications for OA research, including analysis of both radiographs for automatic detection of OA severity, and MR images for detection of cartilage/meniscus lesions and cartilage segmentation for automatic T2 quantification will be described. In addition, information on ML models that identify individuals at high risk of OA development will be provided. The future vision of machine learning applications in imaging of OA and cartilage hinges on implementation of AI for optimizing imaging protocols, quantitative assessment of cartilage, and automated analysis of disease burden yielding a faster and more efficient workflow for a radiologist with a higher level of reproducibility and precision. It may also provide risk assessment tools for individual patients, which is an integral part of precision medicine.
Published: 2021

23. Development and performance comparison with radiologists of a multitask deep learning model for severity grading of hip osteoarthritis features on radiographs

Author: Pia M. Jungmann, Valentina Pedoia, Sarah C. Foreman, Lorenzo Nardo, Eugene Ozhinsky, Michael C. Nevitt, Magdalena Posadzy, Thomas M. Link, C.E. von Schacky, Felix Liu, and Jae Ho Sohn
Subjects: medicine.medical_specialty, business.industry, Deep learning, Radiography, Biomedical Engineering, Severity grading, Rheumatology, Performance comparison, Hip osteoarthritis, Physical therapy, Medicine, Orthopedics and Sports Medicine, Artificial intelligence, business
Published: 2020

24. Papillary and sclerosing lesions of the breast detected and biopsied by MRI: Clinical management, upgrade rate, and association with apocrine metaplasia

Author: Jesse S Bond, Eduardo Cervantes, Jonathan D. Marotti, Roberta M. diFlorio-Alexander, Kari M. Rosenkranz, Natalie Y. Ring, and Jae Ho Sohn
Subjects: Adult, Image-Guided Biopsy, Pathology, medicine.medical_specialty, Radial scar, Lobular carcinoma, Scars, Breast Neoplasms, 030218 nuclear medicine & medical imaging, Lesion, Breast Diseases, 03 medical and health sciences, 0302 clinical medicine, Internal Medicine, Atypia, Carcinoma, Humans, Medicine, Mammary Glands, Human, Aged, Retrospective Studies, Aged, 80 and over, Sclerosis, business.industry, Apocrine, Middle Aged, medicine.disease, Magnetic Resonance Imaging, Carcinoma, Papillary, Carcinoma, Intraductal, Noninfiltrating, Oncology, 030220 oncology & carcinogenesis, Papilloma, Female, Surgery, medicine.symptom, business, Follow-Up Studies
Abstract: Benign papillary and sclerosing lesions of the breast (intraductal papillomas, complex sclerosing lesions, radial scars) are considered high-risk lesions due to the potential for upgrade to carcinoma on subsequent surgical excision. Optimal clinical management of such lesions remains unclear due to variable reported upgrade rates. Apocrine metaplasia is a common finding in breast tissue and its role in MRI enhancing lesions is increasingly being recognized. The purpose of this study was to investigate the MRI features of papillary and sclerosing lesions of the breast, evaluate the clinical management and upgrade rate of such lesions, and examine the contribution of apocrine metaplasia to the imaging findings. A 13-year retrospective review of MRI-guided biopsies identified 70 MRI-detected and -biopsied papillary and sclerosing lesions. Sixteen lesions without atypia underwent surgical excision; only one case (6%) was upgraded to pleomorphic lobular carcinoma in situ. The majority (64%) of biopsies contained apocrine metaplasia either within or adjacent to the targeted lesion. We found that half of MRI-detected lesions had T2 hyperintense foci (2-5 mm) or masses (>5 mm) adjacent to the lesion. Histologic correlation showed apocrine cysts were likely responsible for this imaging finding in 56% of these cases.
Published: 2019

25. High precision localization of pulmonary nodules on chest CT utilizing axial slice number labels

Author: Jae Ho Sohn, Aditya Sheth, Vishnu Doppalapudi, Thienkhai Vu, Kyle Kranen, Youngho Seo, Zhangyuan Xiong, Aarash Heydari, Letian Fu, and Yeshwant Chillakuru
Subjects: Male, medicine.medical_specialty, lcsh:Medical technology, Lung Neoplasms, Neural Networks, Computer science, Clinical Sciences, Bioengineering, Machine Learning, Computer, Lung nodule, medicine, False positive paradox, Humans, Radiology, Nuclear Medicine and imaging, False Positive Reactions, Lung cancer, Tomography, Lung, Cancer, Original Research, Lung Cancer, Slice Number, Detector, Nodule detection, Solitary Pulmonary Nodule, Nodule (medicine), Deep learning, Middle Aged, medicine.disease, Object detection, X-Ray Computed, Tumor Burden, Nuclear Medicine & Medical Imaging, Good Health and Well Being, lcsh:R855-855.5, Biomedical Imaging, Female, Radiology, False positive rate, Neural Networks, Computer, medicine.symptom, Tomography, X-Ray Computed, Lung cancer screening
Abstract: BackgroundReidentification of prior nodules for temporal comparison is an important but time-consuming step in lung cancer screening. We develop and evaluate an automated nodule detector that utilizes the axial-slice number of nodules found in radiology reports to generate high precision nodule predictions.Methods888 CTs from Lung Nodule Analysis were used to train a 2-dimensional (2D) object detection neural network. A pipeline of 2D object detection, 3D unsupervised clustering, false positive reduction, and axial-slice numbers were used to generate nodule candidates. 47 CTs from the National Lung Cancer Screening Trial (NLST) were used for model evaluation.ResultsOur nodule detector achieved a precision of 0.962 at a recall of 0.573 on the NLST test set for any nodule. When adjusting for unintended nodule predictions, we achieved a precision of 0.931 at a recall 0.561, which corresponds to 0.06 false positives per CT. Error analysis revealed better detection of nodules with soft tissue attenuation compared to ground glass and undeterminable attenuation. Nodule margins, size, location, and patient demographics did not differ between correct and incorrect predictions.ConclusionsUtilization of axial-slice numbers from radiology reports allowed for development of a lung nodule detector with a low false positive rate compared to prior feature-engineering and machine learning approaches. This high precision nodule detector can reduce time spent on reidentification of prior nodules during lung cancer screening and can rapidly develop new institutional datasets to explore novel applications of computer vision in lung cancer imaging.
Published: 2021

26. Algorithmic Prediction of Delayed Radiology Turn-Around-Time during Non-Business Hours

Author: Alex Rybkin, Thienkhai Vu, Jae Ho Sohn, Vaibhavi Shah, Yeshwant Chillakuru, and Youngho Seo
Subjects: medicine.medical_specialty, Receiver operating characteristic, business.industry, Retrospective cohort study, medicine.disease, Turnaround time, Confidence interval, 030218 nuclear medicine & medical imaging, Random forest, Pulmonary embolism, Machine Learning, Radiography, 03 medical and health sciences, 0302 clinical medicine, Business hours, ROC Curve, Feature (computer vision), 030220 oncology & carcinogenesis, Medicine, Humans, Radiology, Nuclear Medicine and imaging, Radiology, business, Retrospective Studies
Abstract: Rationale and Objectives Radiology turnaround time is an important quality measure that can impact hospital workflow and patient outcomes. We aimed to develop a machine learning model to predict delayed turnaround time during non-business hours and identify factors that contribute to this delay. Materials and Methods This retrospective study consisted of 15,117 CT cases from May 2018 to May 2019 during non-business hours at two hospital campuses after applying exclusion criteria. Of these 15,177 cases, 7,532 were inpatient cases and 7,585 were emergency cases. Order time, scan time, first communication by radiologist, free-text indications, and other clinical metadata were extracted. A combined XGBoost classifier and Random Forest natural language processing model was trained with 85% of the data and tested with 15% of the data. The model predicted two measures of delay: when the exam was ordered to first communication (total time) and when the scan was completed to first communication (interpretation time). The model was analyzed with the area under the curve (AUC) of receiver operating characteristic (ROC) and feature importance. Source code: https://bit.ly/2UrLiVJ Results The algorithm reached an AUC of 0.85, with a 95% confidence interval [0.83, 0.87], when predicting delays greater than 245 minutes for “total time” and 0.71, with a 95% confidence interval [0.68, 0.73], when predicting delays greater than 57 minutes for “interpretation time”. At our institution, CT scan description (e.g. "CTA chest pulmonary embolism protocol"), time of day, and year in training were more predictive features compared to body part, inpatient status, and hospital campus for both interpretation and total time delay. Conclusion This algorithm can be applied clinically when a physician is ordering the scan to reasonably predict delayed turnaround time. Such a model can be leveraged to identify factors associated with delays and emphasize areas for improvement to patient outcomes.
Published: 2021

27. Longitudinal Analysis of the Effect of Repeated Transarterial Chemoembolization for Liver Cancer on Portal Venous Pressure

Author: MingDe Lin, Julius Chapiro, Jae Ho Sohn, Ruediger E. Schernthaner, Rafael Duran, Timothy M. Pawlik, James P. Hamilton, Kelvin Hong, Constantine Frangakis, and Ahmet Bas
Subjects: Liver Cancer, medicine.medical_specialty, Cancer Research, Hepatocellular-Carcinoma, Complications, Transcatheter Arterial Embolization, Portal venous pressure, longitudinal data analysis, Oncology and Carcinogenesis, HCC, TACE, hepatocellular carcinoma, portal hypertension, transarterial chemoembolization, variceal bleeding, Gastroenterology, Liver disease, Rare Diseases, Clinical Research, Splenic Volume, Internal medicine, Esophageal-Varices, Ascites, medicine, Count/Spleen Diameter Ratio, RC254-282, Cancer, Ct, Original Research, Performance status, business.industry, Liver Disease, Hemodynamics, Neoplasms. Tumors. Oncology. Including cancer and carcinogens, medicine.disease, Cirrhosis, Oncology, Hepatocellular carcinoma, Hypertension, Portal hypertension, Liver function, medicine.symptom, Liver cancer, business, Digestive Diseases
Abstract: ObjectivesInvestigate long-term effects of repeated transarterial chemoembolization (TACE) on portal venous pressure (PVP) using non-invasive surrogate markers of portal hypertension.MethodsRetrospective, Institutional Review Board-approved study. 99 patients [hepatocellular carcinoma (HCC) group (n=57); liver metastasis group (n=42)] who underwent 279TACEs and had longitudinal pre-/post-therapy contrast-enhanced-MRI (n=388) and complete blood work were included. Outcomes of interest were platelet count (PC), spleen volume, ascites and portosystemic collaterals. Variables included TACE type/number, tumor type, microcatheter location, Child-Pugh, baseline tumor burden (tumor number/total/largest size), vessel invasion, alpha-fetoprotein, Eastern Cooperative Oncology Group (ECOG) performance status, and Model for End-Stage Liver Disease (MELD) score. Generalized Estimating Equations assessed the associations between TACE and outcomes. Power analysis determined the sample size was sufficient.ResultsNo significant change in PC over time was observed in either groups, regardless of liver function (P>0.05). Baseline spleen volume was 226 cm3 for metastatic group, and was larger by 204 cm3 for HCC group (P3 (95%CI: 8-32; P=0.001) for both groups after 1stTACE and by 16cm3/TACE (P=0.099) over the full follow-up (up to 9TACEs). Spleen volume also tended to increase by 23cm3 (95%CI: -1–48; P=0.064) with higher tumor burden. Odds of developing moderate/severe ascites for metastatic patients was decreased by 0.5 (95%CI: 0.3–0.9; P=0.014), regardless of the Child-Pugh, and increased by 1.5 (95%CI: 1.2–1.9; PP=0.008). PVP-related complications such as variceal bleeding post-TACE were low (0.4%).ConclusionRepeated TACEs did seem to have an impact on PVP. However, the increase in PVP had marginal effects with low portal hypertension-related complications.
Published: 2021

28. Federated Learning used for predicting outcomes in SARS-COV-2 patients

Author: Matheus Ribeiro Furtado de Mendonça, Evan Leibovitz, Kristopher Kersten, Mona Flores, John Garrett, Baris Turkbey, Pablo F. Damasceno, Masoom A. Haider, Fred Kwon, Soo-Young Park, Chun-Nan Hsu, Keith J. Dreyer, Chien-Sung Tsai, Tatsuya Kodama, Daguang Xu, Min Kyu Kang, Tony Mazzulli, Andrew Feng, C. K. Lee, Isaac Yang, Deepi Bhatia, Marius George Linguraru, Byung Seok Kim, Aoxiao Zhong, Mohammad Adil, Pochuan Wang, Sheridan Reed, Peerapon Vateekul, Anas Z. Abidin, Sira Sriswa, J. D. Kaggie, Chia-Cheng Lee, Carlos Tor-Díez, Krishna Juluru, Xiang Li, Colin B. Compas, Xihong Lin, Jiahui Guan, Pierre Elnajjar, Yuhong Wen, Jung Gil Park, Hao-Hsin Shin, Amilcare Gentili, Weichung Wang, Colleen Ruan, Hui Ren, Hisashi Sasaki, Hitoshi Mori, Holger R. Roth, Felipe Kitamura, Chiu-Ling Lai, Jason C. Crane, Thomas M. Grist, Bradford J. Wood, Bernardo Bizzo, Dufan Wu, Jesse Tetreault, Andrew N. Priest, Mike Fralick, Anthony Costa, Andrew Liu, Benjamin S. Glicksberg, Griffin Lacey, Meena Abdelmaseeh, Thanyawee Puthanakit, Marcio Aloisio Bezerra Cavalcanti Rockenbach, Shelley McLeod, Pedro Mário Cruz e Silva, Chih-Hung Wang, Chia-Jung Hsu, Sarah E Hickman, Won Young Tak, Quanzheng Li, Yothin Rakvongthai, Watsamon Jantarabenjakul, Li-Chen Fu, Gustavo César de Antônio Corradi, Eric K. Oermann, Nicola Rieke, Varun Buch, Abood Quraini, Shuichi Kawano, Natalie Gangai, Yu Rim Lee, Krishna Nand Keshava Murthy, Christopher P. Hess, Stefan Gräf, Ittai Dayan, Stephanie Harmon, Jae Ho Sohn, Eddie Huang, Ahmed Harouni, Vitor de Lima Lavor, Sharmila Majumdar, Sheng Xu, Hirofumi Obinata, Fiona J. Gilbert, and Chin Lin
Subjects: federated learning, Computer science, business.industry, SARS-CoV-2, Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), Vital signs, MEDLINE, COVID-19, artificial intelligence, Prognosis, Data science, Article, Data sharing, Machine Learning, Data exchange, Health care, Outcome Assessment, Health Care, Electronic Health Records, Humans, Set (psychology), business, Anonymity
Abstract: Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC)0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site's data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare.
Published: 2021

29. An Open-Source, Vender Agnostic Hardware and Software Pipeline for Integration of Artificial Intelligence in Radiology Workflow

Author: Yeshwant Chillakuru, Thienkhai Vu, Bonnie N. Joe, Amie Y. Lee, Stanley Lee, Youngho Seo, Tatiana Kelil, Jae Ho Sohn, and Christopher P. Hess
Subjects: Computer science, Integration testing, Methods Paper, Graphics processing unit, 030218 nuclear medicine & medical imaging, Workflow, 03 medical and health sciences, DICOM, 0302 clinical medicine, Software, Picture archiving and communication system, Artificial Intelligence, Humans, Radiology, Nuclear Medicine and imaging, Software system, Intranet, Radiological and Ultrasound Technology, business.industry, Pipeline (software), Computer Science Applications, Systems Integration, Radiology Information Systems, business, Radiology, 030217 neurology & neurosurgery, Computer hardware
Abstract: Although machine learning (ML) has made significant improvements in radiology, few algorithms have been integrated into clinical radiology workflow. Complex radiology IT environments and Picture Archiving and Communication System (PACS) pose unique challenges in creating a practical ML schema. However, clinical integration and testing are critical to ensuring the safety and accuracy of ML algorithms. This study aims to propose, develop, and demonstrate a simple, efficient, and understandable hardware and software system for integrating ML models into the standard radiology workflow and PACS that can serve as a framework for testing ML algorithms. A Digital Imaging and Communications in Medicine/Graphics Processing Unit (DICOM/GPU) server and software pipeline was established at a metropolitan county hospital intranet to demonstrate clinical integration of ML algorithms in radiology. A clinical ML integration schema, agnostic to the hospital IT system and specific ML models/frameworks, was implemented and tested with a breast density classification algorithm and prospectively evaluated for time delays using 100 digital 2D mammograms. An open-source clinical ML integration schema was successfully implemented and demonstrated. This schema allows for simple uploading of custom ML models. With the proposed setup, the ML pipeline took an average of 26.52 s per second to process a batch of 100 studies. The most significant processing time delays were noted in model load and study stability times. The code is made available at “ http://bit.ly/2Z121hX ”. We demonstrated the feasibility to deploy and utilize ML models in radiology without disrupting existing radiology workflow.
Published: 2020

30. A deep learning algorithm to detect the presence of basal cell carcinoma on Mohs micrographic surgery frozen sections

Author: Jessica Yeh, Jae Ho Sohn, Yixin Chen, Shang I Brian Jiang, and Grace K. Sohn
Subjects: medicine.medical_specialty, Frozen section procedure, Skin Neoplasms, business.industry, Pilot Projects, Dermatology, medicine.disease, Mohs Surgery, Micrographic surgery, Proof of Concept Study, Article, Deep Learning, ROC Curve, Carcinoma, Basal Cell, Image Interpretation, Computer-Assisted, medicine, Feasibility Studies, Frozen Sections, Humans, Basal cell carcinoma, Radiology, business, False Negative Reactions, Retrospective Studies, Skin
Published: 2020

31. Development and Validation of a Multitask Deep Learning Model for Severity Grading of Hip Osteoarthritis Features on Radiographs

Author: Michael C. Nevitt, Lorenzo Nardo, Eugene Ozhinsky, Jae Ho Sohn, Claudio E. von Schacky, Felix Liu, Sarah C Foreman, Thomas M. Link, Pia M. Jungmann, Valentina Pedoia, and Magdalena Posadzy
Subjects: musculoskeletal diseases, Male, medicine.medical_specialty, Validation study, Radiography, education, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Severity grading, Osteoarthritis, Medical and Health Sciences, Severity of Illness Index, Article, Osteoarthritis, Hip, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, 0302 clinical medicine, Physical medicine and rehabilitation, Deep Learning, Theoretical, Models, Clinical Research, Severity of illness, medicine, Hip osteoarthritis, Humans, Radiology, Nuclear Medicine and imaging, Multiple classification, Aged, Retrospective Studies, Hip, business.industry, Deep learning, Prevention, Arthritis, Middle Aged, Models, Theoretical, medicine.disease, Nuclear Medicine & Medical Imaging, 030220 oncology & carcinogenesis, Female, Artificial intelligence, business
Abstract: Background A multitask deep learning model might be useful in large epidemiologic studies wherein detailed structural assessment of osteoarthritis still relies on expert radiologists' readings. The potential of such a model in clinical routine should be investigated. Purpose To develop a multitask deep learning model for grading radiographic hip osteoarthritis features on radiographs and compare its performance to that of attending-level radiologists. Materials and Methods This retrospective study analyzed hip joints seen on weight-bearing anterior-posterior pelvic radiographs from participants in the Osteoarthritis Initiative (OAI). Participants were recruited from February 2004 to May 2006 for baseline measurements, and follow-up was performed 48 months later. Femoral osteophytes (FOs), acetabular osteophytes (AOs), and joint-space narrowing (JSN) were graded as absent, mild, moderate, or severe according to the Osteoarthritis Research Society International atlas. Subchondral sclerosis and subchondral cysts were graded as present or absent. The participants were split at 80% (n = 3494), 10% (n = 437), and 10% (n = 437) by using split-sample validation into training, validation, and testing sets, respectively. The multitask neural network was based on DenseNet-161, a shared convolutional features extractor trained with multitask loss function. Model performance was evaluated in the internal test set from the OAI and in an external test set by using temporal and geographic validation consisting of routine clinical radiographs. Results A total of 4368 participants (mean age, 61.0 years ± 9.2 [standard deviation]; 2538 women) were evaluated (15 364 hip joints on 7738 weight-bearing anterior-posterior pelvic radiographs). The accuracy of the model for assessing these five features was 86.7% (1333 of 1538) for FOs, 69.9% (1075 of 1538) for AOs, 81.7% (1257 of 1538) for JSN, 95.8% (1473 of 1538) for subchondral sclerosis, and 97.6% (1501 of 1538) for subchondral cysts in the internal test set, and 82.7% (86 of 104) for FOS, 65.4% (68 of 104) for AOs, 80.8% (84 of 104) for JSN, 88.5% (92 of 104) for subchondral sclerosis, and 91.3% (95 of 104) for subchondral cysts in the external test set. Conclusion A multitask deep learning model is a feasible approach to reliably assess radiographic features of hip osteoarthritis. © RSNA, 2020 Online supplemental material is available for this article.
Published: 2020

32. Machine learning for predicting knee osteoarthritis progression over 8 years using combined MR imaging features, demographics, and clinical factors: data from the osteoarthritis initiative

Author: Gabby B. Joseph, Charles E. McCulloch, Michael C. Nevitt, Jae Ho Sohn, and Thomas M. Link
Subjects: medicine.medical_specialty, Rheumatology, Demographics, business.industry, Biomedical Engineering, Physical therapy, Medicine, Orthopedics and Sports Medicine, Osteoarthritis, business, medicine.disease, Mr imaging
Published: 2021

33. Characteristics of a New X-Ray Imaging System for Interventional Procedures: Improved Image Quality and Reduced Radiation Dose

Author: MingDe Lin, Ruediger E. Schernthaner, Sonia Sahu, Reham R. Haroun, Rafael Duran, Imramsjah M. J. van der Bom, Jean Francois H. Geschwind, Kelvin Hong, Yan Zhao, Alessandro Radaelli, Julius Chapiro, Maria Mauti, S. Nguyen, and Jae Ho Sohn
Subjects: Image quality, medicine.medical_treatment, Image Processing, Cardiorespiratory Medicine and Haematology, Radiography, Interventional, 030218 nuclear medicine & medical imaging, 0302 clinical medicine, Computer-Assisted, Uterine artery embolization, Image noise, Technical Note, Image Processing, Computer-Assisted, Fluoroscopy, Uterine artery, 2. Zero hunger, medicine.diagnostic_test, Interventional, Leiomyoma, Ultrasound, Angiography, Middle Aged, 3. Good health, Nuclear Medicine & Medical Imaging, Uterine Artery, 030220 oncology & carcinogenesis, Biomedical Imaging, Female, Radiology, Cardiology and Cardiovascular Medicine, Adult, medicine.medical_specialty, Radiation Dosage, 03 medical and health sciences, Image quality enhancement, Clinical Research, medicine.artery, medicine, Humans, Radiology, Nuclear Medicine and imaging, Retrospective Studies, Interventional radiology, business.industry, X-Rays, Angiography, Digital Subtraction, Reproducibility of Results, Digital subtraction angiography, Uterine Artery Embolization, Radiography, Dose area product, business, Nuclear medicine, Digital Subtraction
Abstract: Purpose To compare image quality and radiation exposure between a new angiographic imaging system and the preceding generation system during uterine artery embolization (UAE). Materials and Methods In this retrospective, IRB-approved two-arm study, 54 patients with symptomatic uterine fibroids were treated with UAE on two different angiographic imaging systems. The new system includes optimized acquisition parameters and real-time image processing algorithms. Air kerma (AK), dose area product (DAP) and acquisition time for digital fluoroscopy (DF) and digital subtraction angiography (DSA) were recorded. Body mass index was noted as well. DF image quality was assessed objectively by image noise measurements. DSA image quality was rated by two blinded, independent readers on a four-rank scale. Statistical differences were assessed with unpaired t tests and Wilcoxon rank-sum tests. Results There was no significant difference between the patients treated on the new (n = 36) and the old system (n = 18) regarding age (p = 0.10), BMI (p = 0.18), DF time (p = 0.35) and DSA time (p = 0.17). The new system significantly reduced the cumulative AK and DAP by 64 and 72%, respectively (median 0.58 Gy and 145.9 Gy*cm2 vs. 1.62 Gy and 526.8 Gy*cm2, p
Published: 2017

34. Prospective Comparison of Diagnostic Accuracy Between Point-of-Care and Conventional Ultrasound in a General Diagnostic Department: Implications for Resource-Limited Settings

Author: Robert D. Harris, Roberta M. diFlorio-Alexander, Steffen J A Haider, Joo Y Cho, David H. Lam, and Jae Ho Sohn
Subjects: medicine.medical_specialty, Radiological and Ultrasound Technology, Image quality, business.industry, Point-of-care testing, Ultrasound, Diagnostic accuracy, 030204 cardiovascular system & hematology, Conventional ultrasound, 03 medical and health sciences, 0302 clinical medicine, medicine, Radiology, Nuclear Medicine and imaging, 030212 general & internal medicine, Radiology, Medical diagnosis, business, Prospective cohort study, Point of care
Abstract: Objectives To compare the diagnostic accuracy of hand-held point of care (POC) versus conventional sonography in a general diagnostic setting with the intention to inform medical providers or clinicians on the rational use of POC ultrasound in resource limited settings. Methods Over 3 months in 2010, 47 patients were prospectively enrolled at a single academic center to obtain 54 clinical conventional ultrasound examinations and 54 study-only POC ultrasound examinations. Indications were 48% abdominal, 26% retroperitoneal, and 24% obstetrical. Nine blinded readers (sonographers, residents, and attending radiologists) sequentially assigned diagnoses to POC and then conventional studies, yielding 476 interpreted study pairs. Diagnostic accuracy was obtained by comparing POC and conventional diagnoses to a reference diagnosis established by the unblinded, senior author. Analysis was stratified by study type, body mass index (BMI), diagnostic confidence, and image quality. Results The mean diagnostic accuracy of conventional sonography was 84% compared with 74% for POC (P < .001). This difference was constant regardless of reader, exam type, or BMI. The sensitivity and specificity to detect abnormalities with conventional was 85 and 83%, compared with 75 and 68% for POC. The POC sonography demonstrated greater variability in image quality and diagnostic confidence, and this accounted for lower diagnostic accuracy. When image quality and diagnostic confidence were similar between POC and conventional examinations, there was no difference in accuracy. Conclusions Point of care was nearly as accurate as conventional sonography for basic, focused examinations. Observed differences in accuracy were attributed to greater variation in POC image quality.
Published: 2017

35. Automated Localization and Segmentation of Mononuclear Cell Aggregates in Kidney Histological Images Using Deep Learning

Author: Benjamin S. Glicksberg, Aaron Chin, Andrew M. Bishara, Sung Jik Cha, Dmytro Lituiev, Ruizhe (Ryan) Cheng, Dejan Dobi, Dexter Hadley, Jae Ho Sohn, and Zoltan Laszik
Subjects: 0303 health sciences, Pathology, medicine.medical_specialty, Kidney, medicine.diagnostic_test, business.industry, Deep learning, 030230 surgery, medicine.disease, Peripheral blood mononuclear cell, Convolutional neural network, 3. Good health, 03 medical and health sciences, 0302 clinical medicine, medicine.anatomical_structure, Allograft rejection, Biopsy, medicine, Segmentation, Artificial intelligence, business, Kidney transplantation, 030304 developmental biology
Abstract: Allograft rejection is a major concern in kidney transplantation. Inflammatory processes in patients with kidney allografts involve various patterns of immune cell recruitment and distributions. Lymphoid aggregates (LAs) are commonly observed in patients with kidney allografts and their presence and localization may correlate with severity of acute rejection. Alongside with other markers of inflammation, LAs assessment is currently performed by pathologists manually in a qualitative way, which is both time consuming and far from precise. Here we present the first automated method of identifying LAs and measuring their densities in whole slide images of transplant kidney biopsies. We trained a deep convolutional neural network based on U-Net on 44 core needle kidney biopsy slides, monitoring loss on a validation set (n=7 slides). The model was subsequently tested on a hold-out set (n=10 slides). We found that the coarse pattern of LAs localization agrees between the annotations and predictions, which is reflected by high correlation between the annotated and predicted fraction of LAs area per slide (Pearson R of 0.9756). Furthermore, the network achieves an auROC of 97.78 ± 0.93% and an IoU score of 69.72 ± 6.24 % per LA-containing slide in the test set. Our study demonstrates that a deep convolutional neural network can accurately identify lymphoid aggregates in digitized histological slides of kidney. This study presents a first automatic DL-based approach for quantifying inflammation marks in allograft kidney, which can greatly improve precision and speed of assessment of allograft kidney biopsies when implemented as a part of computer-aided diagnosis system.
Published: 2019

36. Artificial Intelligence to Grade Hip Osteoarthritis Features on Radiographs

Author: C.E. von Schacky, Felix Liu, Eugene Ozhinsky, Thomas M. Link, Jae Ho Sohn, Michael C. Nevitt, Sarah C. Foreman, Lorenzo Nardo, Valentina Pedoia, and Pia M. Jungmann
Subjects: Orthodontics, business.industry, Radiography, Hip osteoarthritis, Medicine, business
Published: 2019

37. Automatic Hip Fracture Identification and Functional Subclassification with Deep Learning

Author: Musa Zaid, Kevin A. Padrez, Eugene Ozhinsky, Kevin M. Hwang, Kevin C. McGill, Alexandra Wright, Eric G. Meinberg, Erik J. Geiger, Jae Ho Sohn, Paul Toogood, Bryan F. Darger, Rina Patel, Valentina Pedoia, Kaiyang Cheng, Justin D. Krogue, and Sharmila Majumdar
Subjects: musculoskeletal diseases, FOS: Computer and information sciences, Computer Science - Machine Learning, Computer science, Computer Vision and Pattern Recognition (cs.CV), cs.LG, education, Computer Science - Computer Vision and Pattern Recognition, FOS: Physical sciences, physics.med-ph, Quantitative Biology - Quantitative Methods, Machine Learning (cs.LG), Artificial Intelligence, parasitic diseases, medicine, FOS: Electrical engineering, electronic engineering, information engineering, Radiology, Nuclear Medicine and imaging, cs.CV, Quantitative Methods (q-bio.QM), Original Research, Hip fracture, Radiological and Ultrasound Technology, q-bio.QM, business.industry, Deep learning, Image and Video Processing (eess.IV), Pattern recognition, Electrical Engineering and Systems Science - Image and Video Processing, medicine.disease, Physics - Medical Physics, Identification (information), FOS: Biological sciences, eess.IV, Artificial intelligence, Medical Physics (physics.med-ph), business
Abstract: Purpose: Hip fractures are a common cause of morbidity and mortality. Automatic identification and classification of hip fractures using deep learning may improve outcomes by reducing diagnostic errors and decreasing time to operation. Methods: Hip and pelvic radiographs from 1118 studies were reviewed and 3034 hips were labeled via bounding boxes and classified as normal, displaced femoral neck fracture, nondisplaced femoral neck fracture, intertrochanteric fracture, previous ORIF, or previous arthroplasty. A deep learning-based object detection model was trained to automate the placement of the bounding boxes. A Densely Connected Convolutional Neural Network (DenseNet) was trained on a subset of the bounding box images, and its performance evaluated on a held out test set and by comparison on a 100-image subset to two groups of human observers: fellowship-trained radiologists and orthopaedists, and senior residents in emergency medicine, radiology, and orthopaedics. Results: The binary accuracy for fracture of our model was 93.8% (95% CI, 91.3-95.8%), with sensitivity of 92.7% (95% CI, 88.7-95.6%), and specificity 95.0% (95% CI, 91.5-97.3%). Multiclass classification accuracy was 90.4% (95% CI, 87.4-92.9%). When compared to human observers, our model achieved at least expert-level classification under all conditions. Additionally, when the model was used as an aid, human performance improved, with aided resident performance approximating unaided fellowship-trained expert performance. Conclusions: Our deep learning model identified and classified hip fractures with at least expert-level accuracy, and when used as an aid improved human performance, with aided resident performance approximating that of unaided fellowship-trained attendings., Comment: Presented at Orthopaedic Research Society, Austin, TX, Feb 2, 2019, currently in submission for publication
Published: 2019
Full Text: View/download PDF

38. CT-less Direct Correction of Attenuation and Scatter in the Image Space Using Deep Learning for Whole-Body FDG PET: Potential Benefits and Pitfalls

Author: Grant T. Gullberg, Jae Ho Sohn, Jaewon Yang, Spencer C. Behr, and Youngho Seo
Subjects: Radiological and Ultrasound Technology, business.industry, Computer science, Deep learning, Attenuation, Space (mathematics), Image (mathematics), Artificial Intelligence, Biomedical Imaging, Radiology, Nuclear Medicine and imaging, Computer vision, Artificial intelligence, Whole body, business, Original Research
Abstract: PURPOSE: To demonstrate the feasibility of CT-less attenuation and scatter correction (ASC) in the image space using deep learning for whole-body PET, with a focus on the potential benefits and pitfalls. MATERIALS AND METHODS: In this retrospective study, 110 whole-body fluorodeoxyglucose (FDG) PET/CT studies acquired in 107 patients (mean age ± standard deviation, 58 years ± 18; age range, 11–92 years; 72 females) from February 2016 through January 2018 were randomly collected. A total of 37.3% (41 of 110) of the studies showed metastases, with diverse FDG PET findings throughout the whole body. A U-Net–based network was developed for directly transforming noncorrected PET (PET(NC)) into attenuation- and scatter-corrected PET (PET(ASC)). Deep learning–corrected PET (PET(DL)) images were quantitatively evaluated by using the standardized uptake value (SUV) of the normalized root mean square error, the peak signal-to-noise ratio, and the structural similarity index, in addition to a joint histogram for statistical analysis. Qualitative reviews by radiologists revealed the potential benefits and pitfalls of this correction method. RESULTS: The normalized root mean square error (0.21 ± 0.05 [mean SUV ± standard deviation]), mean peak signal-to-noise ratio (36.3 ± 3.0), mean structural similarity index (0.98 ± 0.01), and voxelwise correlation (97.62%) of PET(DL) demonstrated quantitatively high similarity with PET(ASC). Radiologist reviews revealed the overall quality of PET(DL). The potential benefits of PET(DL) include a radiation dose reduction on follow-up scans and artifact removal in the regions with attenuation correction– and scatter correction–based artifacts. The pitfalls involve potential false-negative results due to blurring or missing lesions or false-positive results due to pseudo–low-uptake patterns. CONCLUSION: Deep learning–based direct ASC at whole-body PET is feasible and potentially can be used to overcome the current limitations of CT-based approaches, benefiting patients who are sensitive to radiation from CT. Supplemental material is available for this article. © RSNA, 2020
Published: 2021

39. Domain specific word embeddings for natural language processing in radiology

Author: Gunvant R. Chaudhari, Max Emerling, Youngho Seo, Yeshwant Chillakuru, Thienkhai Vu, Jae Ho Sohn, and Timothy L. Chen
Subjects: medicine.medical_specialty, Computer science, Health Informatics, Semantics, computer.software_genre, Article, Task (project management), Machine Learning, Set (abstract data type), 03 medical and health sciences, 0302 clinical medicine, McNemar's test, medicine, 030212 general & internal medicine, Natural Language Processing, 030304 developmental biology, Multi-label classification, 0303 health sciences, Artificial neural network, business.industry, Unified Medical Language System, Computer Science Applications, Radiology, Artificial intelligence, business, Hamming code, computer, Word (computer architecture), Natural language processing
Abstract: BACKGROUND: There has been increasing interest in machine learning based natural language processing (NLP) methods in radiology; however, models have often used word embeddings trained on general web corpora due to lack of a radiology-specific corpus. PURPOSE: We examined the potential of Radiopaedia to serve as a general radiology corpus to produce radiology specific word embeddings that could be used to enhance performance on a NLP task on radiological text. MATERIALS AND METHODS: Embeddings of dimension 50, 100, 200, and 300 were trained on articles collected from Radiopaedia using a GloVe algorithm and evaluated on analogy completion. A shallow neural network using input from either our trained embeddings or pre-trained Wikipedia2014+Gigaword5 (WG) embeddings was used to label the Radiopaedia articles. Labeling performance was evaluated based on exact match accuracy and Hamming loss. The McNemar’s test with continuity and the Benjamini-Hochberg correction and a 5×2 cross validation paired two-tailed t-test were used to assess statistical significance. RESULTS: For accuracy in the analogy task, 50-dimensional (50-D) Radiopaedia embeddings outperformed WG embeddings on tumor origin analogies (p < 0.05) and organ adjectives (p < 0.01) whereas WG embeddings tended to outperform on inflammation location and bone vs. muscle analogies (p < 0.01). The two embeddings had comparable performance on other subcategories. In the labeling task, the Radiopaedia-based model outperformed the WG based model at 50, 100, 200, and 300-D for exact match accuracy (p < 0.001, p < 0.001, p< 0.01, and p < 0.05, respectively) and Hamming loss (p < 0.001, p < 0.001, p < 0.01, and p < 0.05, respectively). CONCLUSION: We have developed a set of word embeddings from Radiopaedia and shown that they can preserve relevant medical semantics and augment performance on a radiology NLP task. Our results suggest that the cultivation of a radiology-specific corpus can benefit radiology NLP models in the future.
Published: 2021

40. Radiomics and Computerized Analysis of CT Images: Looking Forward

Author: Jae Ho Sohn and Brett M. Elicker
Subjects: Radiomics, Computer science, Computerized analysis, Commentary, Radiology, Nuclear Medicine and imaging, Data mining, computer.software_genre, computer, Original Research
Abstract: PURPOSE: To develop radiomics-based CT scores for assessing lung disease severity and exacerbation risk in adult patients with cystic fibrosis (CF). MATERIALS AND METHODS: This two-center retrospective observational study was approved by an institutional ethics committee, and the need for patient consent was waived. A total of 215 outpatients with CF referred for unenhanced follow-up chest CT were evaluated in two different centers between January 2013 and December 2016. After lung segmentation, chest CT scans from center 1 (training cohort, 162 patients [median age, 29 years; interquartile range {IQR}, 24–36 years; 84 men]) were used to build CT scores from 38 extracted CT features, using five different machine learning techniques trained to predict a clinical prognostic score, the Nkam score. The correlations between the developed CT scores, two different clinical prognostic scores (Liou and CF-ABLE), forced expiratory volume in 1 second (FEV(1)), and risk of respiratory exacerbations were evaluated in the test cohort (center 2, 53 patients [median age, 27 years; IQR, 22–35 years; 34 men]) using the Spearman rank coefficient. RESULTS: In the test cohort, all radiomics-based CT scores showed moderate to strong correlation with the Nkam score (R = 0.57 to 0.63, P < .001) and Liou scores (R = −0.55 to −0.65, P < .001), whereas the correlation with CF-ABLE score was weaker (R = 0.28 to 0.38, P = .005 to .048). The developed CT scores showed strong correlation with predicted FEV(1) (R = −0.62 to −0.66, P < .001) and weak to moderate correlation with the number of pulmonary exacerbations to occur in the 12 months after the CT examination (R = 0.38 to 0.55, P < .001 to P = .006). CONCLUSION: Radiomics can be used to build automated CT scores that correlate to clinical severity and exacerbation risk in adult patients with CF. Supplemental material is available for this article. See also the commentary by Elicker and Sohn in this issue. © RSNA, 2020
Published: 2020

41. Improved Visibility of Metastatic Disease in the Liver During Intra-Arterial Therapy Using Delayed Arterial Phase Cone-Beam CT

Author: Boris Gorodetski, Imramsjah M. J. van der Bom, Ruediger E. Schernthaner, MingDe Lin, Alessandro Radaelli, Florian Fleckenstein, Sonia Sahu, Jae Ho Sohn, Julius Chapiro, Reham R. Haroun, Jean Francois H. Geschwind, Rafael Duran, Yan Zhao, Howard Lee, and Susanne Smolka
Subjects: Male, Contrast Media, Cardiorespiratory Medicine and Haematology, Imaging, 030218 nuclear medicine & medical imaging, 0302 clinical medicine, Interventional oncology, Neoplasms, Cancer, Liver/hepatic, medicine.diagnostic_test, Liver Disease, Liver Neoplasms, Visibility (geometry), Ultrasound, Angiography, Neoplasms, Second Primary, Cone-Beam Computed Tomography, Middle Aged, Magnetic Resonance Imaging, Nuclear Medicine & Medical Imaging, Radio-embolization/radio-embolisation, Second Primary, Liver, Radiology Nuclear Medicine and imaging, 030220 oncology & carcinogenesis, Biomedical Imaging, Chemoembolization, Female, Radiology, Therapeutic, Transarterial chemoembolization/embolisation, Cardiology and Cardiovascular Medicine, Adult, medicine.medical_specialty, Clinical Trials and Supportive Activities, 03 medical and health sciences, Clinical Research, medicine, Intra arterial, Humans, Radiology, Nuclear Medicine and imaging, Clinical Investigation, Chemoembolization, Therapeutic, Retrospective Studies, business.industry, Angiography, Digital Subtraction, Magnetic resonance imaging, Digital subtraction angiography, Image Enhancement, equipment and supplies, medicine.disease, Transarterial chemoembolization/embolisation (TACE), Digestive Diseases, business, human activities, Digital Subtraction, Arterial phase
Abstract: PurposeTo compare the visibility of liver metastases on dual-phase cone-beam CT (DP-CBCT) and digital subtraction angiography (DSA), with reference to preinterventional contrast-enhanced magnetic resonance imaging (CE-MRI) of the liver.MethodsThis IRB-approved, retrospective study included 28 patients with neuroendocrine (NELM), colorectal (CRCLM), or sarcoma (SLM) liver metastases who underwent DP-CBCT during intra-arterial therapy (IAT) between 01/2010 and 10/2014. DP-CBCT was acquired after a single contrast agent injection in the tumor-feeding arteries at early and delayed arterial phases (EAP and DAP). The visibility of each lesion was graded by two radiologists in consensus on a three-rank scale (complete, partial, none) on DP-CBCT and DSA images using CE-MRI as reference.Results47 NELM, 43 CRCLM, and 16 SLM were included. On DSA 85.1, 44.1, and 37.5% of NELM, CRCLM, and SLM, were at least partially depicted, respectively. EAP-CBCT yielded significantly higher sensitivities of 88.3 and 87.5% for CRCLM and SLM, respectively (p&nbsp
Published: 2016

42. Multimodality Imaging of Ethiodized Oil–loaded Radiopaque Microspheres during Transarterial Embolization of Rabbits with VX2 Liver Tumors

Author: Karun Sharma, Yiqing Tang, Michael Grass, MingDe Lin, Bradford J. Wood, Nikhil Bhagat, Dirk Schäfer, Rafael Duran, David L. Woods, Jae Ho Sohn, Jean Francois H. Geschwind, Zhijun Wang, Julius Chapiro, Andrew L. Lewis, Matthew R. Dreher, Vania Tacher, and Carmen Gacchina Johnson
Subjects: Male, medicine.medical_specialty, Liver tumor, Contrast Media, Multimodal Imaging, 030218 nuclear medicine & medical imaging, Microsphere, Imaging modalities, 03 medical and health sciences, Ethiodized Oil, Liver Neoplasms, Experimental, 0302 clinical medicine, Multidetector Computed Tomography, Multidetector computed tomography, Transarterial embolization, medicine, Animals, Radiology, Nuclear Medicine and imaging, Tumor location, Original Research, Multimodal imaging, business.industry, Cone-Beam Computed Tomography, medicine.disease, Embolization, Therapeutic, Microspheres, 030220 oncology & carcinogenesis, Rabbits, Radiology, business
Abstract: Purpose To assess the visibility of radiopaque microspheres during transarterial embolization (TAE) in the VX2 rabbit liver tumor model by using multimodality imaging, including single-snapshot radiography, cone-beam computed tomography (CT), multidetector CT, and micro-CT. Materials and Methods The study was approved by the institutional animal care and use committee. Fifteen VX2-tumor-bearing rabbits were assigned to three groups depending on the type of embolic agent injected: 70-150-μm radiopaque microspheres in saline (radiopaque microsphere group), 70-150-μm radiopaque microspheres in contrast material (radiopaque microsphere plus contrast material group), and 70-150-μm radiolucent microspheres in contrast material (nonradiopaque microsphere plus contrast material group). Rabbits were imaged with single-snapshot radiography, cone-beam CT, and multidetector CT. Three to 5 weeks after sacrifice, excised livers were imaged with micro-CT and histologic analysis was performed. The visibility of the embolic agent was assessed with all modalities before and after embolization by using a qualitative three-point scale score reading study and a quantitative assessment of the signal-to-noise ratio (SNR) change in various regions of interest, including the tumor and its feeding arteries. The Kruskal-Wallis test was used to compare the rabbit characteristics across groups, and the Wilcoxon signed rank test was used to compare SNR measurements before and after embolization. Results Radiopaque microspheres were qualitatively visualized within tumor feeding arteries and targeted tissue with all imaging modalities (P.05), and their presence was confirmed with histologic examination. SNRs of radiopaque microsphere deposition increased after TAE on multidetector CT, cone-beam CT, and micro-CT images (P.05). Similar results were obtained when contrast material was added to radiopaque microspheres, except for additional image attenuation due to tumor enhancement. For the group with nonradiopaque microspheres and contrast material, retained tumoral contrast remained qualitatively visible with all modalities except for micro-CT, which demonstrated soluble contrast material washout over time. Conclusion Radiopaque microspheres were visible with all imaging modalities and helped increase conspicuity of the tumor as well as its feeding arteries after TAE in a rabbit VX2 liver tumor model. (©) RSNA, 2015.
Published: 2016

43. Comparison of Existing Response Criteria in Patients with Hepatocellular Carcinoma Treated with Transarterial Chemoembolization Using a 3D Quantitative Approach

Author: Michael Chao, Vania Tacher, Hooman Yarmohammadi, MingDe Lin, Rafael Duran, Jae Ho Sohn, Timothy M. Pawlik, Julius Chapiro, Zhijun Wang, Mitchell Gil Maltenfort, Howard Lee, Jean Francois H. Geschwind, and Constantine Frangakis
Subjects: Gadolinium DTPA, Male, Treatment outcome, Contrast Media, Medical and Health Sciences, 030218 nuclear medicine & medical imaging, Microsphere, 0302 clinical medicine, Cancer, Original Research, screening and diagnosis, Liver Disease, Liver Neoplasms, food and beverages, Middle Aged, Microspheres, Detection, Nuclear Medicine & Medical Imaging, Treatment Outcome, Response Evaluation Criteria in Solid Tumors, 030220 oncology & carcinogenesis, Hepatocellular carcinoma, Chemoembolization, Female, Radiology, Therapeutic, Liver Cancer, Diagnostic Imaging, medicine.medical_specialty, Carcinoma, Hepatocellular, 03 medical and health sciences, Rare Diseases, Carcinoma, medicine, Humans, Radiology, Nuclear Medicine and imaging, In patient, Chemoembolization, Therapeutic, Response criteria, Aged, Retrospective Studies, business.industry, fungi, Hepatocellular, Retrospective cohort study, medicine.disease, 4.1 Discovery and preclinical testing of markers and technologies, Doxorubicin, Digestive Diseases, business
Abstract: PurposeTo compare currently available non-three-dimensional methods (Response Evaluation Criteria in Solid Tumors [RECIST], European Association for Study of the Liver [EASL], modified RECIST [mRECIST[) with three-dimensional (3D) quantitative methods of the index tumor as early response markers in predicting patient survival after initial transcatheter arterial chemoembolization (TACE).Materials and methodsThis was a retrospective single-institution HIPAA-compliant and institutional review board-approved study. From November 2001 to November 2008, 491 consecutive patients underwent intraarterial therapy for liver cancer with either conventional TACE or TACE with drug-eluting beads. A diagnosis of hepatocellular carcinoma (HCC) was made in 290 of these patients. The response of the index tumor on pre- and post-TACE magnetic resonance images was assessed retrospectively in 78 treatment-naïve patients with HCC (63 male; mean age, 63 years ± 11 [standard deviation]). Each response assessment method (RECIST, mRECIST, EASL, and 3D methods of volumetric RECIST [vRECIST] and quantitative EASL [qEASL]) was used to classify patients as responders or nonresponders by following standard guidelines for the uni- and bidimensional measurements and by using the formula for a sphere for the 3D measurements. The Kaplan-Meier method with the log-rank test was performed for each method to evaluate its ability to help predict survival of responders and nonresponders. Uni- and multivariate Cox proportional hazard ratio models were used to identify covariates that had significant association with survival.ResultsThe uni- and bidimensional measurements of RECIST (hazard ratio, 0.6; 95% confidence interval [CI]: 0.3, 1.0; P = .09), mRECIST (hazard ratio, 0.6; 95% CI: 0.6, 1.0; P = .05), and EASL (hazard ratio, 1.1; 95% CI: 0.6, 2.2; P = .75) did not show a significant difference in survival between responders and nonresponders, whereas vRECIST (hazard ratio, 0.6; 95% CI: 0.3, 1.0; P = .04), qEASL (Vol) (hazard ratio, 0.5; 95% CI: 0.3, 0.9; P = .02), and qEASL (%) (hazard ratio, 0.3; 95% CI: 0.15, 0.60; P < .001) did show a significant difference between these groups.ConclusionThe 3D-based imaging biomarkers qEASL and vRECIST were tumor response criteria that could be used to predict patient survival early after initial TACE and enabled clear identification of nonresponders.
Published: 2016

44. Feasibility of a Modified Cone-Beam CT Rotation Trajectory to Improve Liver Periphery Visualization during Transarterial Chemoembolization

Author: Julius Chapiro, Paul Withagen, Jae Ho Sohn, Sonia Sahu, Rüdiger Schernthaner, Imramsjah Martin Van Der Bom, MingDe Lin, Jean Francois H. Geschwind, Alessandro Radaelli, and Rafael Duran
Subjects: Male, Cone beam computed tomography, medicine.medical_specialty, Rotation, genetic structures, Technical success, Medical and Health Sciences, Rare Diseases, Clinical Research, medicine, Humans, Radiology, Nuclear Medicine and imaging, Chemoembolization, Therapeutic, Cone beam ct, Cancer, Original Research, Aged, Retrospective Studies, medicine.diagnostic_test, Lesion detection, business.industry, Liver Disease, Liver Neoplasms, Magnetic resonance imaging, Cone-Beam Computed Tomography, Radiation Exposure, Magnetic Resonance Imaging, Visualization, Nuclear Medicine & Medical Imaging, Liver, Trajectory, Biomedical Imaging, Feasibility Studies, Chemoembolization, Female, sense organs, Radiology, Therapeutic, Digestive Diseases, business
Abstract: PurposeTo compare liver coverage and tumor detectability by using preprocedural magnetic resonance (MR) images as a reference, as well as radiation exposure of cone-beam computed tomography (CT) with different rotational trajectories.Materials and methodsFifteen patients (nine men and six women; mean age ± standard deviation, 65 years ± 5) with primary or secondary liver cancer were retrospectively included in this institutional review board-approved study. A modified cone-beam CT protocol was used in which the C-arm rotates from +55° to -185° (open arc cone-beam CT) instead of -120° to +120° (closed arc cone-beam CT). Each patient underwent two sessions of transarterial chemoembolization between February 2013 and March 2014 with closed arc and open arc cone-beam CT (during the first and second transarterial chemoembolization sessions, respectively, as part of the institutional transarterial chemoembolization protocol). For each cone-beam CT examination, liver volume and tumor detectability were assessed by using MR images as the reference. Radiation exposure was compared by means of a phantom study. For statistical analysis, paired t tests and a Wilcoxon signed rank test were performed.ResultsMean liver volume imaged was 1695 cm(3) ± 542 and 1857 cm(3) ± 571 at closed arc and open arc cone-beam CT, respectively. The coverage of open arc cone-beam CT was significantly higher compared with closed arc cone-beam CT (97% vs 86% of the MR imaging liver volume, P = .002). In eight patients (53%), tumors were partially or completely outside the closed arc cone-beam CT field of view. All tumors were within the open arc cone-beam CT field of view. The open arc cone-beam CT radiation exposure by means of weighted CT index was slightly lower compared with that of closed arc cone-beam CT (-5.1%).ConclusionOpen arc cone-beam CT allowed for a significantly improved intraprocedural depiction of peripheral hepatic tumors while achieving a slight radiation exposure reduction.
Published: 2015

45. Development and web deployment of an automated neuroradiology MRI protocoling tool with natural language processing.

Author: Chillakuru, Yeshwant Reddy, Munjal, Shourya, Laguna, Benjamin, Chen, Timothy L., Chaudhari, Gunvant R., Thienkhai Vu, Youngho Seo, Narvid, Jared, Jae Ho Sohn, Vu, Thienkhai, Seo, Youngho, and Sohn, Jae Ho
Subjects: RESEARCH, NATURAL language processing, RESEARCH methodology, RADIOGRAPHY, MAGNETIC resonance imaging, MEDICAL cooperation, EVALUATION research, COMPARATIVE studies, SYSTEM analysis, MEDICAL specialties & specialists
Abstract: Background: A systematic approach to MRI protocol assignment is essential for the efficient delivery of safe patient care. Advances in natural language processing (NLP) allow for the development of accurate automated protocol assignment. We aim to develop, evaluate, and deploy an NLP model that automates protocol assignment, given the clinician indication text.Methods: We collected 7139 spine MRI protocols (routine or contrast) and 990 head MRI protocols (routine brain, contrast brain, or other) from a single institution. Protocols were split into training (n = 4997 for spine MRI; n = 839 for head MRI), validation (n = 1071 for spine MRI, fivefold cross-validation used for head MRI), and test (n = 1071 for spine MRI; n = 151 for head MRI) sets. fastText and XGBoost were used to develop 2 NLP models to classify spine and head MRI protocols, respectively. A Flask-based web app was developed to be deployed via Heroku.Results: The spine MRI model had an accuracy of 83.38% and a receiver operator characteristic area under the curve (ROC-AUC) of 0.8873. The head MRI model had an accuracy of 85.43% with a routine brain protocol ROC-AUC of 0.9463 and contrast brain protocol ROC-AUC of 0.9284. Cancer, infectious, and inflammatory related keywords were associated with contrast administration. Structural anatomic abnormalities and stroke/altered mental status were indicative of routine spine and brain MRI, respectively. Error analysis revealed increasing the sample size may improve performance for head MRI protocols. A web version of the model is provided for demonstration and deployment.Conclusion: We developed and web-deployed two NLP models that accurately predict spine and head MRI protocol assignment, which could improve radiology workflow efficiency. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

46. Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms

Author: Hyo-Eun Kim, Jiashi Feng, Stephen H. Friend, Ljubomir Buturovic, Dezső Ribli, Luis Caballero, Li Shen, Fredrik Strand, Yaroslav Nikulin, Krzysztof J. Geras, Kyunghyun Cho, Elias Chaibub Neto, Rami Ben-Ari, Christoph I. Lee, Zequn Jie, Imane Nedjar, Felix Nensa, Darvin Yi, Shivanthan A.C. Yohanandan, Bruce Hoff, Justin Guinney, Jaime S. Cardoso, Russell B. McBride, Mengling Feng, Yiqiu Shen, Simona Rabinovici-Cohen, Ethan Goan, Stefan Harrer, Sven Koitka, Michael Kawczynski, Hari Trivedi, Karl Trygve Kalleberg, Christoph M. Friedrich, F. Albiol, Dimitri Perrin, Jose Costa Pereira, Umar Asif, Bibo Shi, Zbigniew Wojna, Antonio Jimeno Yepes, Peter Lindholm, Berkman Sahiner, Sijia Wang, Thea Norman, Weiva Sieh, Joyce Cahoon, Gerard Cardoso Negrie, Pavitra Krishnaswamy, Diana S. M. Buist, Alberto Albiol, Lester Mackey, Hwejin Jung, Laurie R. Margolies, Gaurav Pandey, Can Son Khoo, William Lotter, Yuanfang Guan, Thomas Yu, Andrew D. Trister, Stephen Morrell, Gustavo Stolovitzky, A. Gregory Sorensen, Clinton Fookes, Mehmet Eren Ahsen, David D. Cox, Jae Ho Sohn, Hao Du, Thomas Schaffter, Joseph H. Rothstein, Eduardo Castro, Joseph Y. Lo, Daniel L. Rubin, and Obioma Pelka
Subjects: Adult, medicine.medical_specialty, Medizin, MEDLINE, Breast Neoplasms, Diagnostic accuracy, Sensitivity and Specificity, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, Deep Learning, 0302 clinical medicine, Breast cancer, Artificial Intelligence, Image Interpretation, Computer-Assisted, Radiologists, medicine, False positive paradox, Humans, Mammography, Risk factor, Early Detection of Cancer, Aged, Sweden, medicine.diagnostic_test, Screening mammography, business.industry, Correction, General Medicine, Middle Aged, medicine.disease, United States, 3. Good health, Online Only, 030220 oncology & carcinogenesis, Female, Other, Radiology, Artificial intelligence, business, Validation cohort, Algorithms
Abstract: Importance: Mammography screening currently relies on subjective human interpretation. Artificial intelligence (AI) advances could be used to increase mammography screening accuracy by reducing missed cancers and false positives. Objective: To evaluate whether AI can overcome human mammography interpretation limitations with a rigorous, unbiased evaluation of machine learning algorithms. Design, Setting, and Participants: In this diagnostic accuracy study conducted between September 2016 and November 2017, an international, crowdsourced challenge was hosted to foster AI algorithm development focused on interpreting screening mammography. More than 1100 participants comprising 126 teams from 44 countries participated. Analysis began November 18, 2016. Main Outcomes and Measurements: Algorithms used images alone (challenge 1) or combined images, previous examinations (if available), and clinical and demographic risk factor data (challenge 2) and output a score that translated to cancer yes/no within 12 months. Algorithm accuracy for breast cancer detection was evaluated using area under the curve and algorithm specificity compared with radiologists' specificity with radiologists' sensitivity set at 85.9% (United States) and 83.9% (Sweden). An ensemble method aggregating top-performing AI algorithms and radiologists' recall assessment was developed and evaluated. Results: Overall, 144 231 screening mammograms from 85 580 US women (952 cancer positive ≤12 months from screening) were used for algorithm training and validation. A second independent validation cohort included 166 578 examinations from 68 008 Swedish women (780 cancer positive). The top-performing algorithm achieved an area under the curve of 0.858 (United States) and 0.903 (Sweden) and 66.2% (United States) and 81.2% (Sweden) specificity at the radiologists' sensitivity, lower than community-practice radiologists' specificity of 90.5% (United States) and 98.5% (Sweden). Combining top-performing algorithms and US radiologist assessments resulted in a higher area under the curve of 0.942 and achieved a significantly improved specificity (92.0%) at the same sensitivity. Conclusions and Relevance: While no single AI algorithm outperformed radiologists, an ensemble of AI algorithms combined with radiologist assessment in a single-reader screening environment improved overall accuracy. This study underscores the potential of using machine learning methods for enhancing mammography screening interpretation. CA extern
Published: 2020

47. A Deep Learning Model to Predict a Diagnosis of Alzheimer Disease by Using

Author: Miguel Hernandez Pampaloni, Dmytro Lituiev, Spencer C. Behr, Benjamin L. Franc, Lorenzo Nardo, Jae Ho Sohn, Yiming Ding, Roy Harnish, Robert R. Flavell, Shih-ying Huang, Hari Trivedi, Youngho Seo, Nathaniel W. Jenkins, Dexter Hadley, Carina Mari Aparici, Randall A. Hawkins, Michael Kawczynski, Kelly A. Zalocusky, Timothy P. Copeland, and Mariam Aboian
Subjects: Male, Aging, Neurodegenerative, Alzheimer's Disease, Medical and Health Sciences, 030218 nuclear medicine & medical imaging, Machine Learning, Computer-Assisted, 0302 clinical medicine, 80 and over, Original Research, Aged, 80 and over, screening and diagnosis, Brain, Middle Aged, Detection, Nuclear Medicine & Medical Imaging, 030220 oncology & carcinogenesis, Neurological, Biomedical Imaging, Female, Radiology, Alzheimer's disease, Algorithms, 4.2 Evaluation of markers and technologies, medicine.drug, medicine.medical_specialty, Sensitivity and Specificity, 03 medical and health sciences, Deep Learning, Neuroimaging, Clinical Research, Alzheimer Disease, Fluorodeoxyglucose F18, Image Interpretation, Computer-Assisted, Acquired Cognitive Impairment, medicine, Humans, Radiology, Nuclear Medicine and imaging, Cognitive Dysfunction, Image Interpretation, Aged, Retrospective Studies, Fluorodeoxyglucose, Receiver operating characteristic, business.industry, Neurosciences, Alzheimer's Disease including Alzheimer's Disease Related Dementias (AD/ADRD), Retrospective cohort study, medicine.disease, Confidence interval, Brain Disorders, 4.1 Discovery and preclinical testing of markers and technologies, Data set, Test set, Positron-Emission Tomography, Dementia, business
Abstract: PURPOSE: To develop and validate a deep learning algorithm that predicts the final diagnosis of Alzheimer disease (AD), mild cognitive impairment, or neither at fluorine 18 ((18)F) fluorodeoxyglucose (FDG) PET of the brain and compare its performance to that of radiologic readers. MATERIALS AND METHODS: Prospective (18)F-FDG PET brain images from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (2109 imaging studies from 2005 to 2017, 1002 patients) and retrospective independent test set (40 imaging studies from 2006 to 2016, 40 patients) were collected. Final clinical diagnosis at follow-up was recorded. Convolutional neural network of InceptionV3 architecture was trained on 90% of ADNI data set and tested on the remaining 10%, as well as the independent test set, with performance compared to radiologic readers. Model was analyzed with sensitivity, specificity, receiver operating characteristic (ROC), saliency map, and t-distributed stochastic neighbor embedding. RESULTS: The algorithm achieved area under the ROC curve of 0.98 (95% confidence interval: 0.94, 1.00) when evaluated on predicting the final clinical diagnosis of AD in the independent test set (82% specificity at 100% sensitivity), an average of 75.8 months prior to the final diagnosis, which in ROC space outperformed reader performance (57% [four of seven] sensitivity, 91% [30 of 33] specificity; P < .05). Saliency map demonstrated attention to known areas of interest but with focus on the entire brain. CONCLUSION: By using fluorine 18 fluorodeoxyglucose PET of the brain, a deep learning algorithm developed for early prediction of Alzheimer disease achieved 82% specificity at 100% sensitivity, an average of 75.8 months prior to the final diagnosis. © RSNA, 2018 Online supplemental material is available for this article. See also the editorial by Larvie in this issue.
Published: 2018

48. Development and Validation of Deep Learning-based Automatic Detection Algorithm for Malignant Pulmonary Nodules on Chest Radiographs

Author: Jae Ho Sohn, Chang Min Park, Ju Gang Nam, Eui Jin Hwang, Jong Hyuk Lee, Kun Young Lim, Kwang Nam Jin, Sangheum Hwang, Thienkai Huy Vu, Sunggyun Park, and Jin Mo Goo
Subjects: Adult, Male, Nodule detection, Lung Neoplasms, Adolescent, Radiography, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, External data, Young Adult, 0302 clinical medicine, Deep Learning, Observer performance, Medicine, Humans, Radiology, Nuclear Medicine and imaging, Aged, Retrospective Studies, Aged, 80 and over, Receiver operating characteristic, business.industry, Deep learning, External validation, Reproducibility of Results, Mean age, Middle Aged, 030220 oncology & carcinogenesis, Multiple Pulmonary Nodules, Radiographic Image Interpretation, Computer-Assisted, Female, Radiography, Thoracic, Artificial intelligence, business, Algorithm, Algorithms
Abstract: Purpose To develop and validate a deep learning-based automatic detection algorithm (DLAD) for malignant pulmonary nodules on chest radiographs and to compare its performance with physicians including thoracic radiologists. Materials and Methods For this retrospective study, DLAD was developed by using 43 292 chest radiographs (normal radiograph-to-nodule radiograph ratio, 34 067:9225) in 34 676 patients (healthy-to-nodule ratio, 30 784:3892; 19 230 men [mean age, 52.8 years; age range, 18-99 years]; 15 446 women [mean age, 52.3 years; age range, 18-98 years]) obtained between 2010 and 2015, which were labeled and partially annotated by 13 board-certified radiologists, in a convolutional neural network. Radiograph classification and nodule detection performances of DLAD were validated by using one internal and four external data sets from three South Korean hospitals and one U.S. hospital. For internal and external validation, radiograph classification and nodule detection performances of DLAD were evaluated by using the area under the receiver operating characteristic curve (AUROC) and jackknife alternative free-response receiver-operating characteristic (JAFROC) figure of merit (FOM), respectively. An observer performance test involving 18 physicians, including nine board-certified radiologists, was conducted by using one of the four external validation data sets. Performances of DLAD, physicians, and physicians assisted with DLAD were evaluated and compared. Results According to one internal and four external validation data sets, radiograph classification and nodule detection performances of DLAD were a range of 0.92-0.99 (AUROC) and 0.831-0.924 (JAFROC FOM), respectively. DLAD showed a higher AUROC and JAFROC FOM at the observer performance test than 17 of 18 and 15 of 18 physicians, respectively (P < .05), and all physicians showed improved nodule detection performances with DLAD (mean JAFROC FOM improvement, 0.043; range, 0.006-0.190; P < .05). Conclusion This deep learning-based automatic detection algorithm outperformed physicians in radiograph classification and nodule detection performance for malignant pulmonary nodules on chest radiographs, and it enhanced physicians' performances when used as a second reader. © RSNA, 2018 Online supplemental material is available for this article.
Published: 2018

49. Automated severity grading of radiographic hip osteoarthritis features with deep learning

Author: Valentina Pedoia, C.E. von Schacky, Thomas M. Link, Felix Liu, Pia M. Jungmann, Lorenzo Nardo, Sarah C. Foreman, Michael C. Nevitt, and Jae Ho Sohn
Subjects: medicine.medical_specialty, Rheumatology, business.industry, Radiography, Biomedical Engineering, Hip osteoarthritis, Physical therapy, Medicine, Orthopedics and Sports Medicine, Severity grading, business
Published: 2019

50. Automatic Determination of the Need for Intravenous Contrast in Musculoskeletal MRI Examinations Using IBM Watson's Natural Language Processing Algorithm

Author: Jae Ho Sohn, Thienkhai Vu, Benjamin Laguna, Hari Trivedi, and Joseph Mesterhazy
Subjects: Artificial intelligence, 020205 medical informatics, Computer science, Contrast Media, 02 engineering and technology, computer.software_genre, 030218 nuclear medicine & medical imaging, Computer-Assisted, 0302 clinical medicine, 0202 electrical engineering, electronic engineering, information engineering, Musculoskeletal Diseases, Musculoskeletal System, Ground truth, Radiological and Ultrasound Technology, Imaging protocol, Supercomputer, Magnetic Resonance Imaging, Computer Science Applications, Nuclear Medicine & Medical Imaging, Injections, Intravenous, Biomedical Imaging, Patient Safety, Intravenous, Algorithm, Natural language, Natural language processing, Algorithms, Workflow efficiency, Clinical Sciences, Bioengineering, IBM Watson, Machine learning, Clinical decision support system, Article, Injections, 03 medical and health sciences, Clinical Research, Image Interpretation, Computer-Assisted, Humans, Radiology, Nuclear Medicine and imaging, Quality improvement, Image Interpretation, Natural Language Processing, Retrospective Studies, business.industry, Deep learning, Reproducibility of Results, Image Enhancement, Workflow, Test set, business, computer, Classifier (UML)
Abstract: Magnetic resonance imaging (MRI) protocoling can be time- and resource-intensive, and protocols can often be suboptimal dependent upon the expertise or preferences of the protocoling radiologist. Providing a best-practice recommendation for an MRI protocol has the potential to improve efficiency and decrease the likelihood of a suboptimal or erroneous study. The goal of this study was to develop and validate a machine learning-based natural language classifier that can automatically assign the use of intravenous contrast for musculoskeletal MRI protocols based upon the free-text clinical indication of the study, thereby improving efficiency of the protocoling radiologist and potentially decreasing errors. We utilized a deep learning-based natural language classification system from IBM Watson, a question-answering supercomputer that gained fame after challenging the best human players on Jeopardy! in 2011. We compared this solution to a series of traditional machine learning-based natural language processing techniques that utilize a term-document frequency matrix. Each classifier was trained with 1240 MRI protocols plus their respective clinical indications and validated with a test set of 280. Ground truth of contrast assignment was obtained from the clinical record. For evaluation of inter-reader agreement, a blinded second reader radiologist analyzed all cases and determined contrast assignment based on only the free-text clinical indication. In the test set, Watson demonstrated overall accuracy of 83.2% when compared to the original protocol. This was similar to the overall accuracy of 80.2% achieved by an ensemble of eight traditional machine learning algorithms based on a term-document matrix. When compared to the second reader’s contrast assignment, Watson achieved 88.6% agreement. When evaluating only the subset of cases where the original protocol and second reader were concordant (n = 251), agreement climbed further to 90.0%. The classifier was relatively robust to spelling and grammatical errors, which were frequent. Implementation of this automated MR contrast determination system as a clinical decision support tool may save considerable time and effort of the radiologist while potentially decreasing error rates, and require no change in order entry or workflow. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s10278-017-0021-3) contains supplementary material, which is available to authorized users.
Published: 2017

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

80 results on '"Jae Ho Sohn"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources