Author: "Shih, George" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Shih, George"' showing total 325 results

Start Over Author "Shih, George"

325 results on '"Shih, George"'

1. Enhancing disease detection in radiology reports through fine-tuning lightweight LLM on weak labels

Author: Wei, Yishu, Wang, Xindi, Ong, Hanley, Zhou, Yiliang, Flanders, Adam, Shih, George, and Peng, Yifan
Subjects: Computer Science - Artificial Intelligence
Abstract: Despite significant progress in applying large language models (LLMs) to the medical domain, several limitations still prevent them from practical applications. Among these are the constraints on model size and the lack of cohort-specific labeled datasets. In this work, we investigated the potential of improving a lightweight LLM, such as Llama 3.1-8B, through fine-tuning with datasets using synthetic labels. Two tasks are jointly trained by combining their respective instruction datasets. When the quality of the task-specific synthetic labels is relatively high (e.g., generated by GPT4- o), Llama 3.1-8B achieves satisfactory performance on the open-ended disease detection task, with a micro F1 score of 0.91. Conversely, when the quality of the task-relevant synthetic labels is relatively low (e.g., from the MIMIC-CXR dataset), fine-tuned Llama 3.1-8B is able to surpass its noisy teacher labels (micro F1 score of 0.67 v.s. 0.63) when calibrated against curated labels, indicating the strong inherent underlying capability of the model. These findings demonstrate the potential of fine-tuning LLMs with synthetic labels, offering a promising direction for future research on LLM specialization in the medical domain.
Published: 2024

2. Agentic LLM Workflows for Generating Patient-Friendly Medical Reports

Author: Sudarshan, Malavikha, Shih, Sophie, Yee, Estella, Yang, Alina, Zou, John, Chen, Cathy, Zhou, Quan, Chen, Leon, Singhal, Chinmay, and Shih, George
Subjects: Computer Science - Multiagent Systems
Abstract: The application of Large Language Models (LLMs) in healthcare is expanding rapidly, with one potential use case being the translation of formal medical reports into patient-legible equivalents. Currently, LLM outputs often need to be edited and evaluated by a human to ensure both factual accuracy and comprehensibility, and this is true for the above use case. We aim to minimize this step by proposing an agentic workflow with the Reflexion framework, which uses iterative self-reflection to correct outputs from an LLM. This pipeline was tested and compared to zero-shot prompting on 16 randomized radiology reports. In our multi-agent approach, reports had an accuracy rate of 94.94% when looking at verification of ICD-10 codes, compared to zero-shot prompted reports, which had an accuracy rate of 68.23%. Additionally, 81.25% of the final reflected reports required no corrections for accuracy or readability, while only 25% of zero-shot prompted reports met these criteria without needing modifications. These results indicate that our approach presents a feasible method for communicating clinical findings to patients in a quick, efficient and coherent manner whilst also retaining medical accuracy. The codebase is available for viewing at http://github.com/malavikhasudarshan/Multi-Agent-Patient-Letter-Generation., Comment: 12 pages, 7 figures
Published: 2024

3. The RSNA Abdominal Traumatic Injury CT (RATIC) Dataset

Author: Rudie, Jeffrey D., Lin, Hui-Ming, Ball, Robyn L., Jalal, Sabeena, Prevedello, Luciano M., Nicolaou, Savvas, Marinelli, Brett S., Flanders, Adam E., Magudia, Kirti, Shih, George, Davis, Melissa A., Mongan, John, Chang, Peter D., Berger, Ferco H., Hermans, Sebastiaan, Law, Meng, Richards, Tyler, Grunz, Jan-Peter, Kunz, Andreas Steven, Mathur, Shobhit, Galea-Soler, Sandro, Chung, Andrew D., Afat, Saif, Kuo, Chin-Chi, Aweidah, Layal, Campos, Ana Villanueva, Somasundaram, Arjuna, Tijmes, Felipe Antonio Sanchez, Jantarangkoon, Attaporn, Bittencourt, Leonardo Kayat, Brassil, Michael, Hajjami, Ayoub El, Dogan, Hakan, Becircic, Muris, Bharatkumar, Agrahara G., Farina, Eduardo Moreno Júdice de Mattos, Group, Dataset Curator, Group, Dataset Contributor, Group, Dataset Annotator, and Colak, Errol
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The RSNA Abdominal Traumatic Injury CT (RATIC) dataset is the largest publicly available collection of adult abdominal CT studies annotated for traumatic injuries. This dataset includes 4,274 studies from 23 institutions across 14 countries. The dataset is freely available for non-commercial use via Kaggle at https://www.kaggle.com/competitions/rsna-2023-abdominal-trauma-detection. Created for the RSNA 2023 Abdominal Trauma Detection competition, the dataset encourages the development of advanced machine learning models for detecting abdominal injuries on CT scans. The dataset encompasses detection and classification of traumatic injuries across multiple organs, including the liver, spleen, kidneys, bowel, and mesentery. Annotations were created by expert radiologists from the American Society of Emergency Radiology (ASER) and Society of Abdominal Radiology (SAR). The dataset is annotated at multiple levels, including the presence of injuries in three solid organs with injury grading, image-level annotations for active extravasations and bowel injury, and voxelwise segmentations of each of the potentially injured organs. With the release of this dataset, we hope to facilitate research and development in machine learning and abdominal trauma that can lead to improved patient care and outcomes., Comment: 40 pages, 2 figures, 3 tables
Published: 2024

4. GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist Collaboration

Author: He, Sunan, Nie, Yuxiang, Wang, Hongmei, Yang, Shu, Wang, Yihui, Cai, Zhiyuan, Chen, Zhixuan, Xu, Yingxue, Luo, Luyang, Xiang, Huiling, Lin, Xi, Wu, Mingxiang, Peng, Yifan, Shih, George, Xu, Ziyang, Wu, Xian, Wang, Qiong, Chan, Ronald Cheong Kin, Vardhanabhuti, Varut, Chu, Winnie Chiu Wing, Zheng, Yefeng, Rajpurkar, Pranav, Zhang, Kang, and Chen, Hao
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Generalist foundation models (GFMs) are renowned for their exceptional capability and flexibility in effectively generalizing across diverse tasks and modalities. In the field of medicine, while GFMs exhibit superior generalizability based on their extensive intrinsic knowledge as well as proficiency in instruction following and in-context learning, specialist models excel in precision due to their domain knowledge. In this work, for the first time, we explore the synergy between the GFM and specialist models, to enable precise medical image analysis on a broader scope. Specifically, we propose a cooperative framework, Generalist-Specialist Collaboration (GSCo), which consists of two stages, namely the construction of GFM and specialists, and collaborative inference on downstream tasks. In the construction stage, we develop MedDr, the largest open-source GFM tailored for medicine, showcasing exceptional instruction-following and in-context learning capabilities. Meanwhile, a series of lightweight specialists are crafted for downstream tasks with low computational cost. In the collaborative inference stage, we introduce two cooperative mechanisms, Mixture-of-Expert Diagnosis and Retrieval-Augmented Diagnosis, to harvest the generalist's in-context learning abilities alongside the specialists' domain expertise. For a comprehensive evaluation, we curate a large-scale benchmark featuring 28 datasets and about 250,000 images. Extensive results demonstrate that MedDr consistently outperforms state-of-the-art GFMs on downstream datasets. Furthermore, GSCo exceeds both GFMs and specialists across all out-of-domain disease diagnosis datasets. These findings indicate a significant paradigm shift in the application of GFMs, transitioning from separate models for specific tasks to a collaborative approach between GFMs and specialists, thereby advancing the frontiers of generalizable AI in medicine.
Published: 2024

5. Evaluating GPT-4 with Vision on Detection of Radiological Findings on Chest Radiographs

Author: Zhou, Yiliang, Ong, Hanley, Kennedy, Patrick, Wu, Carol, Kazam, Jacob, Hentel, Keith, Flanders, Adam, Shih, George, and Peng, Yifan
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: The study examines the application of GPT-4V, a multi-modal large language model equipped with visual recognition, in detecting radiological findings from a set of 100 chest radiographs and suggests that GPT-4V is currently not ready for real-world diagnostic usage in interpreting chest radiographs.
Published: 2024

6. Improving Fairness of Automated Chest X-ray Diagnosis by Contrastive Learning

Author: Lin, Mingquan, Li, Tianhao, Sun, Zhaoyi, Holste, Gregory, Ding, Ying, Wang, Fei, Shih, George, and Peng, Yifan
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, arms.org
Abstract: Purpose: Limited studies exploring concrete methods or approaches to tackle and enhance model fairness in the radiology domain. Our proposed AI model utilizes supervised contrastive learning to minimize bias in CXR diagnosis. Materials and Methods: In this retrospective study, we evaluated our proposed method on two datasets: the Medical Imaging and Data Resource Center (MIDRC) dataset with 77,887 CXR images from 27,796 patients collected as of April 20, 2023 for COVID-19 diagnosis, and the NIH Chest X-ray (NIH-CXR) dataset with 112,120 CXR images from 30,805 patients collected between 1992 and 2015. In the NIH-CXR dataset, thoracic abnormalities include atelectasis, cardiomegaly, effusion, infiltration, mass, nodule, pneumonia, pneumothorax, consolidation, edema, emphysema, fibrosis, pleural thickening, or hernia. Our proposed method utilizes supervised contrastive learning with carefully selected positive and negative samples to generate fair image embeddings, which are fine-tuned for subsequent tasks to reduce bias in chest X-ray (CXR) diagnosis. We evaluated the methods using the marginal AUC difference ($\delta$ mAUC). Results: The proposed model showed a significant decrease in bias across all subgroups when compared to the baseline models, as evidenced by a paired T-test (p<0.0001). The $\delta$ mAUC obtained by our method were 0.0116 (95\% CI, 0.0110-0.0123), 0.2102 (95% CI, 0.2087-0.2118), and 0.1000 (95\% CI, 0.0988-0.1011) for sex, race, and age on MIDRC, and 0.0090 (95\% CI, 0.0082-0.0097) for sex and 0.0512 (95% CI, 0.0512-0.0532) for age on NIH-CXR, respectively. Conclusion: Employing supervised contrastive learning can mitigate bias in CXR diagnosis, addressing concerns of fairness and reliability in deep learning-based diagnostic methods., Comment: 23 pages, 5 figures
Published: 2024

7. Lessons Learned in Building Expertly Annotated Multi-Institution Datasets and Hosting the RSNA AI Challenges.

Author: Kitamura, Felipe, Prevedello, Luciano, Colak, Errol, Halabi, Safwan, Lungren, Matthew, Ball, Robyn, Kalpathy-Cramer, Jayashree, Kahn, Charles, Richards, Tyler, Shih, George, Lin, Hui, Andriole, Katherine, Vazirabad, Maryam, Erickson, Bradley, Flanders, Adam, Talbott, Jason, and Mongan, John
Subjects: Artificial Intelligence, Use of AI in Education, Humans, Artificial Intelligence, Radiology, Diagnostic Imaging, Societies, Medical, North America
Abstract: The Radiological Society of North America (RSNA) has held artificial intelligence competitions to tackle real-world medical imaging problems at least annually since 2017. This article examines the challenges and processes involved in organizing these competitions, with a specific emphasis on the creation and curation of high-quality datasets. The collection of diverse and representative medical imaging data involves dealing with issues of patient privacy and data security. Furthermore, ensuring quality and consistency in data, which includes expert labeling and accounting for various patient and imaging characteristics, necessitates substantial planning and resources. Overcoming these obstacles requires meticulous project management and adherence to strict timelines. The article also highlights the potential of crowdsourced annotation to progress medical imaging research. Through the RSNA competitions, an effective global engagement has been realized, resulting in innovative solutions to complex medical imaging problems, thus potentially transforming health care by enhancing diagnostic accuracy and patient outcomes. Keywords: Use of AI in Education, Artificial Intelligence © RSNA, 2024.
Published: 2024

8. Summary of the National Cancer Institute 2023 Virtual Workshop on Medical Image De-identification—Part 2: Pathology Whole Slide Image De-identification, De-facing, the Role of AI in Image De-identification, and the NCI MIDI Datasets and Pipeline

Author: Clunie, David, Taylor, Adam, Bisson, Tom, Gutman, David, Xiao, Ying, Schwarz, Christopher G., Greve, Douglas, Gichoya, Judy, Shih, George, Kline, Adrienne, Kopchick, Ben, and Farahani, Keyvan
Published: 2024
Full Text: View/download PDF

9. Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

Author: Holste, Gregory, Zhou, Yiliang, Wang, Song, Jaiswal, Ajay, Lin, Mingquan, Zhuge, Sherry, Yang, Yuzhe, Kim, Dongkyun, Nguyen-Mau, Trong-Hieu, Tran, Minh-Triet, Jeong, Jaehyup, Park, Wongi, Ryu, Jongbin, Hong, Feng, Verma, Arsh, Yamagishi, Yosuke, Kim, Changhyun, Seo, Hyeryeong, Kang, Myungjoo, Celi, Leo Anthony, Lu, Zhiyong, Summers, Ronald M., Shih, George, Wang, Zhangyang, and Peng, Yifan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" $\unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification., Comment: Update after major revision
Published: 2023
Full Text: View/download PDF

10. How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?

Author: Holste, Gregory, Jiang, Ziyu, Jaiswal, Ajay, Hanna, Maria, Minkowitz, Shlomo, Legasto, Alan C., Escalon, Joanna G., Steinberger, Sharon, Bittman, Mark, Shen, Thomas C., Ding, Ying, Summers, Ronald M., Shih, George, Peng, Yifan, and Wang, Zhangyang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance. However, the nuanced ways in which pruning impacts model behavior are not well understood, particularly for long-tailed, multi-label datasets commonly found in clinical settings. This knowledge gap could have dangerous implications when deploying a pruned model for diagnosis, where unexpected model behavior could impact patient well-being. To fill this gap, we perform the first analysis of pruning's effect on neural networks trained to diagnose thorax diseases from chest X-rays (CXRs). On two large CXR datasets, we examine which diseases are most affected by pruning and characterize class "forgettability" based on disease frequency and co-occurrence behavior. Further, we identify individual CXRs where uncompressed and heavily pruned models disagree, known as pruning-identified exemplars (PIEs), and conduct a human reader study to evaluate their unifying qualities. We find that radiologists perceive PIEs as having more label noise, lower image quality, and higher diagnosis difficulty. This work represents a first step toward understanding the impact of pruning on model behavior in deep long-tailed, multi-label medical image classification. All code, model weights, and data access instructions can be found at https://github.com/VITA-Group/PruneCXR., Comment: Early accepted to MICCAI 2023
Published: 2023
Full Text: View/download PDF

11. Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study

Author: Holste, Gregory, Wang, Song, Jiang, Ziyu, Shen, Thomas C., Shih, George, Summers, Ronald M., Peng, Yifan, and Wang, Zhangyang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a "long-tailed" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at https://github.com/VITA-Group/LongTailCXR., Comment: DALI 2022 (MICCAI workshop)
Published: 2022
Full Text: View/download PDF

12. Radiology Text Analysis System (RadText): Architecture and Evaluation

Author: Wang, Song, Lin, Mingquan, Ding, Ying, Shih, George, Lu, Zhiyong, and Peng, Yifan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Signal Processing, Electrical Engineering and Systems Science - Systems and Control
Abstract: Analyzing radiology reports is a time-consuming and error-prone task, which raises the need for an efficient automated radiology report analysis system to alleviate the workloads of radiologists and encourage precise diagnosis. In this work, we present RadText, an open-source radiology text analysis system developed by Python. RadText offers an easy-to-use text analysis pipeline, including de-identification, section segmentation, sentence split and word tokenization, named entity recognition, parsing, and negation detection. RadText features a flexible modular design, provides a hybrid text processing schema, and supports raw text processing and local processing, which enables better usability and improved data privacy. RadText adopts BioC as the unified interface, and also standardizes the input / output into a structured representation compatible with Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). This allows for a more systematic approach to observational research across multiple, disparate data sources. We evaluated RadText on the MIMIC-CXR dataset, with five new disease labels we annotated for this work. RadText demonstrates highly accurate classification performances, with an average precision of, a recall of 0.94, and an F-1 score of 0.92. We have made our code, documentation, examples, and the test set available at https://github.com/bionlplab/radtext ., Comment: 9 pages, 2 figures, Accepted by 2022 IEEE 10th International Conference on Healthcare Informatics (ICHI)
Published: 2022

13. Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

Author: Holste, Gregory, Zhou, Yiliang, Wang, Song, Jaiswal, Ajay, Lin, Mingquan, Zhuge, Sherry, Yang, Yuzhe, Kim, Dongkyun, Nguyen-Mau, Trong-Hieu, Tran, Minh-Triet, Jeong, Jaehyup, Park, Wongi, Ryu, Jongbin, Hong, Feng, Verma, Arsh, Yamagishi, Yosuke, Kim, Changhyun, Seo, Hyeryeong, Kang, Myungjoo, Celi, Leo Anthony, Lu, Zhiyong, Summers, Ronald M., Shih, George, Wang, Zhangyang, and Peng, Yifan
Published: 2024
Full Text: View/download PDF

14. Best Practices and Scoring System on Reviewing A.I. based Medical Imaging Papers: Part 1 Classification

Author: Kline, Timothy L., Kitamura, Felipe, Pan, Ian, Korchi, Amine M., Tenenholtz, Neil, Moy, Linda, Gichoya, Judy Wawira, Santos, Igor, Blumer, Steven, Hwang, Misha Ysabel, Git, Kim-Ann, Shroff, Abishek, Walach, Elad, Shih, George, and Langer, Steve
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: With the recent advances in A.I. methodologies and their application to medical imaging, there has been an explosion of related research programs utilizing these techniques to produce state-of-the-art classification performance. Ultimately, these research programs culminate in submission of their work for consideration in peer reviewed journals. To date, the criteria for acceptance vs. rejection is often subjective; however, reproducible science requires reproducible review. The Machine Learning Education Sub-Committee of SIIM has identified a knowledge gap and a serious need to establish guidelines for reviewing these studies. Although there have been several recent papers with this goal, this present work is written from the machine learning practitioners standpoint. In this series, the committee will address the best practices to be followed in an A.I.-based study and present the required sections in terms of examples and discussion of what should be included to make the studies cohesive, reproducible, accurate, and self-contained. This first entry in the series focuses on the task of image classification. Elements such as dataset curation, data pre-processing steps, defining an appropriate reference standard, data partitioning, model architecture and training are discussed. The sections are presented as they would be detailed in a typical manuscript, with content describing the necessary information that should be included to make sure the study is of sufficient quality to be considered for publication. The goal of this series is to provide resources to not only help improve the review process for A.I.-based medical imaging papers, but to facilitate a standard for the information that is presented within all components of the research study. We hope to provide quantitative metrics in what otherwise may be a qualitative review process.
Published: 2022

15. Prior Knowledge Enhances Radiology Report Generation

Author: Wang, Song, Tang, Liyan, Lin, Mingquan, Shih, George, Ding, Ying, and Peng, Yifan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Radiology report generation aims to produce computer-aided diagnoses to alleviate the workload of radiologists and has drawn increasing attention recently. However, previous deep learning methods tend to neglect the mutual influences between medical findings, which can be the bottleneck that limits the quality of generated reports. In this work, we propose to mine and represent the associations among medical findings in an informative knowledge graph and incorporate this prior knowledge with radiology report generation to help improve the quality of generated reports. Experiment results demonstrate the superior performance of our proposed method on the IU X-ray dataset with a ROUGE-L of 0.384$\pm$0.007 and CIDEr of 0.340$\pm$0.011. Compared with previous works, our model achieves an average of 1.6% improvement (2.0% and 1.5% improvements in CIDEr and ROUGE-L, respectively). The experiments suggest that prior knowledge can bring performance gains to accurate radiology report generation. We will make the code publicly available at https://github.com/bionlplab/report_generation_amia2022., Comment: 10 pages, 4 figures, accepted by AMIA 2022 Informatics Summit
Published: 2022

16. Improving model fairness in image-based computer-aided diagnosis

Author: Lin, Mingquan, Li, Tianhao, Yang, Yifan, Holste, Gregory, Ding, Ying, Van Tassel, Sarah H., Kovacs, Kyle, Shih, George, Wang, Zhangyang, Lu, Zhiyong, Wang, Fei, and Peng, Yifan
Published: 2023
Full Text: View/download PDF

17. Test Retest Reproducibility of Organ Volume Measurements in ADPKD Using 3D Multimodality Deep Learning

Author: He, Xinzi, Hu, Zhongxiu, Dev, Hreedi, Romano, Dominick J., Sharbatdaran, Arman, Raza, Syed I., Wang, Sophie J., Teichman, Kurt, Shih, George, Chevalier, James M., Shimonov, Daniil, Blumenfeld, Jon D., Goel, Akshay, Sabuncu, Mert R., and Prince, Martin R.
Published: 2024
Full Text: View/download PDF

18. Using Radiomics as Prior Knowledge for Thorax Disease Classification and Localization in Chest X-rays

Author: Han, Yan, Chen, Chongyan, Tang, Liyan, Lin, Mingquan, Jaiswal, Ajay, Wang, Song, Tewfik, Ahmed, Shih, George, Ding, Ying, and Peng, Yifan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Chest X-ray becomes one of the most common medical diagnoses due to its noninvasiveness. The number of chest X-ray images has skyrocketed, but reading chest X-rays still have been manually performed by radiologists, which creates huge burnouts and delays. Traditionally, radiomics, as a subfield of radiology that can extract a large number of quantitative features from medical images, demonstrates its potential to facilitate medical imaging diagnosis before the deep learning era. In this paper, we develop an end-to-end framework, ChexRadiNet, that can utilize the radiomics features to improve the abnormality classification performance. Specifically, ChexRadiNet first applies a light-weight but efficient triplet-attention mechanism to classify the chest X-rays and highlight the abnormal regions. Then it uses the generated class activation map to extract radiomic features, which further guides our model to learn more robust image features. After a number of iterations and with the help of radiomic features, our framework can converge to more accurate image regions. We evaluate the ChexRadiNet framework using three public datasets: NIH ChestX-ray, CheXpert, and MIMIC-CXR. We find that ChexRadiNet outperforms the state-of-the-art on both disease detection (0.843 in AUC) and localization (0.679 in T(IoU) = 0.1). We will make the code publicly available at https://github.com/bionlplab/lung_disease_detection_amia2021, with the hope that this method can facilitate the development of automatic systems with a higher-level understanding of the radiological world., Comment: Accepted by AMIA 2021
Published: 2020

19. A Patient-Centric Dataset of Images and Metadata for Identifying Melanomas Using Clinical Context

Author: Rotemberg, Veronica, Kurtansky, Nicholas, Betz-Stablein, Brigid, Caffery, Liam, Chousakos, Emmanouil, Codella, Noel, Combalia, Marc, Dusza, Stephen, Guitera, Pascale, Gutman, David, Halpern, Allan, Kittler, Harald, Kose, Kivanc, Langer, Steve, Lioprys, Konstantinos, Malvehy, Josep, Musthaq, Shenara, Nanda, Jabpani, Reiter, Ofer, Shih, George, Stratigos, Alexander, Tschandl, Philipp, Weber, Jochen, and Soyer, H. Peter
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computers and Society, Physics - Medical Physics
Abstract: Prior skin image datasets have not addressed patient-level information obtained from multiple skin lesions from the same patient. Though artificial intelligence classification algorithms have achieved expert-level performance in controlled studies examining single images, in practice dermatologists base their judgment holistically from multiple lesions on the same patient. The 2020 SIIM-ISIC Melanoma Classification challenge dataset described herein was constructed to address this discrepancy between prior challenges and clinical practice, providing for each image in the dataset an identifier allowing lesions from the same patient to be mapped to one another. This patient-level contextual information is frequently used by clinicians to diagnose melanoma and is especially useful in ruling out false positives in patients with many atypical nevi. The dataset represents 2,056 patients from three continents with an average of 16 lesions per patient, consisting of 33,126 dermoscopic images and 584 histopathologically confirmed melanomas compared with benign melanoma mimickers., Comment: Figures: 3, Tables: 2, Pages: 12
Published: 2020

20. The RSNA International COVID-19 Open Radiology Database (RICORD).

Author: Tsai, Emily, Simpson, Scott, Lungren, Matthew, Hershman, Michelle, Roshkovan, Leonid, Colak, Errol, Erickson, Bradley, Shih, George, Stein, Anouk, Kalpathy-Cramer, Jayashree, Shen, Jody, Hafez, Mona, John, Susan, Rajiah, Prabhakar, Pogatchnik, Brian, Altinmakas, Emre, Ranschaert, Erik, Kitamura, Felipe, Topff, Laurens, Moy, Linda, Kanne, Jeffrey, Wu, Carol, and Mongan, John
Subjects: COVID-19, Databases, Factual, Global Health, Humans, Internationality, Lung, Radiography, Thoracic, Radiology, SARS-CoV-2, Societies, Medical, Tomography, X-Ray Computed
Abstract: The coronavirus disease 2019 (COVID-19) pandemic is a global health care emergency. Although reverse-transcription polymerase chain reaction testing is the reference standard method to identify patients with COVID-19 infection, chest radiography and CT play a vital role in the detection and management of these patients. Prediction models for COVID-19 imaging are rapidly being developed to support medical decision making. However, inadequate availability of a diverse annotated data set has limited the performance and generalizability of existing models. To address this unmet need, the RSNA and Society of Thoracic Radiology collaborated to develop the RSNA International COVID-19 Open Radiology Database (RICORD). This database is the first multi-institutional, multinational, expert-annotated COVID-19 imaging data set. It is made freely available to the machine learning community as a research and educational resource for COVID-19 chest imaging. Pixel-level volumetric segmentation with clinical annotations was performed by thoracic radiology subspecialists for all COVID-19-positive thoracic CT scans. The labeling schema was coordinated with other international consensus panels and COVID-19 data annotation efforts, the European Society of Medical Imaging Informatics, the American College of Radiology, and the American Association of Physicists in Medicine. Study-level COVID-19 classification labels for chest radiographs were annotated by three radiologists, with majority vote adjudication by board-certified radiologists. RICORD consists of 240 thoracic CT scans and 1000 chest radiographs contributed from four international sites. It is anticipated that RICORD will ideally lead to prediction models that can demonstrate sustained performance across populations and health care systems.
Published: 2021

21. Enhancing thoracic disease detection using chest X-rays from PubMed Central Open Access

Author: Lin, Mingquan, Hou, Bojian, Mishra, Swati, Yao, Tianyuan, Huo, Yuankai, Yang, Qian, Wang, Fei, Shih, George, and Peng, Yifan
Published: 2023
Full Text: View/download PDF

22. Erratum: Construction of a Machine Learning Dataset through Collaboration: The RSNA 2019 Brain CT Hemorrhage Challenge.

Author: Flanders, Adam, Prevedello, Luciano, Shih, George, Halabi, Safwan, Kalpathy-Cramer, Jayashree, Ball, Robyn, Stein, Anouk, Kitamura, Felipe, Lungren, Matthew, Choudhary, Gagandeep, Cala, Lesley, Coelho, Luiz, Mogensen, Monique, Morón, Fanny, Miller, Elka, Ikuta, Ichiro, Zohrabian, Vahe, McDonnell, Olivia, Lincoln, Christie, Shah, Lubdha, Joyner, David, Agarwal, Amit, Lee, Ryan, Nath, Jaya, and Mongan, John
Abstract: [This corrects the article DOI: 10.1148/ryai.2020190211.].
Published: 2020

23. Construction of a Machine Learning Dataset through Collaboration: The RSNA 2019 Brain CT Hemorrhage Challenge.

Author: Flanders, Adam, Prevedello, Luciano, Shih, George, Halabi, Safwan, Kalpathy-Cramer, Jayashree, Ball, Robyn, Mongan, John, Stein, Anouk, Kitamura, Felipe, Lungren, Matthew, Choudhary, Gagandeep, Cala, Lesley, Coelho, Luiz, Mogensen, Monique, Morón, Fanny, Miller, Elka, Ikuta, Ichiro, Zohrabian, Vahe, McDonnell, Olivia, Lincoln, Christie, Shah, Lubdha, Joyner, David, Agarwal, Amit, Lee, Ryan, and Nath, Jaya
Abstract: This dataset is composed of annotations of the five hemorrhage subtypes (subarachnoid, intraventricular, subdural, epidural, and intraparenchymal hemorrhage) typically encountered at brain CT.
Published: 2020

24. How Does Pruning Impact Long-Tailed Multi-label Medical Image Classifiers?

Author: Holste, Gregory, primary, Jiang, Ziyu, additional, Jaiswal, Ajay, additional, Hanna, Maria, additional, Minkowitz, Shlomo, additional, Legasto, Alan C., additional, Escalon, Joanna G., additional, Steinberger, Sharon, additional, Bittman, Mark, additional, Shen, Thomas C., additional, Ding, Ying, additional, Summers, Ronald M., additional, Shih, George, additional, Peng, Yifan, additional, and Wang, Zhangyang, additional
Published: 2023
Full Text: View/download PDF

25. Few-Shot Learning Geometric Ensemble for Multi-label Classification of Chest X-Rays

Author: Moukheiber, Dana, Mahindre, Saurabh, Moukheiber, Lama, Moukheiber, Mira, Wang, Song, Ma, Chunwei, Shih, George, Peng, Yifan, Gao, Mingchen, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Nguyen, Hien V., editor, Huang, Sharon X., editor, and Xue, Yuan, editor
Published: 2022
Full Text: View/download PDF

26. The RSNA Pediatric Bone Age Machine Learning Challenge.

Author: Halabi, Safwan S, Prevedello, Luciano M, Kalpathy-Cramer, Jayashree, Mamonov, Artem B, Bilbily, Alexander, Cicero, Mark, Pan, Ian, Pereira, Lucas Araújo, Sousa, Rafael Teixeira, Abdala, Nitamar, Kitamura, Felipe Campos, Thodberg, Hans H, Chen, Leon, Shih, George, Andriole, Katherine, Kohli, Marc D, Erickson, Bradley J, and Flanders, Adam E
Subjects: Humans, Image Interpretation, Computer-Assisted, Radiography, Age Determination by Skeleton, Algorithms, Databases, Factual, Child, Female, Male, Hand Bones, Machine Learning, Pediatric, Good Health and Well Being, Medical and Health Sciences, Nuclear Medicine & Medical Imaging
Abstract: Purpose The Radiological Society of North America (RSNA) Pediatric Bone Age Machine Learning Challenge was created to show an application of machine learning (ML) and artificial intelligence (AI) in medical imaging, promote collaboration to catalyze AI model creation, and identify innovators in medical imaging. Materials and Methods The goal of this challenge was to solicit individuals and teams to create an algorithm or model using ML techniques that would accurately determine skeletal age in a curated data set of pediatric hand radiographs. The primary evaluation measure was the mean absolute distance (MAD) in months, which was calculated as the mean of the absolute values of the difference between the model estimates and those of the reference standard, bone age. Results A data set consisting of 14 236 hand radiographs (12 611 training set, 1425 validation set, 200 test set) was made available to registered challenge participants. A total of 260 individuals or teams registered on the Challenge website. A total of 105 submissions were uploaded from 48 unique users during the training, validation, and test phases. Almost all methods used deep neural network techniques based on one or more convolutional neural networks (CNNs). The best five results based on MAD were 4.2, 4.4, 4.4, 4.5, and 4.5 months, respectively. Conclusion The RSNA Pediatric Bone Age Machine Learning Challenge showed how a coordinated approach to solving a medical imaging problem can be successfully conducted. Future ML challenges will catalyze collaboration and development of ML tools and methods that can potentially improve diagnostic accuracy and patient care. © RSNA, 2018 Online supplemental material is available for this article. See also the editorial by Siegel in this issue.
Published: 2019

27. Augmenting the National Institutes of Health Chest Radiograph Dataset with Expert Annotations of Possible Pneumonia

Author: Shih, George, Wu, Carol C, Halabi, Safwan S, Kohli, Marc D, Prevedello, Luciano M, Cook, Tessa S, Sharma, Arjun, Amorosa, Judith K, Arteaga, Veronica, Galperin-Aizenberg, Maya, Gill, Ritu R, Godoy, Myrna CB, Hobbs, Stephen, Jeudy, Jean, Laroia, Archana, Shah, Palmi N, Vummidi, Dharshan, Yaddanapudi, Kavitha, and Stein, Anouk
Subjects: Biological Sciences, Biomedical and Clinical Sciences, Clinical Sciences, Lung, Pneumonia, Pneumonia & Influenza, Patient Safety, Infectious Diseases
Abstract: This dataset is intended to be used for machine learning and is composed of annotations with bounding boxes for pulmonary opacity on chest radiographs which may represent pneumonia in the appropriate clinical setting.
Published: 2019

28. Challenges Related to Artificial Intelligence Research in Medical Imaging and the Importance of Image Analysis Competitions

Author: Prevedello, Luciano M, Halabi, Safwan S, Shih, George, Wu, Carol C, Kohli, Marc D, Chokshi, Falgun H, Erickson, Bradley J, Kalpathy-Cramer, Jayashree, Andriole, Katherine P, and Flanders, Adam E
Subjects: Information and Computing Sciences, Biomedical and Clinical Sciences, Clinical Sciences, Biomedical Imaging
Abstract: In recent years, there has been enormous interest in applying artificial intelligence (AI) to radiology. Although some of this interest may have been driven by exaggerated expectations that the technology can outperform radiologists in some tasks, there is a growing body of evidence that illustrates its limitations in medical imaging. The true potential of the technique probably lies somewhere in the middle, and AI will ultimately play a key role in medical imaging in the future. The limitless power of computers makes AI an ideal candidate to provide the standardization, consistency, and dependability needed to support radiologists in their mission to provide excellent patient care. However, important roadblocks currently limit the expansion of this field in medical imaging. This article reviews some of the challenges and potential solutions to advance the field forward, with focus on the experience gained by hosting image-based competitions.
Published: 2019

29. Trustworthy assertion classification through prompting

Author: Wang, Song, Tang, Liyan, Majety, Akash, Rousseau, Justin F., Shih, George, Ding, Ying, and Peng, Yifan
Published: 2022
Full Text: View/download PDF

30. Federated Deep Learning to More Reliably Detect Body Part for Hanging Protocols, Relevant Priors, and Workflow Optimization

Author: Filice, Ross W., Stein, Anouk, Pan, Ian, and Shih, George
Published: 2022
Full Text: View/download PDF

31. Deep Learning-Based Liver Cyst Segmentation in MRI for Autosomal Dominant Polycystic Kidney Disease (ADPKD)

Author: Moghadam, Mina Chookhachizadeh, primary, Aspal, Mohit, additional, He, Xinzi, additional, Romano, Dominick J, additional, Sharbatdaran, Arman, additional, Hu, Zhongxiu, additional, Teichman, Kurt, additional, He, Hui Yi Ng, additional, Sattar, Usama, additional, Zhu, Chenglin, additional, Dev, Hreedi, additional, Shimonov, Daniil, additional, Chevalier, James M, additional, Goel, Akshay, additional, Shih, George, additional, Blumenfeld, Jon D, additional, Sabuncu, Mert R, additional, and Prince, Martin R, additional
Published: 2024
Full Text: View/download PDF

32. Erratum for: Evaluating GPT-4V (GPT-4 with Vision) on Detection of Radiologic Findings on Chest Radiographs

Author: Zhou, Yiliang, primary, Ong, Hanley, additional, Kennedy, Patrick, additional, Wu, Carol C., additional, Kazam, Jacob, additional, Hentel, Keith, additional, Flanders, Adam, additional, Shih, George, additional, and Peng, Yifan, additional
Published: 2024
Full Text: View/download PDF

33. Evaluating GPT-V4 (GPT-4 with Vision) on Detection of Radiologic Findings on Chest Radiographs

Author: Zhou, Yiliang, primary, Ong, Hanley, additional, Kennedy, Patrick, additional, Wu, Carol C., additional, Kazam, Jacob, additional, Hentel, Keith, additional, Flanders, Adam, additional, Shih, George, additional, Peng, Yifan, additional, and Atzen, Sarah, additional
Published: 2024
Full Text: View/download PDF

34. Medical student education in the time of COVID-19: A virtual solution to the introductory radiology elective

Author: Belfi, Lily M., Dean, Kathryn E., Bartolotta, Roger J., Shih, George, and Min, Robert J.
Published: 2021
Full Text: View/download PDF

35. Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study

Author: Holste, Gregory, primary, Wang, Song, additional, Jiang, Ziyu, additional, Shen, Thomas C., additional, Shih, George, additional, Summers, Ronald M., additional, Peng, Yifan, additional, and Wang, Zhangyang, additional
Published: 2022
Full Text: View/download PDF

36. DICOM in Dermoscopic Research: an Experience Report and a Way Forward

Author: Caffery, Liam, Weber, Jochen, Kurtansky, Nicholas, Clunie, David, Langer, Steve, Shih, George, Halpern, Allan, and Rotemberg, Veronica
Published: 2021
Full Text: View/download PDF

37. Optimizing GPT-4 Turbo Diagnostic Accuracy in Neuroradiology through Prompt Engineering and Confidence Thresholds.

Author: Wada, Akihiko, Akashi, Toshiaki, Shih, George, Hagiwara, Akifumi, Nishizawa, Mitsuo, Hayakawa, Yayoi, Kikuta, Junko, Shimoji, Keigo, Sano, Katsuhiro, Kamagata, Koji, Nakanishi, Atsushi, and Aoki, Shigeki
Subjects: LANGUAGE models, CLINICAL decision support systems, GENERATIVE pre-trained transformers, ARTIFICIAL intelligence, CLINICAL medicine
Abstract: Background and Objectives: Integrating large language models (LLMs) such as GPT-4 Turbo into diagnostic imaging faces a significant challenge, with current misdiagnosis rates ranging from 30–50%. This study evaluates how prompt engineering and confidence thresholds can improve diagnostic accuracy in neuroradiology. Methods: We analyze 751 neuroradiology cases from the American Journal of Neuroradiology using GPT-4 Turbo with customized prompts to improve diagnostic precision. Results: Initially, GPT-4 Turbo achieved a baseline diagnostic accuracy of 55.1%. By reformatting responses to list five diagnostic candidates and applying a 90% confidence threshold, the highest precision of the diagnosis increased to 72.9%, with the candidate list providing the correct diagnosis at 85.9%, reducing the misdiagnosis rate to 14.1%. However, this threshold reduced the number of cases that responded. Conclusions: Strategic prompt engineering and high confidence thresholds significantly reduce misdiagnoses and improve the precision of the LLM diagnostic in neuroradiology. More research is needed to optimize these approaches for broader clinical implementation, balancing accuracy and utility. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. A patient-centric dataset of images and metadata for identifying melanomas using clinical context

Author: Rotemberg, Veronica, Kurtansky, Nicholas, Betz-Stablein, Brigid, Caffery, Liam, Chousakos, Emmanouil, Codella, Noel, Combalia, Marc, Dusza, Stephen, Guitera, Pascale, Gutman, David, Halpern, Allan, Helba, Brian, Kittler, Harald, Kose, Kivanc, Langer, Steve, Lioprys, Konstantinos, Malvehy, Josep, Musthaq, Shenara, Nanda, Jabpani, Reiter, Ofer, Shih, George, Stratigos, Alexander, Tschandl, Philipp, Weber, Jochen, and Soyer, H. Peter
Published: 2021
Full Text: View/download PDF

39. Publisher Correction: Author Correction: A patient-centric dataset of images and metadata for identifying melanomas using clinical context

Author: Rotemberg, Veronica, Kurtansky, Nicholas, Betz-Stablein, Brigid, Caffery, Liam, Chousakos, Emmanouil, Codella, Noel, Combalia, Marc, Dusza, Stephen, Guitera, Pascale, Gutman, David, Halpern, Allan, Helba, Brian, Kittler, Harald, Kose, Kivanc, Langer, Steve, Lioprys, Konstantinos, Malvehy, Josep, Musthaq, Shenara, Nanda, Jabpani, Reiter, Ofer, Shih, George, Stratigos, Alexander, Tschandl, Philipp, Weber, Jochen, and Soyer, H. Peter
Published: 2021
Full Text: View/download PDF

40. Author Correction: A patient-centric dataset of images and metadata for identifying melanomas using clinical context

Author: Rotemberg, Veronica, Kurtansky, Nicholas, Betz-Stablein, Brigid, Caffery, Liam, Chousakos, Emmanouil, Codella, Noel, Combalia, Marc, Dusza, Stephen, Guitera, Pascale, Gutman, David, Halpern, Allan, Helba, Brian, Kittler, Harald, Kose, Kivanc, Langer, Steve, Lioprys, Konstantinos, Malvehy, Josep, Musthaq, Shenara, Nanda, Jabpani, Reiter, Ofer, Shih, George, Stratigos, Alexander, Tschandl, Philipp, Weber, Jochen, and Soyer, H. Peter
Published: 2021
Full Text: View/download PDF

41. Crowdsourcing pneumothorax annotations using machine learning annotations on the NIH chest X-ray dataset

Author: Filice, Ross W., Stein, Anouk, Wu, Carol C., Arteaga, Veronica A., Borstelmann, Stephen, Gaddikeri, Ramya, Galperin-Aizenberg, Maya, Gill, Ritu R., Godoy, Myrna C., Hobbs, Stephen B., Jeudy, Jean, Lakhani, Paras C., Laroia, Archana, Nayak, Sundeep M., Parekh, Maansi R., Prasanna, Prasanth, Shah, Palmi, Vummidi, Dharshan, Yaddanapudi, Kavitha, and Shih, George
Published: 2020
Full Text: View/download PDF

42. Emerging Technology Commission on AI report

Author: Smith, Andrew D. and Shih, George
Published: 2021
Full Text: View/download PDF

43. Beyond the AJR: Don't Believe Everything You Read—Nearly One-Fifth of Cleared Artificial Intelligence Devices May Have Discordant Documentation

Author: Shih, George, primary and Flanders, Adam E., additional
Published: 2023
Full Text: View/download PDF

44. Test Retest Reproducibility of Organ Volume Measurements in ADPKD Using 3D Multimodality Deep Learning

Author: He, Xinzi, primary, Hu, Zhongxiu, additional, Dev, Hreedi, additional, Romano, Dominick J., additional, Sharbatdaran, Arman, additional, Raza, Syed I., additional, Wang, Sophie J., additional, Teichman, Kurt, additional, Shih, George, additional, Chevalier, James M., additional, Shimonov, Daniil, additional, Blumenfeld, Jon D., additional, Goel, Akshay, additional, Sabuncu, Mert R., additional, and Prince, Martin R., additional
Published: 2023
Full Text: View/download PDF

45. Developing a More Responsive Radiology Resident Dashboard

Author: Chen, Hongyu, Gangaram, Vineeth, and Shih, George
Published: 2019
Full Text: View/download PDF

46. Collaborative and Reproducible Research: Goals, Challenges, and Strategies

Author: Langer, Steve G., Shih, George, Nagy, Paul, and Landman, Bennet A.
Published: 2018
Full Text: View/download PDF

47. Hello World Deep Learning in Medical Imaging

Author: Lakhani, Paras, Gray, Daniel L., Pett, Carl R., Nagy, Paul, and Shih, George
Published: 2018
Full Text: View/download PDF

48. Ultrasound strain elastography in assessment of resting biceps brachii muscle stiffness in patients with Parkinson's disease: a primary observation

Author: Gao, Jing, He, Wen, Du, Li-Juan, Li, Shuo, Cheng, Ling-Gang, Shih, George, and Rubin, Jonathan
Published: 2016
Full Text: View/download PDF

49. Effect of Averaging Measurements From Multiple MRI Pulse Sequences on Kidney Volume Reproducibility in Autosomal Dominant Polycystic Kidney Disease.

Author: Dev, Hreedi, Zhu, Chenglin, Sharbatdaran, Arman, Raza, Syed I., Wang, Sophie J., Romano, Dominick J., Goel, Akshay, Teichman, Kurt, Moghadam, Mina C., Shih, George, Blumenfeld, Jon D., Shimonov, Daniil, Chevalier, James M., and Prince, Martin R.
Subjects: POLYCYSTIC kidney disease, MAGNETIC resonance imaging, ANATOMICAL planes, KIDNEYS
Abstract: Background: Total kidney volume (TKV) is an important biomarker for assessing kidney function, especially for autosomal dominant polycystic kidney disease (ADPKD). However, TKV measurements from a single MRI pulse sequence have limited reproducibility, ± ~5%, similar to ADPKD annual kidney growth rates. Purpose: To improve TKV measurement reproducibility on MRI by extending artificial intelligence algorithms to automatically segment kidneys on T1‐weighted, T2‐weighted, and steady state free precession (SSFP) sequences in axial and coronal planes and averaging measurements. Study Type: Retrospective training, prospective testing. Subjects: Three hundred ninety‐seven patients (356 with ADPKD, 41 without), 75% for training and 25% for validation, 40 ADPKD patients for testing and 17 ADPKD patients for assessing reproducibility. Field Strength/Sequence: T2‐weighted single‐shot fast spin echo (T2), SSFP, and T1‐weighted 3D spoiled gradient echo (T1) at 1.5 and 3T. Assessment: 2D U‐net segmentation algorithm was trained on images from all sequences. Five observers independently measured each kidney volume manually on axial T2 and using model‐assisted segmentations on all sequences and image plane orientations for two MRI exams in two sessions separated by 1–3 weeks to assess reproducibility. Manual and model‐assisted segmentation times were recorded. Statistical Tests: Bland–Altman, Schapiro–Wilk (normality assessment), Pearson's chi‐squared (categorical variables); Dice similarity coefficient, interclass correlation coefficient, and concordance correlation coefficient for analyzing TKV reproducibility. P‐value < 0.05 was considered statistically significant. Results: In 17 ADPKD subjects, model‐assisted segmentations of axial T2 images were significantly faster than manual segmentations (2:49 minute vs. 11:34 minute), with no significant absolute percent difference in TKV (5.9% vs. 5.3%, P = 0.88) between scans 1 and 2. Absolute percent differences between the two scans for model‐assisted segmentations on other sequences were 5.5% (axial T1), 4.5% (axial SSFP), 4.1% (coronal SSFP), and 3.2% (coronal T2). Averaging measurements from all five model‐assisted segmentations significantly reduced absolute percent difference to 2.5%, further improving to 2.1% after excluding an outlier. Data Conclusion: Measuring TKV on multiple MRI pulse sequences in coronal and axial planes is practical with deep learning model‐assisted segmentations and can improve TKV measurement reproducibility more than 2‐fold in ADPKD. Evidence Level: 2 Technical Efficacy: Stage 1 [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

50. ChatGPT and Other Large Language Models Are Double-edged Swords

Author: Shen, Yiqiu, primary, Heacock, Laura, additional, Elias, Jonathan, additional, Hentel, Keith D., additional, Reig, Beatriu, additional, Shih, George, additional, and Moy, Linda, additional
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

325 results on '"Shih, George"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources