Author: "Sven Koitka" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Sven Koitka"' showing total 36 results

Start Over Author "Sven Koitka"

36 results on '"Sven Koitka"'

1. ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset

Author: Johannes Rückert, Louise Bloch, Raphael Brüngel, Ahmad Idrissi-Yaghir, Henning Schäfer, Cynthia S. Schmidt, Sven Koitka, Obioma Pelka, Asma Ben Abacha, Alba G. Seco de Herrera, Henning Müller, Peter A. Horn, Felix Nensa, and Christoph M. Friedrich
Subjects: Science
Abstract: Abstract Automated medical image analysis systems often require large amounts of training data with high quality labels, which are difficult and time consuming to generate. This paper introduces Radiology Object in COntext version 2 (ROCOv2), a multimodal dataset consisting of radiological images and associated medical concepts and captions extracted from the PMC Open Access subset. It is an updated version of the ROCO dataset published in 2018, and adds 35,705 new images added to PMC since 2018. It further provides manually curated concepts for imaging modalities with additional anatomical and directional concepts for X-rays. The dataset consists of 79,789 images and has been used, with minor modifications, in the concept detection and caption prediction tasks of ImageCLEFmedical Caption 2023. The dataset is suitable for training image annotation models based on image-caption pairs, or for multi-label image classification using Unified Medical Language System (UMLS) concepts provided with each image. In addition, it can serve for pre-training of medical domain models, and evaluation of deep learning models for multi-task learning.
Published: 2024
Full Text: View/download PDF

2. SAROS: A dataset for whole-body region and organ segmentation in CT imaging

Author: Sven Koitka, Giulia Baldini, Lennard Kroll, Natalie van Landeghem, Olivia B. Pollok, Johannes Haubold, Obioma Pelka, Moon Kim, Jens Kleesiek, Felix Nensa, and René Hosch
Subjects: Science
Abstract: Abstract The Sparsely Annotated Region and Organ Segmentation (SAROS) dataset was created using data from The Cancer Imaging Archive (TCIA) to provide a large open-access CT dataset with high-quality annotations of body landmarks. In-house segmentation models were employed to generate annotation proposals on randomly selected cases from TCIA. The dataset includes 13 semantic body region labels (abdominal/thoracic cavity, bones, brain, breast implant, mediastinum, muscle, parotid/submandibular/thyroid glands, pericardium, spinal cord, subcutaneous tissue) and six body part labels (left/right arm/leg, head, torso). Case selection was based on the DICOM series description, gender, and imaging protocol, resulting in 882 patients (438 female) for a total of 900 CTs. Manual review and correction of proposals were conducted in a continuous quality control cycle. Only every fifth axial slice was annotated, yielding 20150 annotated slices from 28 data collections. For the reproducibility on downstream tasks, five cross-validation folds and a test set were pre-defined. The SAROS dataset serves as an open-access resource for training and evaluating novel segmentation models, covering various scanner vendors and diseases.
Published: 2024
Full Text: View/download PDF

3. Elexacaftor/tezacaftor/ivacaftor influences body composition in adults with cystic fibrosis: a fully automated CT-based analysis

Author: Dirk Westhölter, Johannes Haubold, Matthias Welsner, Luca Salhöfer, Johannes Wienker, Sivagurunathan Sutharsan, Svenja Straßburg, Christian Taube, Lale Umutlu, Benedikt M. Schaarschmidt, Sven Koitka, Sebastian Zensen, Michael Forsting, Felix Nensa, René Hosch, and Marcel Opitz
Subjects: Medicine, Science
Abstract: Abstract A poor nutritional status is associated with worse pulmonary function and survival in people with cystic fibrosis (pwCF). CF transmembrane conductance regulator modulators can improve pulmonary function and body weight, but more data is needed to evaluate its effects on body composition. In this retrospective study, a pre-trained deep-learning network was used to perform a fully automated body composition analysis on chest CTs from 66 adult pwCF before and after receiving elexacaftor/tezacaftor/ivacaftor (ETI) therapy. Muscle and adipose tissues were quantified and divided by bone volume to obtain body size-adjusted ratios. After receiving ETI therapy, marked increases were observed in all adipose tissue ratios among pwCF, including the total adipose tissue ratio (+ 46.21%, p
Published: 2024
Full Text: View/download PDF

4. AI-based digital histopathology for perihilar cholangiocarcinoma: A step, not a jump

Author: Dieter P. Hoyer, Saskia Ting, Nina Rogacka, Sven Koitka, René Hosch, Nils Flaschel, Johannes Haubold, Eugen Malamutmann, Björn-Ole Stüben, Jürgen Treckmann, Felix Nensa, and Giulia Baldini
Subjects: Klatskin, Cholangiocarcinoma, Survival analysis, Segmentation, Digital pathology, Artificial intelligence, Computer applications to medicine. Medical informatics, R858-859.7, Pathology, RB1-214
Abstract: Introduction: Perihilar cholangiocarcinoma (PHCC) is a rare malignancy with limited survival prediction accuracy. Artificial intelligence (AI) and digital pathology advancements have shown promise in predicting outcomes in cancer. We aimed to improve prognosis prediction for PHCC by combining AI-based histopathological slide analysis with clinical factors. Methods: We retrospectively analyzed 317 surgically treated PHCC patients (January 2009–December 2018) at the University Hospital of Essen. Clinical data, surgical details, pathology, and outcomes were collected. Convolutional neural networks (CNN) analyzed whole-slide images. Survival models incorporated clinical and histological features. Results: Among 142 eligible patients, independent survival predictors were tumor grade (G), tumor size (T), and intraoperative transfusion requirement. The CNN-based model combining clinical and histopathological features demonstrates proof of concept in prognosis prediction, limited by histopathological complexity and feature extraction challenges. However, the CNN-based model generated heatmaps assisting pathologists in identifying areas of interest. Conclusion: AI-based digital pathology showed potential in PHCC prognosis prediction, though refinement is necessary for clinical relevance. Future research should focus on enhancing AI models and exploring novel approaches to improve PHCC patient prognosis prediction.
Published: 2024
Full Text: View/download PDF

5. FHIR-PYrate: a data science friendly Python package to query FHIR servers

Author: René Hosch, Giulia Baldini, Vicky Parmar, Katarzyna Borys, Sven Koitka, Merlin Engelke, Kamyar Arzideh, Moritz Ulrich, and Felix Nensa
Subjects: Electronic patient record, FHIR, Python, Dataframe, Information extraction, Dicom, Public aspects of medicine, RA1-1270
Abstract: Abstract Background We present FHIR-PYrate, a Python package to handle the full clinical data collection and extraction process. The software is to be plugged into a modern hospital domain, where electronic patient records are used to handle the entire patient’s history. Most research institutes follow the same procedures to build study cohorts, but mainly in a non-standardized and repetitive way. As a result, researchers spend time writing boilerplate code, which could be used for more challenging tasks. Methods The package can improve and simplify existing processes in the clinical research environment. It collects all needed functionalities into a straightforward interface that can be used to query a FHIR server, download imaging studies and filter clinical documents. The full capacity of the search mechanism of the FHIR REST API is available to the user, leading to a uniform querying process for all resources, thus simplifying the customization of each use case. Additionally, valuable features like parallelization and filtering are included to make it more performant. Results As an exemplary practical application, the package can be used to analyze the prognostic significance of routine CT imaging and clinical data in breast cancer with tumor metastases in the lungs. In this example, the initial patient cohort is first collected using ICD-10 codes. For these patients, the survival information is also gathered. Some additional clinical data is retrieved, and CT scans of the thorax are downloaded. Finally, the survival analysis can be computed using a deep learning model with the CT scans, the TNM staging and positivity of relevant markers as input. This process may vary depending on the FHIR server and available clinical data, and can be customized to cover even more use cases. Conclusions FHIR-PYrate opens up the possibility to quickly and easily retrieve FHIR data, download image data, and search medical documents for keywords within a Python package. With the demonstrated functionality, FHIR-PYrate opens an easy way to assemble research collectives automatically.
Published: 2023
Full Text: View/download PDF

6. Fully automated preoperative liver volumetry incorporating the anatomical location of the central hepatic vein

Author: Sven Koitka, Phillip Gudlin, Jens M. Theysohn, Arzu Oezcelik, Dieter P. Hoyer, Murat Dayangac, René Hosch, Johannes Haubold, Nils Flaschel, Felix Nensa, and Eugen Malamutmann
Subjects: Medicine, Science
Abstract: Abstract The precise preoperative calculation of functional liver volumes is essential prior major liver resections, as well as for the evaluation of a suitable donor for living donor liver transplantation. The aim of this study was to develop a fully automated, reproducible, and quantitative 3D volumetry of the liver from standard CT examinations of the abdomen as part of routine clinical imaging. Therefore, an in-house dataset of 100 venous phase CT examinations for training and 30 venous phase ex-house CT examinations with a slice thickness of 5 mm for testing and validating were fully annotated with right and left liver lobe. Multi-Resolution U-Net 3D neural networks were employed for segmenting these liver regions. The Sørensen-Dice coefficient was greater than 0.9726 ± 0.0058, 0.9639 ± 0.0088, and 0.9223 ± 0.0187 and a mean volume difference of 32.12 ± 19.40 ml, 22.68 ± 21.67 ml, and 9.44 ± 27.08 ml compared to the standard of reference (SoR) liver, right lobe, and left lobe annotation was achieved. Our results show that fully automated 3D volumetry of the liver on routine CT imaging can provide reproducible, quantitative, fast and accurate results without needing any examiner in the preoperative work-up for hepatobiliary surgery and especially for living donor liver transplantation.
Published: 2022
Full Text: View/download PDF

7. Biomarkers extracted by fully automated body composition analysis from chest CT correlate with SARS-CoV-2 outcome severity

Author: René Hosch, Simone Kattner, Marc Moritz Berger, Thorsten Brenner, Johannes Haubold, Jens Kleesiek, Sven Koitka, Lennard Kroll, Anisa Kureishi, Nils Flaschel, and Felix Nensa
Subjects: Medicine, Science
Abstract: Abstract The complex process of manual biomarker extraction from body composition analysis (BCA) has far restricted the analysis of SARS-CoV-2 outcomes to small patient cohorts and a limited number of tissue types. We investigate the association of two BCA-based biomarkers with the development of severe SARS-CoV-2 infections for 918 patients (354 female, 564 male) regarding disease severity and mortality (186 deceased). Multiple tissues, such as muscle, bone, or adipose tissue are used and acquired with a deep-learning-based, fully-automated BCA from computed tomography images of the chest. The BCA features and markers were univariately analyzed with a Shapiro–Wilk and two-sided Mann–Whitney-U test. In a multivariate approach, obtained markers were adjusted by a defined set of laboratory parameters promoted by other studies. Subsequently, the relationship between the markers and two endpoints, namely severity and mortality, was investigated with regard to statistical significance. The univariate approach showed that the muscle volume was significant for female (p severity ≤ 0.001, p mortality ≤ 0.0001) and male patients (p severity = 0.018, p mortality ≤ 0.0001) regarding the severity and mortality endpoints. For male patients, the intra- and intermuscular adipose tissue (IMAT) (p ≤ 0.0001), epicardial adipose tissue (EAT) (p ≤ 0.001) and pericardial adipose tissue (PAT) (p ≤ 0.0001) were significant regarding the severity outcome. With the mortality outcome, muscle (p ≤ 0.0001), IMAT (p ≤ 0.001), EAT (p = 0.011) and PAT (p = 0.003) remained significant. For female patients, bone (p ≤ 0.001), IMAT (p = 0.032) and PAT (p = 0.047) were significant in univariate analyses regarding the severity and bone (p = 0.005) regarding the mortality. Furthermore, the defined sarcopenia marker (p ≤ 0.0001, for female and male) was significant for both endpoints. The cardiac marker was significant for severity (pfemale = 0.014, pmale ≤ 0.0001) and for mortality (pfemale ≤ 0.0001, pmale ≤ 0.0001) endpoint for both genders. The multivariate logistic regression showed that the sarcopenia marker was significant (p severity = 0.006, p mortality = 0.002) for both endpoints (ORseverity = 0.42, 95% CIseverity: 0.23–0.78, ORmortality = 0.34, 95% CImortality: 0.17–0.67). The cardiac marker showed significance (p = 0.018) only for the severity endpoint (OR = 1.42, 95% CI 1.06–1.90). The association between BCA-based sarcopenia and cardiac biomarkers and disease severity and mortality suggests that these biomarkers can contribute to the risk stratification of SARS-CoV-2 patients. Patients with a higher cardiac marker and a lower sarcopenia marker are at risk for a severe course or death. Whether those biomarkers hold similar importance for other pneumonia-related diseases requires further investigation.
Published: 2022
Full Text: View/download PDF

8. CT-derived body composition analysis could possibly replace DXA and BIA to monitor NET-patients

Author: Lennard Kroll, Annie Mathew, Giulia Baldini, René Hosch, Sven Koitka, Jens Kleesiek, Christoph Rischpler, Johannes Haubold, Dagmar Fuhrer, Felix Nensa, and Harald Lahner
Subjects: Medicine, Science
Abstract: Abstract Patients with neuroendocrine tumors of gastro-entero-pancreatic origin (GEP-NET) experience changes in fat and muscle composition. Dual-energy X-ray absorptiometry (DXA) and bioelectrical impedance analysis (BIA) are currently used to analyze body composition. Changes thereof could indicate cancer progression or response to treatment. This study examines the correlation between CT-based (computed tomography) body composition analysis (BCA) and DXA or BIA measurement. 74 GEP-NET-patients received whole-body [68Ga]-DOTATOC-PET/CT, BIA, and DXA-scans. BCA was performed based on the non-contrast-enhanced, 5 mm, whole-body-CT images. BCA from CT shows a strong correlation between body fat ratio with DXA (r = 0.95, ρC = 0.83) and BIA (r = 0.92, ρC = 0.76) and between skeletal muscle ratio with BIA: r = 0.81, ρC = 0.49. The deep learning-network achieves highly accurate results (mean Sørensen-Dice-score 0.93). Using BCA on routine Positron emission tomography/CT-scans to monitor patients’ body composition in the diagnostic workflow can reduce additional exams whilst substantially amplifying measurement in slower progressing cancers such as GEP-NET.
Published: 2022
Full Text: View/download PDF

9. Ossification area localization in pediatric hand radiographs using deep neural networks for object detection.

Author: Sven Koitka, Aydin Demircioglu, Moon S Kim, Christoph M Friedrich, and Felix Nensa
Subjects: Medicine, Science
Abstract: BACKGROUND:Detection of ossification areas of hand bones in X-ray images is an important task, e.g. as a preprocessing step in automated bone age estimation. Deep neural networks have emerged recently as de facto standard detection methods, but their drawback is the need of large annotated datasets. Finetuning pre-trained networks is a viable alternative, but it is not clear a priori if training with small annotated datasets will be successful, as it depends on the problem at hand. In this paper, we show that pre-trained networks can be utilized to produce an effective detector of ossification areas in pediatric X-ray images of hands. METHODS AND FINDINGS:A publicly available Faster R-CNN network, pre-trained on the COCO dataset, was utilized and finetuned with 240 manually annotated radiographs from the RSNA Pediatric Bone Age Challenge, which comprises over 14.000 pediatric radiographs. The validation is done on another 89 radiographs from the dataset and the performance is measured by Intersection-over-Union (IoU). To understand the effect of the data size on the pre-trained network, subsampling was applied to the training data and the training was repeated. Additionally, the network was trained from scratch without any pre-trained weights. Finally, to understand whether the trained model could be useful, we compared the inference of the network to an annotation of an expert radiologist. The finetuned network was able to achieve an average precision (mAP@0.5IoU) of 92.92 ± 1.93. Apart from the wrist region, all ossification areas were able to benefit from more data. In contrast, the network trained from scratch was not able to produce any correct results. When compared to the annotations of the expert radiologist, the network was able to localize the regions quite well, as the F1-Score was on average 91.85 ± 1.06. CONCLUSIONS:By finetuning a pre-trained deep neural network, with 240 annotated radiographs, we were able to successfully detect ossification areas in prediatric hand radiographs.
Published: 2018
Full Text: View/download PDF

10. Unconditionally Generated and Pseudo-Labeled Synthetic Images for Diabetic Foot Ulcer Segmentation Dataset Extension.

Author: Raphael Brüngel, Sven Koitka, and Christoph M. Friedrich
Published: 2022
Full Text: View/download PDF

11. Radiology Objects in COntext (ROCO): A Multimodal Image Dataset.

Author: Obioma Pelka, Sven Koitka, Johannes Rückert, Felix Nensa, and Christoph M. Friedrich
Published: 2018
Full Text: View/download PDF

12. Predicting Individual Patient Platelet Demand in a Large Tertiary Care Hospital Using Machine Learning

Author: Merlin Engelke, Christian Martin Brieske, Vicky Parmar, Nils Flaschel, Anisa Kureishi, Rene Hosch, Sven Koitka, Cynthia Sabrina Schmidt, Peter A. Horn, and Felix Nensa
Subjects: Medizin, Immunology and Allergy, Hematology
Abstract: Introduction: An increasing shortage of donor blood is expected, considering the demographic change in Germany. Due to the short shelf life and varying daily fluctuations in consumption, the storage of platelet concentrates (PCs) becomes challenging. This emphasizes the need for reliable prediction of needed PCs for the blood bank inventories. Therefore, the objective of this study was to evaluate multimodal data from multiple source systems within a hospital to predict the number of platelet transfusions in 3 days on a per-patient level. Methods: Data were collected from 25,190 (42% female and 58% male) patients between 2017 and 2021. For each patient, the number of received PCs, platelet count blood tests, drugs causing thrombocytopenia, acute platelet diseases, procedures, age, gender, and the period of a patient’s hospital stay were collected. Two models were trained on samples using a sliding window of 7 days as input and a day 3 target. The model predicts whether a patient will be transfused 3 days in the future. The model was trained with an excessive hyperparameter search using patient-level repeated 5-fold cross-validation to optimize the average macro F2-score. Results: The trained models were tested on 5,022 unique patients. The best-performing model has a specificity of 0.99, a sensitivity of 0.37, an area under the precision-recall curve score of 0.45, an MCC score of 0.43, and an F1-score of 0.43. However, the model does not generalize well for cases when the need for a platelet transfusion is recognized. Conclusion: A patient AI-based platelet forecast could improve logistics management and reduce blood product waste. In this study, we build the first model to predict patient individual platelet demand. To the best of our knowledge, we are the first to introduce this approach. Our model predicts the need for platelet units for 3 days in the future. While sensitivity underperforms, specificity performs reliably. The model may be of clinical use as a pretest for potential patients needing a platelet transfusion within the next 3 days. As sensitivity needs to be improved, further studies should introduce deep learning and wider patient characterization to the methodological multimodal, multisource data approach. Furthermore, a hospital-wide consumption of PCs could be derived from individual predictions.
Published: 2023

13. Early Detection of Depression Based on Linguistic Metadata Augmented Classifiers Revisited - Best of the eRisk Lab Submission.

Author: Marcel Trotzek, Sven Koitka, and Christoph M. Friedrich
Published: 2017
Full Text: View/download PDF

14. Optimized Convolutional Neural Network Ensembles for Medical Subfigure Classification.

Author: Sven Koitka and Christoph M. Friedrich
Published: 2017
Full Text: View/download PDF

15. Traditional Feature Engineering and Deep Learning Approaches at Medical Classification Task of ImageCLEF 2016.

Author: Sven Koitka and Christoph M. Friedrich
Published: 2016

16. Word Embeddings and Linguistic Metadata at the CLEF 2018 Tasks for Early Detection of Depression and Anorexia.

Author: Marcel Trotzek, Sven Koitka, and Christoph M. Friedrich
Published: 2018

17. Adipopenia in body composition analysis: a promising imaging biomarker and potential predictive factor for patients undergoing transjugular intrahepatic portosystemic shunt placement

Author: Georgios Luca Alatzides, Johannes Haubold, Hannah Luisa Steinberg, Sven Koitka, Vicky Parmar, Johannes Grueneisen, Amos Cornelius Zeller, Hartmut Schmidt, Jens Matthias Theysohn, Yan Li, Felix Nensa, and Benedikt Michael Schaarschmidt
Subjects: Medizin, Radiology, Nuclear Medicine and imaging, General Medicine
Abstract: Objective: Body tissue composition plays a crucial role in the multisystemic processes of advanced liver disease and has been shown to be influenced by transjugular intrahepatic portosystemic shunt (TIPS). A differentiated analysis of the various tissue compartments has not been performed until now. The purpose of this study was to evaluate the value of imaging biomarkers derived from automated body composition analysis (BCA) to predict clinical and functional outcome. Methods: A retrospective analysis of 56 patients undergoing TIPS procedure between 2013 and 2021 was performed. BCA on the base of pre-interventional CT examination was used to determine quantitative data as well as ratios of bone, muscle and fat masses. Furthermore, a BCA-derived sarcopenia marker was investigated. Regarding potential correlations between BCA imaging biomarkers and the occurrence of hepatic encephalopathy (HE) as well as 1-year survival, an exploratory analysis was conducted. Results: No BCA imaging biomarker was associated with the occurrence of HE after TIPS placement. However, there were significant differences in alive and deceased patients regarding the BCA-derived sarcopenia marker (alive: 1.60, deceased: 1.83, p = 0.046), ratios of intra- and intermuscular fat/skeletal volume (alive: 0.53, deceased: 0.31, p = 0.015) and intra- and intermuscular fat/muscle volume (alive: 0.21, deceased: 0.14, p = 0.031). Conclusion: A lower amount of intra- and intermuscular adipose tissue might have protective effects regarding liver derived complications and survival. Advances in knowledge: Precise characterization of body tissue components with automated BCA might provide prognostic information in patients with advanced liver disease undergoing TIPS procedure.
Published: 2023

18. Recognizing Bird Species in Audio Files Using Transfer Learning.

Author: Andreas Fritzler, Sven Koitka, and Christoph M. Friedrich
Published: 2017

19. Linguistic Metadata Augmented Classifiers at the CLEF 2017 Task for Early Detection of Depression.

Author: Marcel Trotzek, Sven Koitka, and Christoph M. Friedrich
Published: 2017

20. Improving Model Performance for Plant Image Classification With Filtered Noisy Images.

Author: Andreas R. Ludwig, Helga Piorek, Andreas H. Kelch, David Rex, Sven Koitka, and Christoph M. Friedrich
Published: 2017

21. Unconditionally Generated and Pseudo-Labeled Synthetic Images for Diabetic Foot Ulcer Segmentation Dataset Extension

Author: Raphael Brüngel, Sven Koitka, and Christoph M. Friedrich
Subjects: Medizin
Published: 2023

22. Preoperative computed tomography volumetry and graft weight estimation of left lateral segment in pediatric living donor liver transplantation

Author: Martynas Tamulevicius, Sven Koitka, Jens M Theysohn, Arzu Oezcelik, Dieter P Hoyer, Farhad Farzaliyev, Johannes Haubold, Nils Flaschel, Felix Nensa, Juergen Treckmann, and Eugen Malamutmann
Abstract: Introduction: Liver volumetry based on a CT-Scan is widely used for estimating liver volume before any liver resection, especially for pediatric living donor liver transplantation (LDLT). The "One-to-one" conversion rule for liver volume to weight has been widely adopted worldwide for many years. However, most recent analyses discuss this approach. Methods: The study retrospectively included consecutive donors undergoing left lateral hepatectomy for pediatric LDLT between December 2008 and September 2020. All donors were healthy adults who met the evaluation criteria for pediatric LDLT and underwent a preoperative contrast-enhanced CT scan. Manual segmentation of the left lateral liver lobe for volume (GV) estimation and intraoperative measurement of an actual graft weight (AGW) was performed. A relationship between estimated GV and AGW was analyzed. Results: 94 living liver donors were included in the study. The mean AGW was approximately 283.4 ± 68.5 grams, and the mean GV was 244.9 ± 63.86 ml. There was a strong correlation between the GV and AGW (r = 0.804, p < 0.001). The Bland-Altman analysis revealed an interreader agreement of 38.0 ± 97.25 and an intraclass correlation coefficient showed an almost excellent agreement (0.840, p < 0.001). The conversion formula for calculating graft weight (GW) based on computed tomography volumetry was determined based on regression analysis: 0.88 x GV + 41.63. Conclusion: Estimating left liver GW using "one-to-one" rule is subject to measurable variability in calculated graft weights and tends to underestimate it. Instead, a conversion formula should be used to determine donor GW more accurately.
Published: 2022

23. Differentiation Between Anteroposterior and Posteroanterior Chest X-Ray View Position With Convolutional Neural Networks

Author: Felix Nensa, René Hosch, Sven Koitka, and Lennard Kroll
Subjects: Male, Computer science, Medizin, Convolutional neural network, Patient Positioning, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, DICOM, Deep Learning, 0302 clinical medicine, Radiologists, medicine, Humans, Radiology, Nuclear Medicine and imaging, Retrospective Studies, Network architecture, medicine.diagnostic_test, business.industry, Deep learning, Pattern recognition, Thorax, Visualization, Radiography, 030228 respiratory system, Area Under Curve, Data quality, Female, Neural Networks, Computer, Artificial intelligence, Error detection and correction, business, Chest radiograph, Algorithms
Abstract: Detection and validation of the chest X-ray view position with use of convolutional neural networks to improve meta-information for data cleaning within a hospital data infrastructure. Within this paper we developed a convolutional neural network which automatically detects the anteroposterior and posteroanterior view position of a chest radiograph. We trained two different network architectures (VGG variant and ResNet-34) with data published by the RSNA (26 684 radiographs, class distribution 46 % AP, 54 % PA) and validated these on a self-compiled dataset with data from the University Hospital Essen (4507, radiographs, class distribution 55 % PA, 45 % AP) labeled by a human reader. For visualization and better understanding of the network predictions, a Grad-CAM was generated for each network decision. The network results were evaluated based on the accuracy, the area under the curve (AUC), and the F1-score against the human reader labels. Also a final performance comparison between model predictions and DICOM labels was performed. The ensemble models reached accuracy and F1-scores greater than 95 %. The AUC reaches more than 0.99 for the ensemble models. The Grad-CAMs provide insight as to which anatomical structures contributed to a decision by the networks which are comparable with the ones a radiologist would use. Furthermore, the trained models were able to generalize over mislabeled examples, which was found by comparing the human reader labels to the predicted labels as well as the DICOM labels. The results show that certain incorrectly entered meta-information of radiological images can be effectively corrected by deep learning in order to increase data quality in clinical application as well as in research. · The predictions for both view positions are accurate with respect to external validation data.. · The networks based their decisions on anatomical structures and key points that were in-line with prior knowledge and human understanding.. · Final models were able to detect labeling errors within the test dataset..· Hosch R, Kroll L, Nensa F et al. Differentiation Between Anteroposterior and Posteroanterior Chest X-Ray View Position With Convolutional Neural Networks. Fortschr Röntgenstr 2021; 193: 168 - 176.ZIEL: Detektion der Röntgen-Thorax-Aufnahmeposition anhand von Convolutional Neural Networks zur Verbesserung und Bereinigung von Metainformationen innerhalb der Dateninfrastruktur eines Krankenhauses. Innerhalb dieser Studie wurde ein Convolutional Neural Network entwickelt, das automatisch die verwendete Anterior-posterior- bzw. Posterior-anterior-Aufnahmeprojektion einer Röntgen-Thoraxaufnahme erkennt. Es wurden 2 unterschiedliche Netzwerkarchitekturen (VGG Variante und ResNet-34) auf Basis von Daten der RSNA (26 684 Röntgenaufnahmen, Klassenverteilung: 46 % AP, 54 % PA) trainiert und anschließend auf einem zusammengestellten hauseigenen Datensatz (Verwendung von manuellen Labeln) aus dem Datenbestand des Universitätsklinikums Essen (4507 Röntgenaufnahmen, Klassenverteilung: 55 % PA, 45 % AP) getestet. Für eine bessere Nachvollziehbarkeit der getätigten Vorhersagen der Modelle wurde zudem für jede Vorhersage eine Grad-CAM generiert. Die Resultate der Modelle wurden anhand der Accuracy, der Area under the Curve (AUC) und dem F1-Score berechnet auf Basis des Abgleichs der manuellen Label. Abschließend wurde zudem die Genauigkeit der Modellvorhersagen und der DICOM-Label anhand des Vergleichs mit den manuellen Labeln berechnet. Die zusammengefassten Modelle erreichten Accuracy- und F1-Score-Werte von mehr als 95 %. Alle Modelle erreichten eine AUC von über 0,99. Die generierten Grad-CAMSs zeigen, dass die Modelle relevante anatomische Referenzpunkte für ihre Vorhersage nutzen, die auch ein Radiologe für eine Unterscheidung heranziehen würde. Zudem zeigen die antrainierten Modelle die Fähigkeit zur Generalisierung, da diese auch falsch gekennzeichnete Röntgenbilder richtig einordnen können, was durch den Vergleich der manuellen Label mit den jeweiligen Modellvorhersagen und den DICOM-Labeln ersichtlich wurde. Die Resultate zeigen, dass falsch eingetragene Metainformationen innerhalb der radiologischen Bildgebung effektiv durch den Einsatz von Deep Learning korrigiert und somit die Datenqualität sowohl für die klinische Anwendung als auch für die Forschung erhöht werden können. · Die trainierten Modelle erzielen akkurate Vorhersagen auf externen Validierungsdaten.. · Die Netzwerke treffen ihre Vorhersagen basierend auf anatomischen Strukturen und Referenzpunkten, die mit dem menschlichen Fachwissen übereinstimmen.. · Die finalen Modelle konnten Label-Fehler in dem Testdatensatz finden..
Published: 2020

24. Artificial intelligence guided enhancement of digital PET : Scans as fast as CT?

Author: René Hosch, Manuel Weber, Miriam Sraieb, Nils Flaschel, Johannes Haubold, Moon-Sung Kim, Lale Umutlu, Jens Kleesiek, Ken Herrmann, Felix Nensa, Christoph Rischpler, Sven Koitka, Robert Seifert, and David Kersting
Subjects: Fluorodeoxyglucose F18, Artificial Intelligence, Positron Emission Tomography Computed Tomography, Positron-Emission Tomography, Medizin, Humans, Radiology, Nuclear Medicine and imaging, General Medicine, Prospective Studies, Tomography, X-Ray Computed
Abstract: Purpose Both digital positron emission tomography (PET) detector technologies and artificial intelligence based image post-reconstruction methods allow to reduce the PET acquisition time while maintaining diagnostic quality. The aim of this study was to acquire ultra-low-count fluorodeoxyglucose (FDG) ExtremePET images on a digital PET/computed tomography (CT) scanner at an acquisition time comparable to a CT scan and to generate synthetic full-dose PET images using an artificial neural network. Methods This is a prospective, single-arm, single-center phase I/II imaging study. A total of 587 patients were included. For each patient, a standard and an ultra-low-count FDG PET/CT scan (whole-body acquisition time about 30 s) were acquired. A modified pix2pixHD deep-learning network was trained employing 387 data sets as training and 200 as test cohort. Three models (PET-only and PET/CT with or without group convolution) were compared. Detectability and quantification were evaluated. Results The PET/CT input model with group convolution performed best regarding lesion signal recovery and was selected for detailed evaluation. Synthetic PET images were of high visual image quality; mean absolute lesion SUVmax (maximum standardized uptake value) difference was 1.5. Patient-based sensitivity and specificity for lesion detection were 79% and 100%, respectively. Not-detected lesions were of lower tracer uptake and lesion volume. In a matched-pair comparison, patient-based (lesion-based) detection rate was 89% (78%) for PERCIST (PET response criteria in solid tumors)-measurable and 36% (22%) for non PERCIST-measurable lesions. Conclusion Lesion detectability and lesion quantification were promising in the context of extremely fast acquisition times. Possible application scenarios might include re-staging of late-stage cancer patients, in whom assessment of total tumor burden can be of higher relevance than detailed evaluation of small and low-uptake lesions.
Published: 2022

25. From Bag-of-Words to Transformers : A Deep Dive into the Participation in the eRisk Early Risk Detection of Depression Tasks with Classical and New Approaches

Author: Hendrik Damm, Marcel Trotzek, Sven Koitka, Henning Schäfer, Ahmad Idrissi-Yaghir, and Christoph M. Friedrich
Subjects: Informatik, Medizin
Published: 2022

26. Big Imaging Data: Klinische Bildanalyse mit Radiomics und Deep Learning

Author: Felix Nensa, Aydin Demircioglu, and Sven Koitka
Subjects: Gynecology, 03 medical and health sciences, medicine.medical_specialty, 0302 clinical medicine, Radiomics, 030220 oncology & carcinogenesis, Political science, medicine, 030218 nuclear medicine & medical imaging
Abstract: ZusammenfassungRadiomics ist eine Methode der medizinischen Bildanalyse, bei der quantitative Merkmale aus Bilddaten extrahiert und mittels Machine Learning zu prädiktiven Modellen weiterverarbeitet werden. Ziel dieser Arbeit ist es, die technischen Grundlagen von Radiomics und mögliche klinische Anwendungen unter besonderer Berücksichtigung nuklearmedizinischer Daten zu erläutern. Dabei wird zunächst die klassische Radiomics-Methode besprochen, welche auf einer exakten Segmentierung der zu analysierenden Pathologie beruht und bei der die Features manuell definiert werden müssen. Anschließend wird auf das noch wenig verbreitete, allerdings vielversprechende Deep Learning basierte Radiomics eingegangen, dessen Vorteile darin liegen, dass ausschließlich datengetrieben gearbeitet wird und daher weder exakte Segmentierungen noch manuelle Definitionen der Features benötigt werden. Abschließend werden einige Anwendungen von Radiomics besprochen, die zukünftig im klinischen Alltag eine Rolle spielen könnten.
Published: 2019

27. Contrast agent dose reduction in computed tomography with deep learning using a conditional generative adversarial network

Author: Johannes Haubold, Axel Wetter, Alexander Radbruch, Michael Forsting, Sven Koitka, Felix Nensa, Patrizia Haubold, Lale Umutlu, and René Hosch
Subjects: medicine.medical_specialty, Image quality, media_common.quotation_subject, Medizin, Computed tomography, 02 engineering and technology, Signal-To-Noise Ratio, 030218 nuclear medicine & medical imaging, Reduction (complexity), 03 medical and health sciences, 0302 clinical medicine, Deep Learning, Similarity (network science), Consistency (statistics), Image processing, computer-assisted, 0202 electrical engineering, electronic engineering, information engineering, medicine, Contrast (vision), Animals, Humans, Radiology, Nuclear Medicine and imaging, media_common, medicine.diagnostic_test, Drug Tapering, business.industry, Deep learning, Contrast media, Pattern recognition, General Medicine, Tomography, spiral computed, Imaging Informatics and Artificial Intelligence, 020201 artificial intelligence & image processing, Dose reduction, Radiology, Artificial intelligence, business, Tomography, X-Ray Computed
Abstract: Objectives To reduce the dose of intravenous iodine-based contrast media (ICM) in CT through virtual contrast-enhanced images using generative adversarial networks. Methods Dual-energy CTs in the arterial phase of 85 patients were randomly split into an 80/20 train/test collective. Four different generative adversarial networks (GANs) based on image pairs, which comprised one image with virtually reduced ICM and the original full ICM CT slice, were trained, testing two input formats (2D and 2.5D) and two reduced ICM dose levels (−50% and −80%). The amount of intravenous ICM was reduced by creating virtual non-contrast series using dual-energy and adding the corresponding percentage of the iodine map. The evaluation was based on different scores (L1 loss, SSIM, PSNR, FID), which evaluate the image quality and similarity. Additionally, a visual Turing test (VTT) with three radiologists was used to assess the similarity and pathological consistency. Results The −80% models reach an SSIM of > 98%, PSNR of > 48, L1 of between 7.5 and 8, and an FID of between 1.6 and 1.7. In comparison, the −50% models reach a SSIM of > 99%, PSNR of > 51, L1 of between 6.0 and 6.1, and an FID between 0.8 and 0.95. For the crucial question of pathological consistency, only the 50% ICM reduction networks achieved 100% consistency, which is required for clinical use. Conclusions The required amount of ICM for CT can be reduced by 50% while maintaining image quality and diagnostic accuracy using GANs. Further phantom studies and animal experiments are required to confirm these initial results. Key Points • The amount of contrast media required for CT can be reduced by 50% using generative adversarial networks. • Not only the image quality but especially the pathological consistency must be evaluated to assess safety. • A too pronounced contrast media reduction could influence the pathological consistency in our collective at 80%.
Published: 2020

28. Fully automated body composition analysis in routine CT imaging using 3D semantic segmentation convolutional neural networks

Author: Lennard Kroll, Eugen Malamutmann, Sven Koitka, Arzu Oezcelik, and Felix Nensa
Subjects: FOS: Computer and information sciences, medicine.medical_specialty, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Medizin, Convolutional neural network, Hounsfield scale, Abdomen, FOS: Electrical engineering, electronic engineering, information engineering, Image Processing, Computer-Assisted, Medicine, Humans, Radiology, Nuclear Medicine and imaging, Segmentation, Neuroradiology, business.industry, Thoracic cavity, Image and Video Processing (eess.IV), Ultrasound, Correction, Deep learning, General Medicine, Electrical Engineering and Systems Science - Image and Video Processing, Semantics, Computer-assisted image analysis, medicine.anatomical_structure, Imaging Informatics and Artificial Intelligence, Body Composition, Body region, Radiology, Neural Networks, Computer, business, Tomography, X-Ray Computed
Abstract: Objectives Body tissue composition is a long-known biomarker with high diagnostic and prognostic value not only in cardiovascular, oncological, and orthopedic diseases but also in rehabilitation medicine or drug dosage. In this study, the aim was to develop a fully automated, reproducible, and quantitative 3D volumetry of body tissue composition from standard CT examinations of the abdomen in order to be able to offer such valuable biomarkers as part of routine clinical imaging. Methods Therefore, an in-house dataset of 40 CTs for training and 10 CTs for testing were fully annotated on every fifth axial slice with five different semantic body regions: abdominal cavity, bones, muscle, subcutaneous tissue, and thoracic cavity. Multi-resolution U-Net 3D neural networks were employed for segmenting these body regions, followed by subclassifying adipose tissue and muscle using known Hounsfield unit limits. Results The Sørensen Dice scores averaged over all semantic regions was 0.9553 and the intra-class correlation coefficients for subclassified tissues were above 0.99. Conclusions Our results show that fully automated body composition analysis on routine CT imaging can provide stable biomarkers across the whole abdomen and not just on L3 slices, which is historically the reference location for analyzing body composition in the clinical routine. Key Points • Our study enables fully automated body composition analysis on routine abdomen CT scans. • The best segmentation models for semantic body region segmentation achieved an averaged Sørensen Dice score of 0.9553. • Subclassified tissue volumes achieved intra-class correlation coefficients over 0.99.
Published: 2020

29. Mimicking the radiologists’ workflow : Estimating pediatric hand bone age with stacked deep neural networks

Author: Ming Qu, Felix Nensa, Sven Koitka, Christoph M. Friedrich, Asja Fischer, and Moon S. Kim
Subjects: Computer science, Medizin, Health Informatics, Machine learning, computer.software_genre, Workflow, Age Determination by Skeleton, Radiologists, medicine, Humans, Radiology, Nuclear Medicine and imaging, Child, Radiological and Ultrasound Technology, Artificial neural network, business.industry, Ossification, Deep learning, Infant, Bone age, Computer Graphics and Computer-Aided Design, Object detection, Hand Bones, Test set, Bone maturation, Neural Networks, Computer, Computer Vision and Pattern Recognition, Artificial intelligence, medicine.symptom, business, computer
Abstract: Pediatric endocrinologists regularly order radiographs of the left hand to estimate the degree of bone maturation in order to assess their patients for advanced or delayed growth, physical development, and to monitor consecutive therapeutic measures. The reading of such images is a labor-intensive task that requires a lot of experience and is normally performed by highly trained experts like pediatric radiologists. In this paper we build an automated system for pediatric bone age estimation that mimics and accelerates the workflow of the radiologist without breaking it. The complete system is based on two neural network based models: on the one hand a detector network, which identifies the ossification areas, on the other hand gender and region specific regression networks, which estimate the bone age from the detected areas. With a small annotated dataset an ossification area detection network can be trained, which is stable enough to work as part of a multi-stage approach. Furthermore, our system achieves competitive results on the RSNA Pediatric Bone Age Challenge test set with an average error of 4.56 months. In contrast to other approaches, especially purely encoder-based architectures, our two-stage approach provides self-explanatory results. By detecting and evaluating the individual ossification areas, thus simulating the workflow of the Tanner-Whitehouse procedure, the results are interpretable for a radiologist.
Published: 2020

30. Correction to: Fully automated body composition analysis in routine CT imaging using 3D semantic segmentation convolutional neural networks

Author: Eugen Malamutmann, Sven Koitka, Felix Nensa, Lennard Kroll, and Arzu Oezcelik
Subjects: medicine.medical_specialty, medicine.diagnostic_test, business.industry, Medizin, MEDLINE, Interventional radiology, Mistake, General Medicine, Composition analysis, Convolutional neural network, Medicine, Radiology, Nuclear Medicine and imaging, Medical physics, Segmentation, Radiology, Ct imaging, business, Neuroradiology
Abstract: The original version of this article, published on 18 September 2020, unfortunately contained a mistake.
Published: 2020

31. Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms

Author: Hyo-Eun Kim, Jiashi Feng, Stephen H. Friend, Ljubomir Buturovic, Dezső Ribli, Luis Caballero, Li Shen, Fredrik Strand, Yaroslav Nikulin, Krzysztof J. Geras, Kyunghyun Cho, Elias Chaibub Neto, Rami Ben-Ari, Christoph I. Lee, Zequn Jie, Imane Nedjar, Felix Nensa, Darvin Yi, Shivanthan A.C. Yohanandan, Bruce Hoff, Justin Guinney, Jaime S. Cardoso, Russell B. McBride, Mengling Feng, Yiqiu Shen, Simona Rabinovici-Cohen, Ethan Goan, Stefan Harrer, Sven Koitka, Michael Kawczynski, Hari Trivedi, Karl Trygve Kalleberg, Christoph M. Friedrich, F. Albiol, Dimitri Perrin, Jose Costa Pereira, Umar Asif, Bibo Shi, Zbigniew Wojna, Antonio Jimeno Yepes, Peter Lindholm, Berkman Sahiner, Sijia Wang, Thea Norman, Weiva Sieh, Joyce Cahoon, Gerard Cardoso Negrie, Pavitra Krishnaswamy, Diana S. M. Buist, Alberto Albiol, Lester Mackey, Hwejin Jung, Laurie R. Margolies, Gaurav Pandey, Can Son Khoo, William Lotter, Yuanfang Guan, Thomas Yu, Andrew D. Trister, Stephen Morrell, Gustavo Stolovitzky, A. Gregory Sorensen, Clinton Fookes, Mehmet Eren Ahsen, David D. Cox, Jae Ho Sohn, Hao Du, Thomas Schaffter, Joseph H. Rothstein, Eduardo Castro, Joseph Y. Lo, Daniel L. Rubin, and Obioma Pelka
Subjects: Adult, medicine.medical_specialty, Medizin, MEDLINE, Breast Neoplasms, Diagnostic accuracy, Sensitivity and Specificity, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, Deep Learning, 0302 clinical medicine, Breast cancer, Artificial Intelligence, Image Interpretation, Computer-Assisted, Radiologists, medicine, False positive paradox, Humans, Mammography, Risk factor, Early Detection of Cancer, Aged, Sweden, medicine.diagnostic_test, Screening mammography, business.industry, Correction, General Medicine, Middle Aged, medicine.disease, United States, 3. Good health, Online Only, 030220 oncology & carcinogenesis, Female, Other, Radiology, Artificial intelligence, business, Validation cohort, Algorithms
Abstract: Importance: Mammography screening currently relies on subjective human interpretation. Artificial intelligence (AI) advances could be used to increase mammography screening accuracy by reducing missed cancers and false positives. Objective: To evaluate whether AI can overcome human mammography interpretation limitations with a rigorous, unbiased evaluation of machine learning algorithms. Design, Setting, and Participants: In this diagnostic accuracy study conducted between September 2016 and November 2017, an international, crowdsourced challenge was hosted to foster AI algorithm development focused on interpreting screening mammography. More than 1100 participants comprising 126 teams from 44 countries participated. Analysis began November 18, 2016. Main Outcomes and Measurements: Algorithms used images alone (challenge 1) or combined images, previous examinations (if available), and clinical and demographic risk factor data (challenge 2) and output a score that translated to cancer yes/no within 12 months. Algorithm accuracy for breast cancer detection was evaluated using area under the curve and algorithm specificity compared with radiologists' specificity with radiologists' sensitivity set at 85.9% (United States) and 83.9% (Sweden). An ensemble method aggregating top-performing AI algorithms and radiologists' recall assessment was developed and evaluated. Results: Overall, 144 231 screening mammograms from 85 580 US women (952 cancer positive ≤12 months from screening) were used for algorithm training and validation. A second independent validation cohort included 166 578 examinations from 68 008 Swedish women (780 cancer positive). The top-performing algorithm achieved an area under the curve of 0.858 (United States) and 0.903 (Sweden) and 66.2% (United States) and 81.2% (Sweden) specificity at the radiologists' sensitivity, lower than community-practice radiologists' specificity of 90.5% (United States) and 98.5% (Sweden). Combining top-performing algorithms and US radiologist assessments resulted in a higher area under the curve of 0.942 and achieved a significantly improved specificity (92.0%) at the same sensitivity. Conclusions and Relevance: While no single AI algorithm outperformed radiologists, an ensemble of AI algorithms combined with radiologist assessment in a single-reader screening environment improved overall accuracy. This study underscores the potential of using machine learning methods for enhancing mammography screening interpretation. CA extern
Published: 2020

32. Utilizing Neural Networks and Linguistic Metadata for Early Detection of Depression Indications in Text Sequences

Author: Sven Koitka, Marcel Trotzek, and Christoph M. Friedrich
Subjects: FOS: Computer and information sciences, Word embedding, Computer Science - Computation and Language, Artificial neural network, business.industry, Computer science, Medizin, Context (language use), 02 engineering and technology, Convolutional neural network, Linguistics, Computer Science Applications, Computer Science - Information Retrieval, Metadata, Informatik, Computational Theory and Mathematics, 020204 information systems, Metric (mathematics), 0202 electrical engineering, electronic engineering, information engineering, Task analysis, The Internet, business, Computation and Language (cs.CL), Information Retrieval (cs.IR), Information Systems
Abstract: Depression is ranked as the largest contributor to global disability and is also a major reason for suicide. Still, many individuals suffering from forms of depression are not treated for various reasons. Previous studies have shown that depression also has an effect on language usage and that many depressed individuals use social media platforms or the internet in general to get information or discuss their problems. This paper addresses the early detection of depression using machine learning models based on messages on a social platform. In particular, a convolutional neural network based on different word embeddings is evaluated and compared to a classification based on user-level linguistic metadata. An ensemble of both approaches is shown to achieve state-of-the-art results in a current early detection task. Furthermore, the currently popular ERDE score as metric for early detection systems is examined in detail and its drawbacks in the context of shared tasks are illustrated. A slightly modified metric is proposed and compared to the original score. Finally, a new word embedding was trained on a large corpus of the same domain as the described task and is evaluated as well., This work has been submitted to the IEEE and has been accepted for future publication in IEEE Transactions on Knowledge and Data Engineering. Copyright may be transferred without notice, after which this version may no longer be accessible. 14 pages, 3 figures, 7 tables
Published: 2018

33. Early Detection of Depression Based on Linguistic Metadata Augmented Classifiers Revisited

Author: Marcel Trotzek, Sven Koitka, and Christoph M. Friedrich
Subjects: Artificial neural network, business.industry, Computer science, Medizin, Early detection, 02 engineering and technology, computer.software_genre, Clef, Task (project management), Metadata, Important research, 020204 information systems, Metric (mathematics), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Social media, Artificial intelligence, business, computer, Natural language processing
Abstract: Early detection of depression based on written texts has become an important research area due to the rise of social media platforms and because many affected individuals are still not treated. During the eRisk task for early detection of depression at CLEF 2017, FHDO Biomedical Computer Science Group (BCSG) submitted results based on five text classification models. This paper builds upon this work to examine the task and especially the \(ERDE_o\) metric in further detail and to analyze how an additional type of metadata features can help in this task. Finally, different prediction thresholds and ensembles of the developed models are utilized to investigate the possible improvements, and a newly proposed alternative early detection metric is evaluated.
Published: 2018

34. Radiology objects in COntext (ROCO) : A multimodal image dataset

Author: Christoph M. Friedrich, Felix Nensa, Sven Koitka, Obioma Pelka, and Johannes Rückert
Subjects: medicine.medical_specialty, business.industry, Computer science, Deep learning, Unified Medical Language System, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Context (language use), 02 engineering and technology, 01 natural sciences, Convolutional neural network, 010309 optics, Set (abstract data type), Informatik, Binary classification, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Medical imaging, medicine, 020201 artificial intelligence & image processing, Artificial intelligence, Radiology, business, Image retrieval
Abstract: This work introduces a new multimodal image dataset, with the aim of detecting the interplay between visual elements and semantic relations present in radiology images. The objective is accomplished by retrieving all image-caption pairs from the open-access biomedical literature database PubMedCentral, as these captions describe the visual content in their semantic context. All compound, multi-pane, and non-radiology images were eliminated using an automatic binary classifier fine-tuned with a deep convolutional neural network system. Radiology Objects in COntext (ROCO) dataset contains over 81k radiology images with several medical imaging modalities including Computer Tomography, Ultrasound, X-Ray, Fluoroscopy, Positron Emission Tomography, Mammography, Magnetic Resonance Imaging, Angiography. All images in ROCO have corresponding caption, keywords, Unified Medical Language Systems Concept Unique Identifiers and Semantic Type. An out-of-class set with 6k images ranging from synthetic radiology figures to digital arts is provided, to improve prediction and classification performance. Adopting ROCO, systems for caption and keywords generation can be modeled, which allows multimodal representation for datasets lacking text representation. Systems with the goal of image structuring and semantic information tagging can be created using ROCO, which is beneficial and of assistance for image and information retrieval purposes.
Published: 2018

35. Optimized Convolutional Neural Network Ensembles for Medical Subfigure Classification

Author: Christoph M. Friedrich and Sven Koitka
Subjects: business.industry, Computer science, Deep learning, Pattern recognition, 02 engineering and technology, Machine learning, computer.software_genre, Convolutional neural network, 030218 nuclear medicine & medical imaging, Image (mathematics), Domain (software engineering), Task (project management), 03 medical and health sciences, 0302 clinical medicine, 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), Labeled data, Preprocessor, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer
Abstract: Automatic classification systems are required to support medical literature databases like PubMedCentral, which allow an easy access to millions of articles. FHDO Biomedical Computer Science Group (BCSG) participated at the ImageCLEF 2016 Subfigure Classification Task to improve existing approaches for classifying figures from medical literature. In this work, a data analysis is conducted in order to improve image preprocessing for deep learning approaches. Evaluations on the dataset show better ensemble classification accuracies using only visual information with an optimized training, in comparison to the mixed feature approaches of BCSG at ImageCLEF 2016. Additionally, a self-training approach is investigated to generate more labeled data in the medical domain.
Published: 2017

36. Nmfgpu4R: Gpu-accelerated computation of the non-negative matrix factorization (NMF) using CUDA capable hardware

Author: Christoph M. Friedrich and Sven Koitka
Subjects: Statistics and Probability, 020203 distributed computing, Numerical Analysis, Computer science, Computation, 02 engineering and technology, Parallel computing, Non-negative matrix factorization, Computational science, CUDA, Factorization, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Statistics, Probability and Uncertainty

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

36 results on '"Sven Koitka"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources