Back to Search Start Over

Inter- and intra-software reproducibility of computed tomography lung density measurements.

Authors :
Kirby M
Hatt C
Obuchowski N
Humphries SM
Sieren J
Lynch DA
Fain SB
Source :
Medical physics [Med Phys] 2020 Jul; Vol. 47 (7), pp. 2962-2969. Date of Electronic Publication: 2020 Mar 31.
Publication Year :
2020

Abstract

Purpose: Multiple commercial, open-source, and academic software tools exist for objective quantification of lung density in computed tomography (CT) images. The purpose of this study was to evaluate the intersoftware reproducibility of CT lung density measurements.<br />Methods: Computed tomography images from 50 participants from the COPDGene <superscript>TM</superscript> cohort study were randomly selected for analysis; n = 10 participants across each global initiative for chronic obstructive lung disease (GOLD) grade (GOLD 0-IV). Academic-based groups (n = 4) and commercial vendors (n = 4) participated anonymously to generate CT lung density measurements using their software tools. Computed tomography total lung volume (TLV), percentage of the low attenuation areas in the lung with Hounsfield unit (HU) values below -950HU (LAA <subscript>950</subscript> ), and the HU value corresponding to the 15th percentile on the parenchymal density histogram (Perc15) were included in the analysis. The intersoftware bias and reproducibility coefficient (RDC) was generated with and without quality assurance (QA) for manual correction of the lung segmentation; intrasoftware bias and RDC was also generated by repeated measurements on the same images.<br />Results: Intersoftware mean bias was within ±0.22 mL, ±0.46%, and ±0.97 HU for TLV, LAA <subscript>950</subscript> and Perc15, respectively. The RDC was 0.35 L, 1.2% and 1.8 HU for TLV, LAA <subscript>950</subscript> and Perc15, respectively. Intersoftware RDC remained unchanged following QA: 0.35 L, 1.2% and 1.8 HU for TLV, LAA <subscript>950</subscript> and Perc15, respectively. All software investigated had an intrasoftware RDC of 0. The RDC was comparable for TLV, LAA <subscript>950</subscript> and Perc15 measurements, respectively, for academic-based groups/commercial vendor-based software tools: 0.39 L/0.32 L, 1.2%/1.2%, and 1.7 HU/1.6 HU. Multivariable regression analysis showed that academic-based software tools had greater within-subject standard deviation of TLV than commercial vendors, but no significant differences between academic and commercial groups were found for LAA <subscript>950</subscript> or Perc15 measurements.<br />Conclusions: Computed tomography total lung volume and lung density measurement bias and reproducibility was reported across eight different software tools. Bias was negligible across vendors, reproducibility was comparable for software tools generated by academic-based groups and commercial vendors, and segmentation QA had negligible impact on measurement variability between software tools. In summary, results from this study report the amount of additional measurement variability that should be accounted for when using different software tools to measure lung density longitudinally with well-standardized image acquisition protocols. However, intrasoftware reproducibility was deterministic for all cases so use of the same software tool to reduce variability for serial studies is highly recommended.<br /> (© 2020 American Association of Physicists in Medicine.)

Details

Language :
English
ISSN :
2473-4209
Volume :
47
Issue :
7
Database :
MEDLINE
Journal :
Medical physics
Publication Type :
Academic Journal
Accession number :
32160310
Full Text :
https://doi.org/10.1002/mp.14130