Back to Search Start Over

Inter-Reader Reliability of Early FDG-PET/CT Response Assessment Using the Deauville Scale after 2 Cycles of Intensive Chemotherapy (OEPA) in Hodgkin's Lymphoma.

Authors :
Regine Kluge
Lidia Chavdarova
Martha Hoffmann
Carsten Kobe
Bogdan Malkowski
Françoise Montravers
Lars Kurch
Thomas Georgi
Markus Dietlein
W Hamish Wallace
Jonas Karlen
Ana Fernández-Teijeiro
Michaela Cepelova
Lorrain Wilson
Eva Bergstraesser
Osama Sabri
Christine Mauz-Körholz
Dieter Körholz
Dirk Hasenclever
Source :
PLoS ONE, Vol 11, Iss 3, p e0149072 (2016)
Publication Year :
2016
Publisher :
Public Library of Science (PLoS), 2016.

Abstract

PURPOSE:The five point Deauville (D) scale is widely used to assess interim PET metabolic response to chemotherapy in Hodgkin lymphoma (HL) patients. An International Validation Study reported good concordance among reviewers in ABVD treated advanced stage HL patients for the binary discrimination between score D1,2,3 and score D4,5. Inter-reader reliability of the whole scale is not well characterised. METHODS:Five international expert readers scored 100 interim PET/CT scans from paediatric HL patients. Scans were acquired in 51 European hospitals after two courses of OEPA chemotherapy (according to the EuroNet-PHL-C1 study). Images were interpreted in direct comparison with staging PET/CTs. RESULTS:The probability that two random readers concord on the five point D score of a random case is only 42% (global kappa = 0.24). Aggregating to a three point scale D1,2 vs. D3 vs. D4,5 improves concordance to 60% (kappa = 0.34). Concordance if one of two readers assigns a given score is 70% for score D1,2 only 36% for score D3 and 64% for D4,5. Concordance for the binary decisions D1,2 vs. D3,4,5 is 67% and 86% for D1,2,3 vs D4,5 (kappa = 0.36 resp. 0.56). If one reader assigns D1,2,3 concordance probability is 92%, but only 64% if D4,5 is called. Discrepancies occur mainly in mediastinum, neck and skeleton. CONCLUSION:Inter-reader reliability of the five point D-scale is poor in this interobserver analysis of paediatric patients who underwent OEPA. Inter-reader variability is maximal in cases assigned to D2 or D3. The binary distinction D1,2,3 versus D4,5 is the most reliable criterion for clinical decision making.

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
19326203
Volume :
11
Issue :
3
Database :
Directory of Open Access Journals
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
edsdoj.54031301ed1e42bb8c35755bdd381c69
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pone.0149072