Back to Search Start Over

A Multicenter Pilot Evaluation of the National Institutes of Health Chronic Graft-versus-Host Disease (cGVHD) Therapeutic Response Measures: Feasibility, Interrater Reliability, and Minimum Detectable Change

Authors :
Mitchell, Sandra A.
Jacobsohn, David
Thormann Powers, Kimberly E.
Carpenter, Paul A.
Flowers, Mary E.D.
Cowen, Edward W.
Schubert, Mark
Turner, Maria L.
Lee, Stephanie J.
Martin, Paul
Bishop, Michael R.
Baird, Kristin
Bolaños-Meade, Javier
Boyd, Kevin
Fall-Dickson, Jane M.
Gerber, Lynn H.
Guadagnini, Jean-Pierre
Imanguli, Matin
Krumlauf, Michael C.
Lawley, Leslie
Source :
Biology of Blood & Marrow Transplantation. Nov2011, Vol. 17 Issue 11, p1619-1629. 11p.
Publication Year :
2011

Abstract

The lack of standardized criteria for measuring therapeutic response is a major obstacle to the development of new therapeutic agents for chronic graft-versus-host disease (cGVHD). National Institutes of Health (NIH) consensus criteria for evaluating therapeutic response were published in 2006. We report the results of 4 consecutive pilot trials evaluating the feasibility and estimating the interrater reliability and minimum detectable change of these response criteria. Hematology-oncology clinicians with limited experience in applying the NIH cGVHD response criteria (n = 34) participated in a 2.5-hour training session on response evaluation in cGVHD. Feasibility and interrater reliability between subspecialty cGVHD experts and this panel of clinician raters were examined in a sample of 25 children and adults with cGVHD. The minimum detectable change was calculated using the standard error of measurement. Clinicians’ impressions of the brief training session, the photo atlas, and the response criteria documentation tools were generally favorable. Performing and documenting the full set of response evaluations required a median of 21 minutes (range: 12-60 minutes) per rater. The Schirmer tear test required the greatest time of any single test (median: 9 minutes). Overall, interrater agreement for skin and oral manifestations was modest; however, in the third and fourth trials, the agreement between clinicians and experts for all dimensions except movable sclerosis approached satisfactory values. In the final 2 trials, the threshold for defining change exceeding measurement error was 19% to 22% body surface area (BSA) for erythema, 18% to 26% BSA for movable sclerosis, 17% to 21% BSA for nonmovable sclerosis, and 2.1 to 2.6 points on the 15-point NIH Oral cGHVD scale. Agreement between clinician-expert pairs was moderate to substantial for the measures of functional capacity and for the gastrointestinal and global cGVHD rating scales. These results suggest that the NIH response criteria are feasible for use, and these reliability estimates are encouraging, because they were observed following a single 2.5-hour training session given at multiple transplant centers, with no opportunity for iterative training and calibration. Research is needed to evaluate inter- and intrarater reliability in larger samples, and to evaluate these response criteria as predictors of outcomes in clinical trials. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10838791
Volume :
17
Issue :
11
Database :
Academic Search Index
Journal :
Biology of Blood & Marrow Transplantation
Publication Type :
Academic Journal
Accession number :
66946863
Full Text :
https://doi.org/10.1016/j.bbmt.2011.04.002