Back to Search Start Over

Derivation and External Validation of Machine Learning- Based Model for Detection of Pancreatic Cancer.

Authors :
Wansu Chen
Yichen Zhou
Fagen Xie
Butler, Rebecca K.
Jeon, Christie Y.
Luong, Tiffany Q.
Zhou, Botao
Yu-Chen Lin
Lustigova, Eva
Pisegna, Joseph R.
Sungjin Kim
Wu, Bechien U.
Source :
American Journal of Gastroenterology (Lippincott Williams & Wilkins). Jan2023, Vol. 118 Issue 1, p157-167. 11p.
Publication Year :
2023

Abstract

INTRODUCTION: There is currently no widely accepted approach to screening for pancreatic cancer (PC). We aimed to develop and validate a risk prediction model for pancreatic ductal adenocarcinoma (PDAC), the most common form of PC, across 2 health systems using electronic health records. METHODS: This retrospective cohort study consisted of patients aged 50–84 years having at least 1 clinic-based visit over a 10-year study period at Kaiser Permanente Southern California (model training, internal validation) and the Veterans Affairs (VA, external testing). Random survival forests models were built to identify the most relevant predictors from >500 variables and to predict risk of PDAC within 18 months of cohort entry. RESULTS: The Kaiser Permanente Southern California cohort consisted of 1.8 million patients (mean age 61.6) with 1,792 PDAC cases. The 18-month incidence rate of PDAC was 0.77 (95% confidence interval 0.73–0.80)/1,000 person-years. The final main model contained age, abdominal pain, weight change, HbA1c, and alanine transaminase change (c-index: mean = 0.77, SD = 0.02; calibration test: P value 0.4, SD 0.3). The final early detection model comprised the same features as those selected by the main model except for abdominal pain (c-index: 0.77 and SD 0.4; calibration test: P value 0.3 and SD 0.3). The VA testing cohort consisted of 2.7 million patients (mean age 66.1) with an 18-month incidence rate of 1.27 (1.23–1.30)/1,000 person-years. The recalibrated main and early detection models based on VA testing data sets achieved a mean c-index of 0.71 (SD 0.002) and 0.68 (SD 0.003), respectively. DISCUSSION: Using widely available parameters in electronic health records, we developed and externally validated parsimonious machine learning-based models for detection of PC. These models may be suitable for real-time clinical application. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00029270
Volume :
118
Issue :
1
Database :
Academic Search Index
Journal :
American Journal of Gastroenterology (Lippincott Williams & Wilkins)
Publication Type :
Academic Journal
Accession number :
162016849
Full Text :
https://doi.org/10.14309/ajg.0000000000002050