Back to Search Start Over

Using Quasispecies Patterns of Hepatitis B Virus to Predict Hepatocellular Carcinoma With Deep Sequencing and Machine Learning.

Authors :
Chen S
Zhang Z
Wang Y
Fang M
Zhou J
Li Y
Dai E
Feng Z
Wang H
Yang Z
Li Y
Huang X
Jia J
Li S
Huang C
Tong L
Xiao X
He Y
Duan Y
Zhu S
Gao C
Source :
The Journal of infectious diseases [J Infect Dis] 2021 Jun 04; Vol. 223 (11), pp. 1887-1896.
Publication Year :
2021

Abstract

Background: Hepatitis B virus (HBV) infection is one of the main leading causes of hepatocellular carcinoma (HCC) worldwide. However, it remains uncertain how the reverse-transcriptase (rt) gene contributes to HCC progression.<br />Methods: We enrolled a total of 307 patients with chronic hepatitis B (CHB) and 237 with HBV-related HCC from 13 medical centers. Sequence features comprised multidimensional attributes of rt nucleic acid and rt/s amino acid sequences. Machine-learning models were used to establish HCC predictive algorithms. Model performances were tested in the training and independent validation cohorts using receiver operating characteristic curves and calibration plots.<br />Results: A random forest (RF) model based on combined metrics (10 features) demonstrated the best predictive performances in both cross and independent validation (AUC, 0.96; accuracy, 0.90), irrespective of HBV genotypes and sequencing depth. Moreover, HCC risk scores for individuals obtained from the RF model (AUC, 0.966; 95% confidence interval, .922-.989) outperformed α-fetoprotein (0.713; .632-.784) in distinguishing between patients with HCC and those with CHB.<br />Conclusions: Our study provides evidence for the first time that HBV rt sequences contain vital HBV quasispecies features in predicting HCC. Integrating deep sequencing with feature extraction and machine-learning models benefits the longitudinal surveillance of CHB and HCC risk assessment.<br /> (© The Author(s) 2020. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.)

Details

Language :
English
ISSN :
1537-6613
Volume :
223
Issue :
11
Database :
MEDLINE
Journal :
The Journal of infectious diseases
Publication Type :
Academic Journal
Accession number :
33049037
Full Text :
https://doi.org/10.1093/infdis/jiaa647