Back to Search Start Over

MFCNV: A New Method to Detect Copy Number Variations From Next-Generation Sequencing Data.

Authors :
Zhao, Haiyong
Huang, Tihao
Li, Junqing
Liu, Guojun
Yuan, Xiguo
Source :
Frontiers in Genetics; 5/15/2020, Vol. 11, p1-9, 9p
Publication Year :
2020

Abstract

Copy number variation (CNV) is a very important phenomenon in tumor genomes and plays a significant role in tumor genesis. Accurate detection of CNVs has become a routine and necessary procedure for a deep investigation of tumor cells and diagnosis of tumor patients. Next-generation sequencing (NGS) technique has provided a wealth of data for the detection of CNVs at base-pair resolution. However, such task is usually influenced by a number of factors, including GC-content bias, sequencing errors, and correlations among adjacent positions within CNVs. Although many existing methods have dealt with some of these artifacts by designing their own strategies, there is still a lack of comprehensive consideration of all the factors. In this paper, we propose a new method, MFCNV, for an accurate detection of CNVs from NGS data. Compared with existing methods, the characteristics of the proposed method include the following: (1) it makes a full consideration of the intrinsic correlations among adjacent positions in the genome to be analyzed, (2) it calculates read depth, GC-content bias, base quality, and correlation value for each genome bin and combines them as multiple features for the evaluation of genome bins, and (3) it addresses the joint effect among the factors via training a neural network algorithm for the prediction of CNVs. We test the performance of the MFCNV method by using simulation and real sequencing data and make comparisons with several peer methods. The results demonstrate that our method is superior to other methods in terms of sensitivity, precision, and F1-score and can detect many CNVs that other methods have not discovered. MFCNV is expected to be a complementary tool in the analysis of mutations in tumor genomes and can be extended to be applied to the analysis of single-cell sequencing data. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
16648021
Volume :
11
Database :
Complementary Index
Journal :
Frontiers in Genetics
Publication Type :
Academic Journal
Accession number :
143311646
Full Text :
https://doi.org/10.3389/fgene.2020.00434