1. XY-Meta: A High-Efficiency Search Engine for Large-Scale Metabolome Annotation with Accurate FDR Estimation.
- Author
- Li D, Liu B, Zheng H, Xiao X, Li Z, Luan E, Li W, Yang Y, Wang Y, Long Q, Song J, and Zhang G
- Subjects
- Databases, Protein, Peptides analysis, Proteins analysis, Algorithms, Metabolomics, Peptides metabolism, Proteins metabolism, Search Engine
- Abstract
False discovery rate (FDR) control has been a major challenge for large-scale metabolome annotation. Although recent research indicates that the target-decoy strategy can be used to estimate FDR, FDR control remains difficult because a reliable decoy database is hard to obtain, owing to the complex fragmentation mechanisms of metabolites and the ubiquity of isomers. To tackle this problem, we developed a decoy generation method that generates forged spectra from the reference target database while preserving the original reference signals, thereby simulating the presence of metabolite isomers. Benchmarks on GNPS data sets in Passatutto showed that FDR estimates from the decoy database generated by our method are closer to the actual FDR than those from other methods, especially in the low FDR range (0-0.05). Large-scale metabolite annotation on 35 data sets showed that a strict FDR threshold reduced the number of annotated metabolites but increased the spectral efficiency, indicating the necessity of quality control. We recommend setting the FDR threshold to 0.01 in large-scale metabolite annotation. We implemented decoy generation, database search, and FDR control in a search engine called XY-Meta, which facilitates large-scale metabolome annotation applications.
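The target-decoy strategy mentioned in the abstract can be illustrated with a minimal sketch. This is the generic textbook form of target-decoy FDR estimation, not XY-Meta's actual implementation: the function names, the `(score, is_decoy)` input format, and the example scores are all assumptions made for illustration. At each score threshold the FDR is estimated as the number of decoy hits divided by the number of target hits at or above that score, and a q-value is the smallest such estimate at that score or any lower one.

```python
from typing import List, Tuple


def target_decoy_qvalues(hits: List[Tuple[float, bool]]) -> List[Tuple[float, float]]:
    """Estimate q-values for target hits with the generic target-decoy approach.

    hits: (score, is_decoy) pairs from searching a combined target + decoy
    database (hypothetical input format; higher score = better match).
    Returns (score, q_value) for target hits, best score first.
    """
    ranked = sorted(hits, key=lambda h: h[0], reverse=True)

    # Walk down the ranked list, estimating FDR = decoys / targets so far.
    targets = decoys = 0
    estimates = []
    for score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        fdr = decoys / targets if targets else 1.0
        estimates.append((score, is_decoy, min(fdr, 1.0)))

    # q-value: the minimum FDR estimate at this score or any lower score,
    # which makes the values monotone along the ranking.
    results = []
    running_min = 1.0
    for score, is_decoy, fdr in reversed(estimates):
        running_min = min(running_min, fdr)
        if not is_decoy:
            results.append((score, running_min))
    results.reverse()
    return results


def accepted_scores(hits: List[Tuple[float, bool]], threshold: float = 0.01) -> List[float]:
    """Scores of target hits whose q-value passes the FDR threshold."""
    return [score for score, q in target_decoy_qvalues(hits) if q <= threshold]


# Made-up example scores, for illustration only.
example_hits = [(0.9, False), (0.8, False), (0.7, True), (0.6, False), (0.5, False)]
qvalues = target_decoy_qvalues(example_hits)
accepted = accepted_scores(example_hits, threshold=0.01)
```

With the 0.01 threshold recommended in the abstract, only the target hits ranked above the first decoy are accepted in this toy example. The quality of such an estimate depends on how well the decoy spectra mimic false target matches, which is the problem the decoy generation method addresses.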
- Published
- 2020