Author: "Shu, Kunxian" / Topic: tandem mass spectrometry - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Shu, Kunxian"' showing total 4 results

Start Over Author "Shu, Kunxian" Topic tandem mass spectrometry

4 results on '"Shu, Kunxian"'

1. Deep learning embedder method and tool for mass spectra similarity search.

Author: Qin C, Luo X, Deng C, Shu K, Zhu W, Griss J, Hermjakob H, Bai M, and Perez-Riverol Y
Subjects: Algorithms, Cluster Analysis, Databases, Protein, Humans, Proteomics, Software, Deep Learning, Tandem Mass Spectrometry
Abstract: Spectral similarity calculation is widely used in protein identification tools and mass spectra clustering algorithms while comparing theoretical or experimental spectra. The performance of the spectral similarity calculation plays an important role in these tools and algorithms especially in the analysis of large-scale datasets. Recently, deep learning methods have been proposed to improve the performance of clustering algorithms and protein identification by training the algorithms with existing data and the use of multiple spectra and identified peptide features. While the efficiency of these algorithms is still under study in comparison with traditional approaches, their application in proteomics data analysis is becoming more common. Here, we propose the use of deep learning to improve spectral similarity comparison. We assessed the performance of deep learning for spectral similarity, with GLEAMS and a newly trained embedder model (DLEAMSE), which uses high-quality spectra from PRIDE Cluster. Also, we developed a new bioinformatics tool (mslookup - https://github.com/bigbio/DLEAMSE/) that allows users to quickly search for spectra in previously identified mass spectra publish in public repositories and spectral libraries. Finally, we released a human database to enable bioinformaticians and biologists to search for identified spectra in their machines. SIGNIFICANCE STATEMENT: Spectral similarity calculation plays an important role in proteomics data analysis. With deep learning's ability to learn the implicit and effective features from large-scale training datasets, deep learning-based MS/MS spectra embedding models has emerged as a solution to improve mass spectral clustering similarity calculation algorithms. We compare multiple similarity scoring and deep learning methods in terms of accuracy (compute the similarity for a pair of the mass spectrum) and computing-time performance. The benchmark results showed no major differences in accuracy between DLEAMSE and normalized dot product for spectrum similarity calculations. The DLEAMSE GPU implementation is faster than NDP in preprocessing on the GPU server and the similarity calculation of DLEAMSE (Euclidean distance on 32-D vectors) takes about 1/3 of dot product calculations. The deep learning model (DLEAMSE) encoding and embedding steps needed to run once for each spectrum and the embedded 32-D points can be persisted in the repository for future comparison, which is faster for future comparisons and large-scale data. Based on these, we proposed a new tool mslookup that enables the researcher to find spectra previously identified in public data. The tool can be also used to generate in-house databases of previously identified spectra to share with other laboratories and consortiums., (Copyright © 2020. Published by Elsevier B.V.)
Published: 2021
Full Text: View/download PDF

2. A Comprehensive Evaluation of MS/MS Spectrum Prediction Tools for Shotgun Proteomics.

Author: Xu R, Sheng J, Bai M, Shu K, Zhu Y, and Chang C
Subjects: Algorithms, Machine Learning, Search Engine, Proteomics, Tandem Mass Spectrometry
Abstract: Spectrum prediction using machine learning or deep learning models is an emerging method in computational proteomics. Several deep learning-based MS/MS spectrum prediction tools have been developed and showed their potentials not only for increasing the sensitivity and accuracy of data-dependent acquisition search engines, but also for building spectral libraries for data-independent acquisition analysis. Different tools with their unique algorithms and implementations may result in different performances. Hence, it is necessary to systematically evaluate these tools to find out their preferences and intrinsic differences. In this study, multiple datasets with different collision energies, enzymes, instruments, and species, are used to evaluate the performances of the deep learning-based MS/MS spectrum prediction tools, as well as, the machine learning-based tool MS2PIP. The evaluations may provide helpful insights and guidelines of spectrum prediction tools for the corresponding researchers., (© 2020 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.)
Published: 2020
Full Text: View/download PDF

3. [Progress in the spectral library based protein identification strategy].

Author: Yu D, Ma J, Xie Z, Bai M, Zhu Y, and Shu K
Subjects: Databases, Protein, Peptides analysis, Peptide Library, Proteins analysis, Proteomics, Tandem Mass Spectrometry
Abstract: Exponential growth of the mass spectrometry (MS) data is exhibited when the mass spectrometry-based proteomics has been developing rapidly. It is a great challenge to develop some quick, accurate and repeatable methods to identify peptides and proteins. Nowadays, the spectral library searching has become a mature strategy for tandem mass spectra based proteins identification in proteomics, which searches the experiment spectra against a collection of confidently identified MS/MS spectra that have been observed previously, and fully utilizes the abundance in the spectrum, peaks from non-canonical fragment ions, and other features. This review provides an overview of the implement of spectral library search strategy, and two key steps, spectral library construction and spectral library searching comprehensively, and discusses the progress and challenge of the library search strategy.
Published: 2018
Full Text: View/download PDF

4. PGPointNovo: an efficient neural network-based tool for parallel de novo peptide sequencing.

Author: Xu, Xiaofang, Yang, Chunde, He, Qiang, Shu, Kunxian, Xinpu, Yuan, Chen, Zhiguang, Zhu, Yunping, and Chen, Tao
Subjects: AMINO acid sequence, TANDEM mass spectrometry, SOURCE code, BIOINFORMATICS, COMPUTATIONAL biology
Abstract: Summary De novo peptide sequencing for tandem mass spectrometry data is not only a key technology for novel peptide identification, but also a precedent task for many downstream tasks, such as vaccine and antibody studies. In recent years, neural network models for de novo peptide sequencing have manifested a remarkable ability to accommodate various data sources and outperformed conventional peptide identification tools. However, the excellent model is computationally expensive, taking up to 1 week to process about 400 000 spectrums. This article presents PGPointNovo, a novel neural network-based tool for parallel de novo peptide sequencing. PGPointNovo uses data parallelization technology to accelerate training and inference and optimizes the training obstacles caused by large batch sizes. The results of extensive experiments conducted on multiple datasets of different sizes demonstrate that compared with PointNovo the excellent neural network-based de novo peptide sequencing tool, PGPointNovo, accelerates de novo peptide sequencing by up to 7.35× without precision or recall compromises. Availability and implementation The source code and the parameter settings are available at https://github.com/shallFun4Learning/PGPointNovo. Supplementary information Supplementary data are available at Bioinformatics Advances online. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

4 results on '"Shu, Kunxian"'

1. Deep learning embedder method and tool for mass spectra similarity search.

2. A Comprehensive Evaluation of MS/MS Spectrum Prediction Tools for Shotgun Proteomics.

3. [Progress in the spectral library based protein identification strategy].

4. PGPointNovo: an efficient neural network-based tool for parallel de novo peptide sequencing.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

4 results on '"Shu, Kunxian"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources