Back to Search Start Over

Combining full-length gene assay and SpliceAI to interpret the splicing impact of all possible SPINK1coding variants

Authors :
Wu, Hao
Lin, Jin-Huan
Tang, Xin-Ying
Marenne, Gaëlle
Zou, Wen-Bin
Schutz, Sacha
Masson, Emmanuelle
Génin, Emmanuelle
Fichou, Yann
Le Gac, Gerald
Férec, Claude
Liao, Zhuan
Chen, Jian-Min
Source :
Human Genomics; December 2024, Vol. 18 Issue: 1
Publication Year :
2024

Abstract

Background: Single-nucleotide variants (SNVs) within gene coding sequences can significantly impact pre-mRNA splicing, bearing profound implications for pathogenic mechanisms and precision medicine. In this study, we aim to harness the well-established full-length gene splicing assay (FLGSA) in conjunction with SpliceAI to prospectively interpret the splicing effects of all potential coding SNVs within the four-exon SPINK1gene, a gene associated with chronic pancreatitis. Results: Our study began with a retrospective analysis of 27 SPINK1coding SNVs previously assessed using FLGSA, proceeded with a prospective analysis of 35 new FLGSA-tested SPINK1coding SNVs, followed by data extrapolation, and ended with further validation. In total, we analyzed 67 SPINK1coding SNVs, which account for 9.3% of the 720 possible coding SNVs. Among these 67 FLGSA-analyzed SNVs, 12 were found to impact splicing. Through detailed comparison of FLGSA results and SpliceAI predictions, we inferred that the remaining 653 untested coding SNVs in the SPINK1gene are unlikely to significantly affect splicing. Of the 12 splice-altering events, nine produced both normally spliced and aberrantly spliced transcripts, while the remaining three only generated aberrantly spliced transcripts. These splice-impacting SNVs were found solely in exons 1 and 2, notably at the first and/or last coding nucleotides of these exons. Among the 12 splice-altering events, 11 were missense variants (2.17% of 506 potential missense variants), and one was synonymous (0.61% of 164 potential synonymous variants). Notably, adjusting the SpliceAI cut-off to 0.30 instead of the conventional 0.20 would improve specificity without reducing sensitivity. Conclusions: By integrating FLGSA with SpliceAI, we have determined that less than 2% (1.67%) of all possible coding SNVs in SPINK1significantly influence splicing outcomes. Our findings emphasize the critical importance of conducting splicing analysis within the broader genomic sequence context of the study gene and highlight the inherent uncertainties associated with intermediate SpliceAI scores (0.20 to 0.80). This study contributes to the field by being the first to prospectively interpret all potential coding SNVs in a disease-associated gene with a high degree of accuracy, representing a meaningful attempt at shifting from retrospective to prospective variant analysis in the era of exome and genome sequencing.

Details

Language :
English
ISSN :
14739542 and 14797364
Volume :
18
Issue :
1
Database :
Supplemental Index
Journal :
Human Genomics
Publication Type :
Periodical
Accession number :
ejs65601726
Full Text :
https://doi.org/10.1186/s40246-024-00586-9