five

Precise Discovery of Novel N‑Terminal Proteoforms beyond the Limitations of Proteogenomics and De Novo Sequencing

收藏
Figshare2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Precise_Discovery_of_Novel_N_Terminal_Proteoforms_beyond_the_Limitations_of_Proteogenomics_and_i_De_Novo_i_Sequencing/29662211
下载链接
链接失效反馈
官方服务:
资源简介:
Alternative splicing-mediated protein N-terminal sequence variation is closely associated with diseases, but its identification by mass spectrometry faces technical bottlenecks. Traditional proteogenomic methods cannot identify novel N-terminal proteins undetected in transcriptome data, while de novo sequencing has limitations in accuracy and traceability. To address this, we developed the first dedicated algorithm, NovelNSeq, which is specifically designed to parse signature peptides (novel N-terminal extension peptides) of novel N-terminal proteins from mass spectrometry data without relying on transcriptome data or de novo sequencing. NovelNSeq fully exploits peptide encoding rules, demonstrating significantly higher accuracy than de novo sequencing algorithms such as PEAKS, pNovo3, SpliceNovo, Casanovo, and InstaNovo, and enables tracing back the peptide encoding mechanisms. Using NovelNSeq, we identified and validated novel N-terminal proteoforms from human genes CALM2, CAPNS1, and CPNE7 in mass spectrometry data where large-scale proteogenomics failed to detect them, which establishes NovelNSeq as an essential complement to conventional approaches. Furthermore, we revealed that a recently reported AAG-initiated novel N-terminus in human ATP9A is actually generated through translational frameshifting from the canonical ATG start codon, highlighting the need for rigorous validation of noncanonical start codon annotations in novel N-terminal proteoforms.
二维码
社区交流群
二维码
科研交流群
商业服务