
Supplementary Data: Finding Haplotypic Signatures in Proteins

Source: DataCite Commons. Updated: 2023-09-27. Indexed: 2024-08-18.
Download link:
https://figshare.com/articles/dataset/Supplementary_Data_Identifying_Protein_Haplotypes_by_Mass_Spectrometry/21408117/2
Description:
Supplementary data related to the paper "Identifying Protein Haplotypes by Mass Spectrometry".

SD1: FASTA file including all target protein sequences (Ensembl reference proteome, protein haplotype sequences, contaminant sequences), excluding decoys
SD2: FASTA file including all target and decoy sequences
SD3: List of all peptide-spectrum matches (PSMs) with all related metadata and quality control measures
SD4: List of substitutions identified, along with IDs of corresponding PSMs
SD5: List of variant PSMs and peptide candidates suggested by PepQuery, along with confidence scores for each peptide candidate

To reproduce the post-processing steps, use the pipeline published at https://github.com/ProGenNo/IdentifyingHaplotypesByMS
The repository also contains additional explanations of the contents of the supplementary files.
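
As a quick sanity check before running the full pipeline, the FASTA files (SD1, SD2) can be inspected with a few lines of Python. The following is a minimal sketch of a generic FASTA reader, not part of the published pipeline. The example headers and sequences are invented for illustration, and the header format of the actual files is not described here, so nothing in the sketch depends on it.

```python
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            # Sequence lines may be wrapped; collect and join them later.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Tiny in-memory stand-in for a FASTA file; these entries are hypothetical.
example = io.StringIO(">prot_A example\nMKT\nAYI\n>prot_B example\nMSE\n")
records = list(read_fasta(example))
print(len(records), "sequences read")
```

To apply this to a downloaded file, pass an opened text file to `read_fasta` instead of the in-memory example, for instance `open(path)` where `path` is wherever the file was saved locally.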

Provider:
figshare
Created:
2023-03-23