five

XMAn_Homo_Sapiens_Mutated_Peptide_Database_cancer_fasta

收藏
DataCite Commons2024-03-24 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/dataset/XMAn_A_Homo_sapiens_Mutated_Peptide_Database/2825557/2
下载链接
链接失效反馈
官方服务:
资源简介:
To enable the identification of mutated peptide sequences in complex biological samples, in this work, a cancer protein database with mutation information collected from several public resources such as COSMIC, IARC P53, OMIM and UniProtKB, was developed. In-house developed Perl-scripts were used to search and process the data, and to translate each gene-level mutation into a mutated peptide sequence. The cancer mutation database comprises a total of 872,125 peptide entries from 25,642 protein IDs. A description line for each entry provides the parent protein ID and name, the cDNA- and protein-level mutation site and type, the originating database, and the cancer tissue type and corresponding hits. The database is FASTA formatted to enable data retrieval by commonly used tandem MS search engines. <br>

为实现复杂生物样本中突变肽序列的鉴定,本研究构建了一款携带有突变信息的癌症蛋白质数据库,其数据源自COSMIC、IARC P53、OMIM及UniProtKB等多个公共资源。本研究采用自研Perl脚本(Perl script)完成数据检索与处理工作,并将每个基因层面的突变转换为对应的突变肽序列。该癌症突变数据库总计包含来自25642个蛋白质ID的872125条肽条目。每条条目均附带描述行,内容涵盖父蛋白质ID与名称、cDNA及蛋白质层面的突变位点与突变类型、来源数据库,以及癌症组织类型与对应匹配结果。该数据库采用FASTA格式(FASTA),可通过主流串联质谱(tandem MS)搜索引擎实现数据检索。
提供机构:
figshare
创建时间:
2016-08-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作