XMAn_Homo_Sapiens_Mutated_Peptide_Database_cancer_fasta
收藏DataCite Commons2024-03-24 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/dataset/XMAn_A_Homo_sapiens_Mutated_Peptide_Database/2825557/2
下载链接
链接失效反馈官方服务:
资源简介:
To enable the identification of mutated peptide sequences in complex
biological samples, in this work, a cancer protein database with mutation information collected from several
public resources such as COSMIC, IARC P53, OMIM and UniProtKB, was developed. In-house
developed Perl-scripts were used to search and process the data, and to
translate each gene-level mutation into a mutated peptide sequence. The
cancer mutation database comprises a total of 872,125 peptide entries from 25,642 protein IDs. A description
line for each entry provides the parent protein ID and name, the cDNA- and
protein-level mutation site and type, the originating database, and the cancer tissue type and corresponding hits. The database is FASTA
formatted to enable data retrieval by commonly used tandem MS search engines. <br>
为实现复杂生物样本中突变肽序列的鉴定,本研究构建了一款携带有突变信息的癌症蛋白质数据库,其数据源自COSMIC、IARC P53、OMIM及UniProtKB等多个公共资源。本研究采用自研Perl脚本(Perl script)完成数据检索与处理工作,并将每个基因层面的突变转换为对应的突变肽序列。该癌症突变数据库总计包含来自25642个蛋白质ID的872125条肽条目。每条条目均附带描述行,内容涵盖父蛋白质ID与名称、cDNA及蛋白质层面的突变位点与突变类型、来源数据库,以及癌症组织类型与对应匹配结果。该数据库采用FASTA格式(FASTA),可通过主流串联质谱(tandem MS)搜索引擎实现数据检索。
提供机构:
figshare
创建时间:
2016-08-14



