five

Database of conformational collections from the PDB clustered at 30% sequence identity and 50% coverage (CIF format)

收藏
DataCite Commons2024-07-05 更新2024-08-19 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Database_of_conformational_collections_from_the_PDB_clustered_at_30_sequence_identity_and_50_coverage_CIF_format_/25112744
下载链接
链接失效反馈
官方服务:
资源简介:
This archive contains the structural output of the database generated by the DANCE application at the level of the entire PDB with 30% identity and 50% coverage. Files related to an ensemble are prefixed with ID1_ID2_, where ID1 is the first member in alphabetical order, and ID2 is the reference for the structural alignment. Each CIF file contains multiple models. The beginning of the file includes the length of the alignment and a list of the model IDs present in the CIF file. We use _atom_site.auth_seq_id to describe the position of each residue in the considered model within the Multiple Sequence Alignment (MSA).

本归档文件包含DANCE应用程序生成的数据库结构输出,该数据库以完整蛋白质数据库(Protein Data Bank, PDB)为构建范围,序列同一性为30%、覆盖率为50%。 与结构系综相关的文件均以ID1_ID2_作为前缀,其中ID1为按字母顺序排序的首个成员,ID2为结构比对的参考结构。 每个晶体学信息文件(Crystallographic Information File, CIF)均包含多个模型。文件首部包含比对总长度以及该CIF文件中所有模型ID的列表。我们通过_atom_site.auth_seq_id字段描述目标模型内各残基在多序列比对(Multiple Sequence Alignment, MSA)中的对应位置。
提供机构:
figshare
创建时间:
2024-07-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作