five

Structural data used to test a new geometric deep learning RNA scoring function emulating fully de novo modeling conditions

收藏
DataCite Commons2025-07-07 更新2025-04-16 收录
下载链接:
https://purl.stanford.edu/sq987cc0358
下载链接
链接失效反馈
官方服务:
资源简介:
This deposition contains two primary subdirectories, each compressed. (Depositions in the Stanford Digital Repository must have nominally flat directory structures, solved by tarballs.) The subdirectory "nonnative_secstruct" corresponds to the originally published version of the paper. The subdirectory "xtal_secstruct" corresponds to the corrected version of the paper. For 11 of the 16 RNA molecules in this benchmark (benchmark 2 of the paper), all of the candidate models in "nonnative_secstruct" were constructed using incorrect Watson-Crick base pairing, in contrast to what would typically happen in actual blind structure prediction (where modelers might try multiple secondary structures and pick those that generate realistic 3D models). In "xtal_sectruct", the candidate models were regenerated using correct Watson-Crick base pairing. Each of these primary subdirectories contains 16 .tar.gz compressed directories, each of which contains 5,000 structural models in PDB format of 16 distinct RNA molecules. These models were used to benchmark a new scoring function for RNA structure. The first order of business in choosing the cases studied in this benchmark was ensuring that they did not overlap with any RNA molecules studied previously in the ARES project. Thus, this benchmark does not represent a perfectly comprehensive account of RNA structure and was not meant to: it is just one of several ways you could select a set of structures complementary to those studied previously in the paper. It is *likely* that, therefore, these models will be of archival value alone; alternatively, these models, or the modeling problems they address, could make up part of a larger comprehensive benchmark. To that end, in each primary subdirectory we have also included inputs.tar.gz, which provides all the files necessary for rerunning these benchmark cases yourself. Models were generated with FARFAR2, code for RNA fragment assembly documented extensively at https://new.rosettacommons.org/docs/latest/FARFAR2. That documentation should demystify the executable commands found in each README_FARFAR file. If you are already familiar with the Das lab's repository for RNA benchmarking, you can use that system to set up replications of the xtal_secstruct simulations at https://github.com/DasLab/rna_benchmark, using the benchmark definition file ares_benchmark2.txt. If you are interested in training a new RNA scoring function or sampling method, consider the FARFAR2-Classics and FARFAR2-Puzzles benchmarks, available at https://purl.stanford.edu/wn364wz7925.
提供机构:
Stanford Digital Repository
创建时间:
2021-07-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作