five

GOPREDSIM Embeddings Adapted to FANTASIA v2

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/15095844
下载链接
链接失效反馈
官方服务:
资源简介:
This file, embeddings_fantasia_gopredsim_2022.dump, is a PostgreSQL custom-format dump containing a comprehensive relational database. It was generated from the massive TOPGO data provided by the goPredSim project and includes data for 309,278 protein entries along with 2,243,531 annotations, as verified against the original TOPGO file. In addition, the database incorporates a mechanism developed by our team for generating protein embeddings using the Rostlab/prot_t5_xl_uniref50 model into a relational database supporting vectors. To utilize this dump within Fantasia, simply configure the embeddings_url parameter with the URL of this Zenodo file. Fantasia’s containerized environment fully supports this process through the initialize command, which sets up the necessary database structure and loads the dump automatically. Restoration Instructions To load this dump into PostgreSQL, you must use pg_restore v16 or higher. Ensure database is clean and run the following command: pg_restore -U usuario -h localhost -p 5432 -d BioData ./embeddings_fantasia_gopredsim_2022.dump Caution: The restoration process requires pg_restore version 16 (or higher) to ensure full compatibility with all database structures contained in this dump.
创建时间:
2025-03-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作