learn2therm relational database
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://figshare.com/articles/dataset/learn2therm_relational_database/23581932
Resource description:
Description
A dataset of prokaryotes, proteins, organism pairs, and homologous protein pairs across temperatures up to 102 °C. The data are presented as a DuckDB relational database containing 24 million proteins belonging to 9.5k prokaryotes. The proteins have been paired by homology, producing 70 million mesophilic-thermophilic protein pairs. Please see the README in the zipped file for instructions on using and accessing the relational data file. See the code repository in this Figshare project and the main paper (TODO) for details on how the data were created.
Contents
- `learn2therm.ddb` - DuckDB file containing the data
- `environment.yaml` - minimal conda environment necessary to access the data
- `README.md` - instructions on accessing the data, including some example queries. Also contains the schema of the tables in the relational database.
- `csvs/` - Colon-separated dump files of the data tables
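For readers who prefer the flat dumps to the database file, the following sketch previews the first rows of one dump using only the Python standard library. The listing describes the dumps as colon-separated; rather than hard-coding that, the sketch detects the delimiter from a sample of the file. `preview_dump` is a hypothetical helper name, and the file names inside `csvs/` are not given here, so any path passed to it is an assumption.

```python
import csv
import itertools


def preview_dump(path, n_rows=5, delimiters=",:;\t|"):
    """Return (delimiter, first n_rows rows) of a delimited text dump.

    Hypothetical helper for illustration. The delimiter is sniffed from
    the start of the file instead of being assumed.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        sample = handle.read(64 * 1024)  # only a prefix is needed to sniff
        handle.seek(0)
        # Raises csv.Error if no consistent delimiter can be found.
        dialect = csv.Sniffer().sniff(sample, delimiters=delimiters)
        reader = csv.reader(handle, dialect)
        # islice avoids loading a multi-million-row file into memory.
        rows = list(itertools.islice(reader, n_rows))
    return dialect.delimiter, rows
```

Because only the first few rows are materialized, this stays cheap even on the largest dumps; for anything beyond a preview, querying `learn2therm.ddb` directly is likely the better route.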
All material in this project:
- Database pipeline source code: 10.6084/m9.figshare.23589390
- (here) Database: 10.6084/m9.figshare.23581932
- Classifier source code: 10.6084/m9.figshare.23589210
- Trained classifier: 10.6084/m9.figshare.23582325
Manuscript
TODO
Created:
2023-08-24



