five

Brown-Algae.dataset

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14364745
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset folder contains essential files required for the execution and testing of the iCulture pipeline. This folder includes: 1. Reference Databases: • Pfam-A.hmm and associated index files (*.h3f, *.h3i, *.h3m, *.h3p): • These files constitute the Pfam database used for HMMER annotation of protein sequences. • The database enables the identification of protein domains and families within query sequences. 2. Input FASTA Files: • brown-algae_dataset.fa: • This FASTA file is generated from the brown algae dataset downloaded from The Phaeoexplorer Project. • It contains protein sequences from brown algae, formatted for compatibility with the pipeline. • This file is used as an input for clustering and annotation during the pipeline execution. 3. Sample Input Files: • These files are provided to facilitate testing and ensure reproducibility of the pipeline results.   Purpose: This directory serves as a centralized location for storing datasets and databases necessary for running the iCulture pipeline, particularly in reproducibility-focused workflows. Note: If you wish to use the pipeline with custom datasets, replace the example files in this folder with your own, following the required format.
创建时间:
2024-12-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作