Brown-Algae.dataset
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14364745
下载链接
链接失效反馈官方服务:
资源简介:
The dataset folder contains essential files required for the execution and testing of the iCulture pipeline. This folder includes:
1. Reference Databases:
• Pfam-A.hmm and associated index files (*.h3f, *.h3i, *.h3m, *.h3p):
• These files constitute the Pfam database used for HMMER annotation of protein sequences.
• The database enables the identification of protein domains and families within query sequences.
2. Input FASTA Files:
• brown-algae_dataset.fa:
• This FASTA file is generated from the brown algae dataset downloaded from The Phaeoexplorer Project.
• It contains protein sequences from brown algae, formatted for compatibility with the pipeline.
• This file is used as an input for clustering and annotation during the pipeline execution.
3. Sample Input Files:
• These files are provided to facilitate testing and ensure reproducibility of the pipeline results.
Purpose:
This directory serves as a centralized location for storing datasets and databases necessary for running the iCulture pipeline, particularly in reproducibility-focused workflows.
Note: If you wish to use the pipeline with custom datasets, replace the example files in this folder with your own, following the required format.
创建时间:
2024-12-29



