five

Stramenopile dataset for positive selection

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4725039
下载链接
链接失效反馈
官方服务:
资源简介:
This repository contains the data (sequences, annotations, and intermediary files) collected and produced during the preparation of the following preprint: https://doi.org/10.1101/2021.01.12.426341. Description of files: The samples.zip file contains for each taxa: The functional annotations from Interproscan with extension ".tsv" in a TSV format. The genome annotations with extension ".gff" in a gff3 format. The protein sequences with extension ".faa" in a fasta format. The corresponding coding DNA sequences with extension ".fna" in a fasta format. The all_ann.csv file contains all annotations from the tested genes with added information for positive selection and orthology status in a CSV format. The go_mapping.csv file contains the mapping of GO terms to protein accessions of the dataset in a CSV format. The protein_families.poff.tsv contains the proteinortho output file corresponding to the classification of the genes in the dataset into ortholog groups in a TSV format. The families.zip file contains the intermediary files for each of the selected orthogroups: Tree files in newick format in the folder "trees". Protein sequences in the folder  "faas". Coding DNA sequences in the folder "fnas". Log outputs from the FUBAR analysis in the folder "logs". Codon alignments in the folder "codon_alns" The families_fubar.zip file contains the same files as before for the subset of orthogroups with a positive result in the FUBAR analysis plus log output from the aBSREL analysis in the log folder.
创建时间:
2021-04-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作