3. Comparative population transcriptomics in krill: orthogroups (FASTA, TSV files)
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://figshare.com/articles/dataset/3_Comparative_population_transcriptomics_in_krill_orthogroups_FASTA_TSV_files_/24039510
下载链接
链接失效反馈官方服务:
资源简介:
This item contains a gzipped archive with ~13,000 orthogroups used to study molecular evolution in this project.
Archive:
krill.orthogroups.tar.gz
Contents of archive (FILE,SIZE,SPECIES,SAMPLES,SNPs):
krill.proteinortho.tsv - the primary output table from Proteinortho. Describes which protein sequences from which species belong to the same orthogroup. Format according to the standard output of the program.krill.proteinortho.tsv.seqs.csv - a processed table that also contains the actual sequences line by line (see below).the alignments directory, which contains all OGs in unaligned and aligned files in FASTA format (see below).Format of the krill.proteinortho.tsv.seqs.csv table
The fields are:
NR = orthogroup numberORTHO_GROUP = orthogroup IDN_SPECIES = the number of speciesN_GENES = the number of genes/sequences in this orthogroupN_MATCHING[o] = number of sequences matching outgroup species for this orthogroupN_NON_MATCHING = number of sequences matching ingroup species for this orthogroupHEADER = the name of this particular sequenceSEQ = the protein sequenceContents of the alignments directory
Each orthogroup is represented by up to four FASTA files:
OG*.cds.ginsi.fasta.orig = the original, unaligned and unfiltered sequencesOG*.cds.ginsi.fasta = the aligned and filtered sequencesOG*.cds.ginsi.fasta.without_cold_euphausia.fasta = the aligned and filtered sequences after removing cold-associated Euphausia speciesOG*.cds.ginsi.fasta.without_cold_thysanoessa.fasta = the aligned and filtered sequences after removing cold-associated Thysanoessa species
创建时间:
2023-10-19



