3. Comparative population transcriptomics in krill: orthogroups (FASTA, TSV files)
收藏DataCite Commons2025-01-15 更新2024-07-13 收录
下载链接:
https://figshare.scilifelab.se/articles/dataset/3_Comparative_population_transcriptomics_in_krill_orthogroups_FASTA_TSV_files_/24039510
下载链接
链接失效反馈官方服务:
资源简介:
This item contains a gzipped archive with ~13,000 orthogroups used to study molecular evolution in this project.<b>Archive:</b>krill.orthogroups.tar.gz<b>Contents of archive (FILE,SIZE,SPECIES,SAMPLES,SNPs):</b><b>krill.proteinortho.tsv</b> - the primary output table from Proteinortho. Describes which protein sequences from which species belong to the same orthogroup. Format according to the standard output of the program.<b>krill.proteinortho.tsv.seqs.csv</b> - a processed table that also contains the actual sequences line by line (see below).the <b>alignments</b> directory, which contains all OGs in unaligned and aligned files in FASTA format (see below).<b>Format of the krill.proteinortho.tsv.seqs.csv table</b>The fields are:NR = orthogroup numberORTHO_GROUP = orthogroup IDN_SPECIES = the number of speciesN_GENES = the number of genes/sequences in this orthogroupN_MATCHING[o] = number of sequences matching outgroup species for this orthogroupN_NON_MATCHING = number of sequences matching ingroup species for this orthogroupHEADER = the name of this particular sequenceSEQ = the protein sequence<b>Contents of the alignments directory</b>Each orthogroup is represented by up to four FASTA files:OG*.cds.ginsi.fasta.orig = the original, unaligned and unfiltered sequencesOG*.cds.ginsi.fasta = the aligned and filtered sequencesOG*.cds.ginsi.fasta.without_cold_euphausia.fasta = the aligned and filtered sequences after removing cold-associated <i>Euphausia</i> speciesOG*.cds.ginsi.fasta.without_cold_thysanoessa.fasta = the aligned and filtered sequences after removing cold-associated <i>Thysanoessa</i> species
提供机构:
Uppsala University
创建时间:
2023-10-19



