ORFs and annotations of the single-end transcriptome derived from the forward unpaired reads of Savalia savaglia RNAseq data
收藏Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/3rtbr7c9s8
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains Open Reading Frames (ORFs) and annotations of the original single-end transcriptome outputs, obtained from the forward broken paired-end RNAseq of the false black coral Savalia savaglia. The dataset includes the following files:
From TransDecoder analyses:
• Assembly_Ss_SE.Trinity.fasta.transdecoder.cds:
Nucleotide sequences for coding regions of the final candidate ORFs.
• Assembly_Ss_SE.Trinity.fasta.transdecoder.gff3:
Positions within the target transcripts of the final selected ORFs.
• Assembly_Ss_SE.Trinity.fasta.transdecoder.pep:
Peptide sequences for the final candidate ORFs, with shorter candidates within longer ORFs removed.
• Assembly_Ss_SE.Trinity.fasta.transdecoder.bed:
BED-formatted file describing ORF positions, suitable for viewing using GenomeView or IGV.
• blastp.outfmt6.w_pct_hit_length:
File providing percentages of hit lengths from BLASTp results, including top hit's length and percent of the length covered in the alignment.
• pfam.domtblout:
PFAM domain annotations for the predicted proteins.
From Trinotate analyses:
• myTrinotate_SE_Ss.tsv:
Comprehensive annotation file with results from Trinotate, including protein domain identification and other annotations.
• Trinotate_SE_Ss_report.cXp_summary.html:
HTML report summarizing the annotation results from Trinotate, providing an overview of the functional annotations and transcript features.
本数据集包含取自假黑珊瑚(Savalia savaglia)正向断裂双端RNA测序所得原始单端转录组输出的开放阅读框(Open Reading Frames, ORFs)及其注释信息。数据集包含如下文件:
TransDecoder分析产物:
• Assembly_Ss_SE.Trinity.fasta.transdecoder.cds:最终候选开放阅读框编码区域的核苷酸序列。
• Assembly_Ss_SE.Trinity.fasta.transdecoder.gff3:最终筛选出的开放阅读框在目标转录本上的位置信息。
• Assembly_Ss_SE.Trinity.fasta.transdecoder.pep:最终候选开放阅读框的肽序列,已移除长开放阅读框内部的较短候选序列。
• Assembly_Ss_SE.Trinity.fasta.transdecoder.bed:采用BED格式的开放阅读框位置描述文件,可通过GenomeView或IGV进行可视化查看。
• blastp.outfmt6.w_pct_hit_length:包含BLASTp比对结果比对长度占比的文件,涵盖最优比对的序列长度及比对覆盖的长度百分比。
• pfam.domtblout:预测蛋白质的PFAM结构域注释文件。
Trinotate分析产物:
• myTrinotate_SE_Ss.tsv:包含Trinotate分析结果的综合注释文件,涵盖蛋白质结构域识别及其他注释信息。
• Trinotate_SE_Ss_report.cXp_summary.html:Trinotate注释结果的HTML汇总报告,可直观展示功能注释及转录本特征概况。
创建时间:
2024-10-10



