
ORFs and annotations of the single-end transcriptome derived from the forward unpaired reads of Savalia savaglia RNAseq data

Source: Mendeley Data (indexed 2026-04-18)
Download link:
https://data.mendeley.com/datasets/3rtbr7c9s8
Description:
This dataset contains Open Reading Frames (ORFs) and annotations for the original single-end transcriptome assembly, which was built from the forward reads of broken paired-end RNA-seq data of the false black coral Savalia savaglia. The dataset includes the following files:

From TransDecoder analyses:
• Assembly_Ss_SE.Trinity.fasta.transdecoder.cds: Nucleotide sequences for the coding regions of the final candidate ORFs.
• Assembly_Ss_SE.Trinity.fasta.transdecoder.gff3: Positions of the final selected ORFs within the target transcripts.
• Assembly_Ss_SE.Trinity.fasta.transdecoder.pep: Peptide sequences for the final candidate ORFs, with shorter candidates nested within longer ORFs removed.
• Assembly_Ss_SE.Trinity.fasta.transdecoder.bed: BED-formatted file describing ORF positions, suitable for viewing in GenomeView or IGV.
• blastp.outfmt6.w_pct_hit_length: BLASTp results extended with the length of the top hit and the percentage of that length covered by the alignment.
• pfam.domtblout: PFAM domain annotations for the predicted proteins.

From Trinotate analyses:
• myTrinotate_SE_Ss.tsv: Comprehensive Trinotate annotation table, including protein domain identification and other annotations.
• Trinotate_SE_Ss_report.cXp_summary.html: HTML report summarizing the Trinotate annotation results, with an overview of the functional annotations and transcript features.
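As a quick sanity check after downloading, the peptide file can be read with a few lines of plain Python. The following is a minimal sketch, not part of the dataset: the helper names `read_fasta` and `peptide_lengths` are hypothetical, and it assumes the `.pep` file is ordinary FASTA in which a trailing `*` (stop codon marker), when present, should not count toward peptide length.

```python
import os
from typing import Dict, Iterable


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into a {record_id: sequence} mapping.

    The record ID is the first whitespace-delimited token after '>'.
    """
    records: Dict[str, str] = {}
    current_id = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if current_id is not None:
                records[current_id] = "".join(chunks)
            current_id = line[1:].split()[0]
            chunks = []
        elif current_id is not None:
            chunks.append(line)
    if current_id is not None:
        records[current_id] = "".join(chunks)
    return records


def peptide_lengths(records: Dict[str, str]) -> Dict[str, int]:
    """Return peptide lengths, ignoring a trailing '*' if one is present."""
    return {rid: len(seq.rstrip("*")) for rid, seq in records.items()}


if __name__ == "__main__":
    # File name as listed in the dataset description; adjust the path as needed.
    path = "Assembly_Ss_SE.Trinity.fasta.transdecoder.pep"
    if os.path.exists(path):
        with open(path) as handle:
            lengths = peptide_lengths(read_fasta(handle))
        print(f"{len(lengths)} candidate ORFs")
        if lengths:
            print(f"longest peptide: {max(lengths.values())} aa")
```

The same `read_fasta` helper should also work on the `.cds` file, since it is FASTA as well; in that case the sequence lengths are in nucleotides rather than amino acids.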

Created:
2024-10-10