Fasting northern elephant seal pup blubber transcriptome
收藏NIAID Data Ecosystem2026-03-10 收录
下载链接:
https://figshare.com/articles/dataset/Fasting_northern_elephant_seal_pup_blubber_transcriptome/5746227
下载链接
链接失效反馈官方服务:
资源简介:
De novo Trinity transcriptome assembly from blubber (adipose tissue) collected from weaned northern elephant seal (Mirounga angustirostris, NES) pups during their post-weaning fast. Samples were collected from independent cohorts of 6 pups each, "early fasting" (1-2 weeks post weaning) and "late fasting" (6-8 weeks post weaning) and total RNA was isolated (RIN: 7.6-9.0). Twelve strand-specific libraries were prepared according to Illumina protocol with Ribo-zero depletion (human/rat/mouse), and sequenced (125 bp paired-end reads) on one lane of Illumina HiSeq 2000, producing an average of 41.5 million reads per sample. De novo assembly was conducted using all 12 samples and Trinity v2.1.1 with default settings, including adapter trimming ,but without abundance normalization. The assembly contains 1,830,330 transcripts (contigs) in 1,635,200 gene clusters with mean and median contig length of 835 bp and 425 bp, respectively. Mapping rate of reads to assembly was 87% and the assembly contains 81.2% complete, 36.3% duplicated, and 15.5% fragmented vertebrate BUSCOs, with only 3.2% missing from this dataset. The assembly was annotated by BLASTx using DIAMOND v0.8.31 with "very sensitive" option against the UniProt/SwissProt database (downloaded 8/20/17), with e-value threshold of 0.001. We identified 389,363 homologs in the transcriptome. The raw assembly file is called FastingNESPupTrinityAssembly.gz and the annotation file is called FastingNESPupAssemblyAnnotation.xlsx (also available as a tab delimited text file).
本数据集针对断奶北象海豹(*Mirounga angustirostris*,北象海豹,NES)幼崽断奶后禁食阶段采集的鲸脂(脂肪组织)样本,进行了从头Trinity转录组组装。样本分为两组独立队列,每组各6只幼崽,分别为「早期禁食组」(断奶后1~2周)与「晚期禁食组」(断奶后6~8周),并提取了总RNA(RNA完整性数值,RNA Integrity Number, RIN:7.6~9.0)。共制备12个链特异性文库,遵循Illumina建库流程,采用Ribo-zero(人/大鼠/小鼠)试剂盒进行核糖体RNA去除,随后使用Illumina HiSeq 2000的单个测序通道进行测序,获得125 bp双端reads,每个样本平均产出4150万条reads。采用全部12个样本的数据,以Trinity v2.1.1默认参数开展从头组装,该流程包含接头修剪步骤,但未执行丰度标准化。最终组装结果包含1,830,330条转录本(重叠群,contigs),归类于1,635,200个基因簇;转录本的平均长度与中位长度分别为835 bp与425 bp。将reads比对至组装序列的比对率为87%;该组装覆盖81.2%的完整脊椎动物单拷贝同源基因基准集(BUSCO, Benchmarking Universal Single-Copy Orthologs),36.3%为重复拷贝,15.5%为碎片化序列,本数据集仅缺失3.2%的脊椎动物BUSCO。本组装通过DIAMOND v0.8.31的BLASTx「极敏感」模式,针对2017年8月20日下载的UniProt/SwissProt数据库进行注释,设置e-value阈值为0.001,最终在该转录组中鉴定出389,363个同源转录本。原始组装文件命名为FastingNESPupTrinityAssembly.gz,注释文件命名为FastingNESPupAssemblyAnnotation.xlsx(亦可通过制表符分隔的文本文件格式获取)。
创建时间:
2018-01-20



