Short Tailed Shearwater DREAM gene sequencing fastq files
收藏Research Data Australia2024-12-14 收录
下载链接:
https://researchdata.edu.au/short-tailed-shearwater-fastq-files/1356120
下载链接
链接失效反馈官方服务:
资源简介:
This data set includes unprocessed sample .fastq files from two separate Illumina NextSeq runs, labelled as 'Run_1' and 'Run_2', respectively.Sample names: e.g. STS15059, 'STS' is the abbreviation of Short-tailed shearwater. The first two digits of the numeric refer to the year of collection e.g. '15' = 2015. Finally, the following number refers to the sequential unique ID for that year, e.g. '059' is the fifty-ninth sample for the years' collection.Leg bands are also recorded and are generally a 5-digit number and are unique to the individual bird. Longitudinal samples can be identified using these band IDs. E.g. in Run_2, an individual with the band number: 52196, was collected in 2015 as 'STS15065' and again in 2017 as 'STS17044'.Run_1: N = 35 individual samples are split across 4 lanes e.g. 'STS16020_S35_L001(/L002/L003/L004)_R1_001/fastq' and need to be merged before conversion to .fasta format and downstream analysis.Run_2: N = 36 individual samples were provided as a single merged file from the service provider, e.g. 'STS15059_S34_R1_001.fastq'.Sample_info: This excel spreadsheet has information on samples as follows: 'Band': 5-digit number on leg band.'Sample': Sample number within run. 'UID': The unique ID for collection year e.g. STS15007.'Age': The known-age of the animal rounded to whole year.'Index (NebNext)': The NEB index used for NGS sample identification.'Note': Additional information on if a sample was a between or within run replicate or longitudinal replicate.Analysis of these data were published in: R. De Paoli-Iseppi et al. 2019. Age estimation in a long-lived seabird (Ardenna tenuirostris) using DNA methylation-based biomarkers, Molecular Ecology Resources
本数据集包含来自两次独立Illumina NextSeq测序的未处理样本FASTQ(.fastq)文件,分别标记为Run_1与Run_2。
样本命名示例为STS15059,其中“STS”为短尾鹱(Short-tailed shearwater)的缩写。样本编号的前两位数字代表采集年份,例如“15”即对应2015年;后续数字为该年度采集样本的连续唯一ID,例如“059”代表该年度的第59份采集样本。
数据集同时记录了样本个体的脚环信息:脚环通常为5位数字,且每只个体的脚环编号唯一,可通过脚环ID识别纵向采集的同个体样本。例如在Run_2中,脚环编号为52196的个体,于2015年以STS15065的样本名被采集,又于2017年以STS17044的样本名再次被采集。
Run_1包含35份独立样本,这些样本被分配至4个测序泳道,示例文件名格式为"STS16020_S35_L001(/L002/L003/L004)_R1_001.fastq",需先进行合并,再转换为FASTA(.fasta)格式以开展后续分析。
Run_2包含36份独立样本,均由测序服务方提供为单份合并文件,示例文件名格式为"STS15059_S34_R1_001.fastq"。
样本信息表:本Excel电子表格包含如下样本相关信息:
- 脚环(Band):样本个体的5位数字脚环编号
- 样本号(Sample):对应测序泳道内的样本编号
- 唯一采集ID(UID):对应采集年度的唯一样本ID,示例为STS15007
- 年龄(Age):经确认的动物实际年龄,已取整至整年
- 索引(NebNext):用于下一代测序(NGS)样本识别的NEB索引
- 备注(Note):补充说明样本是否为批间重复、批内重复或纵向重复样本
本数据集的相关分析成果已发表于:R. De Paoli-Iseppi 等,2019年。《基于DNA甲基化生物标志物的长寿命海鸟(Ardenna tenuirostris,短尾鹱)年龄估算》,《Molecular Ecology Resources(分子生态学资源)》
提供机构:
Australian Antarctic Division



