five

Supplemental Material for Hamlin et al., 2019

收藏
DataCite Commons2020-07-14 更新2025-04-15 收录
下载链接:
https://gsajournals.figshare.com/articles/Supplemental_Material_for_Hamlin_et_al_2019/9835208/1
下载链接
链接失效反馈
官方服务:
资源简介:
Data associated with Hamlin et al. Raw PacBio reads for the three Candida strains are available at the NCBI short-read archive (SRA) under BioProject PRJNA533645. The phased diploid assemblies for the three oak strains are associated with the overall BioProject PRJNA543321. Individual GenBank accession numbers for primary contigs and haplotigs, respectively, for each strain are as follows: NCYC 4144: GCA_005890765.1 and GCA_005890695.1; NCYC 4145: GCA_005890775.1 and GCA_005890685.1; NCYC 4146: GCA_005890745.1 and GCA_005890705.1. Coordinates of haplotigs relative to their respective primary assembly are available in Files S10, S11, and S12. Annotations of positions of centromeres, telomeric repeats, confirmed LOH regions, assembly gaps, uncertain regions with unexpectedly low heterozygosity, and regions that were not polished by FALCON-Unzip are available for primary assemblies in Files S1, S2, and S3. Annotations of unpolished regions for alternative haplotig assemblies are provided in Files S4, S5 and S6. A full description of software version numbers for phased assembly is provided in File S7 and the configuration files used to run FALCON and FALCON-Unzip are provided in Files S8 and S9.<br><br>

Hamlin等人相关数据。三种念珠菌(Candida)菌株的原始PacBio测序读段可通过NCBI短读档(SRA)获取,所属生物项目(BioProject)编号为PRJNA533645。三种橡树菌株的分相二倍体组装(phased diploid assemblies)数据与总生物项目PRJNA543321相关联。各菌株的主contig和单倍型contig(haplotig)的GenBank登录号(GenBank accession number)分别如下:NCYC 4144:GCA_005890765.1 和 GCA_005890695.1;NCYC 4145:GCA_005890775.1 和 GCA_005890685.1;NCYC 4146:GCA_005890745.1 和 GCA_005890705.1。单倍型contig相对于各自主组装的坐标信息可在文件S10、S11和S12中获取。主组装的注释信息包括着丝粒(centromere)位置、端粒重复序列(telomeric repeat)、已确认的杂合性缺失(LOH)区域、组装间隙、杂合度异常偏低的不确定区域以及未经过FALCON-Unzip抛光的区域,这些信息可在文件S1、S2和S3中获取。替代单倍型contig组装中未抛光区域的注释信息可在文件S4、S5和S6中获取。分相组装所用软件的版本号完整说明见文件S7,运行FALCON和FALCON-Unzip的配置文件见文件S8和S9。
提供机构:
GSA Journals
创建时间:
2019-09-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作