five

Data supporting: Chromosome-level genome of the transformable northern wattle, Acacia crassicarpa

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
http://datadryad.org/dataset/doi%253A10.5061%252Fdryad.573n5tbdr
下载链接
链接失效反馈
官方服务:
资源简介:
The genus Acacia is a large group of woody legumes containing an enormous amount of morphological diversity in leaf shape. This diversity is at least in part the result of an innovation in leaf development where many Acacia species are capable of developing leaves of both bifacial and unifacial morphology. While not unique in the plant kingdom, unifaciality is most commonly associated with monocots, and its developmental genetic mechanisms have yet to be explored beyond this group. Here we identify an accession of Acacia crassicarpa with high regeneration rates and isolate a clone for genome sequencing. We generate a chromosome-level assembly of this readily transformable clone and using comparative analyses confirm a whole genome duplication unique to Caesalpinoid legumes. This resource will be important for future work examining genome evolution in legumes and the unique developmental genetic mechanisms underlying unifacial morphogenesis in Acacia. Methods The genomes were assembled using hifiasm using Omni-C reads to generate haplotype-resolved assemblies. The larger of the two haplotype assemblies was then used for scaffolding using SALSA and the Omni-C reads. Masked assemblies were generated using RepeatMasker (v.4.0.7) using an A. crassicarpa de novo repeat library made with RepeatModeler (v.2.0.1). BRAKER3 was used to identify protein coding genes of the softmasked genome on scaffolds larger than 1 Mb. The dataset contains the following: Acra_USDA_v1.100k.fa                            Scaffolds larger than 100 kb from the SALSA scaffolding step. Acra_USDA_v1.1M.hrd.msk                     Hardmasked scaffolds larger than 1 Mb. Acra_USDA_v1.1M.sft.msk                      Softmasked scaffolds larger than 1 Mb. Acra_USDA_v1_cds.fa                             Coding sequences for Acra_USDA_v1.fsa assembly. Acra_USDA_v1.fsa                                   Scaffolds larger than 1 Mb. No masking. Acra_USDA_v1.gtf                                    Annotations for protein coding genes from the Acra_USDA_v1.fsa assembly. Acra_USDA_v1.hap1.fa                            Haplotype 1 before SALSA scaffolding (no size filtering). Acra_USDA_v1.hap2.fa                            Haplotype 2 before SALSA scaffolding (no size filtering). Acra_USDA_v1_proteins.fa                      Protein sequences for Acra_USDA_v1.fsa assembly. Assembly_pipeline_ACRA3RX.txt            Computational pipeline for assembling and annotating genome.
创建时间:
2023-11-27
二维码
社区交流群
二维码
科研交流群
商业服务