The de novo assembly of a European wild boar genome revealed unique patterns of chromosomal structural variations and segmental duplications
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/records/10905692
下载链接
链接失效反馈官方服务:
资源简介:
https://onlinelibrary.wiley.com/doi/10.1111/age.13181
The de novo assembly of a European wild boar genome revealed unique patterns of chromosomal structural variations and segmental duplications
Jianhai Chen, Jie Zhong, Xuefei He, Xiaoyu Li, Pan Ni, Toni Safner, Nikica Šprem, Jianlin Han
The rapid progress of sequencing technology has greatly facilitated the de novo genome assembly of pig breeds. However, the assembly of the wild boar genome is still lacking, hampering our understanding of chromosomal and genomic evolution during domestication from wild boars into domestic pigs. Here, we sequenced and de novo assembled a European wild boar genome (ASM2165605v1) using the long-range information provided by 10× Linked-Reads sequencing. We achieved a high-quality assembly with contig N50 of 26.09 Mb. Additionally, 1.64% of the contigs (222) with lengths from 107.65 kb to 75.36 Mb covered 90.3% of the total genome size of ASM2165605v1 (~2.5 Gb). Mapping analysis revealed that the contigs can fill 24.73% (93/376) of the gaps present in the orthologous regions of the updated pig reference genome (Sscrofa11.1). We further improved the contigs into chromosome level with a reference-assistant scaffolding method. Using the ‘assembly-to-assembly’ approach, we identified intra-chromosomal large structural variations (SVs, length >1 kb) between ASM2165605v1 and Sscrofa11.1 assemblies. Interestingly, we found that the number of SV events on the X chromosome deviated significantly from the linear models fitting autosomes (R2 > 0.64, p < 0.001). Specifically, deletions and insertions were deficient on the X chromosome by 66.14 and 58.41% respectively, whereas duplications and inversions were excessive on the X chromosome by 71.96 and 107.61% respectively. We further used the large segmental duplications (SDs, >1 kb) events as a proxy to understand the large-scale inter-chromosomal evolution, by resolving parental-derived relationships for SD pairs. We revealed a significant excess of SD movements from the X chromosome to autosomes (p < 0.001), consistent with the expectation of meiotic sex chromosome inactivation. Enrichment analyses indicated that the genes within derived SD copies on autosomes were significantly related to biological processes involving nervous system, lipid biosynthesis and sperm motility (p < 0.01). Together, our analyses of the de novo assembly of ASM2165605v1 provides insight into the SVs between European wild boar and domestic pig, in addition to the ongoing process of meiotic sex chromosome inactivation in driving inter-chromosomal interaction between the sex chromosome and autosomes.
The work has been pulished here: https://onlinelibrary.wiley.com/doi/full/10.1111/age.13181
The current dataset include the genome annotation files.
For the whole-genomic assembly, please check NCBI:
https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_021656055.1/
GenBank
Genome size
2.5 Gb
Total ungapped length
2.4 Gb
Number of scaffolds
12,642
Scaffold N50
28.3 Mb
Scaffold L50
25
Number of contigs
41,323
Contig N50
157.9 kb
Contig L50
4,562
GC percent
42
Genome coverage
56.0x
Assembly level
Scaffold
Assembly methods
Sequencing technology 10xgenomics Assembly method Supernova v. 2.1.1
part_** are genome fasta for the GCA_021656055.1
You could use the following to combine and uncompress.
cat part_* > archive_combined.zip
unzip archive_combined.zip
创建时间:
2024-04-07



