five

Mastacembelus armatus (zig-zag eel) genome assembly, fMasArm1.2, alternate haplotype. fMasArm1.2 alternate haplotype

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://www.ncbi.nlm.nih.gov/bioproject/PRJEB31553
下载链接
链接失效反馈
官方服务:
资源简介:
This project provides the genome assembly of Mastacembelus armatus, common name zig-zag eel, based on a sample provided by Byrappa Venkatesh. The assembly fMasArm1.2 is based on ~52x PacBio Sequel data, ~99x coverage Illumina HiSeqX data from a 10X Genomics Chromium library generated at the Wellcome Sanger Institute, as well as two-enzyme BioNano Saphyr data generated at the Rockefeller University Vertebrate Genome Laboratory and ~245x coverage HiSeqX data from a Hi-C library prepared by Arima Genomics. An initial PacBio assembly was made using Falcon-unzip and the primary contigs extended by merging with a miniasm assembly. The contigs were then scaffolded using the 10X data with scaff10x, and using synteny with Lates calcarifer (Asian seabass). After using the PacBio data to gap fill with PBJelly and polish with Arrow, the assembly was polished again using the 10X Illumina data and freebayes. Finally, the assembly was further manually curated using gEVAL to correct mis-joins and improve concordance with the BioNano and Arima Hi-C data. This was the fMasArm1.1 assembly. A updated fMasArm1.2 assembly has been further curated using the Hi-C data to separate out haplotigs, identify further breaks and joins and produce chromosomal-level scaffolds. Chromosomes have been named by synteny to medaka. The assembly is provided by the Wellcome Sanger Institute and Cambridge University team (https://www.sanger.ac.uk/science/data/vertebrate-genomes-sequencing) of the Vertebrate Genomes Project (http://vertebrategenomesproject.org). The data under this project are made available subject to the Genome10K data use policies (https://genome10k.soe.ucsc.edu/data-use-policies).
创建时间:
2019-04-01
二维码
社区交流群
二维码
科研交流群
商业服务