
Scalable and cost-efficient custom gene library assembly from oligopools

Source: DataCite Commons. Updated: 2026-04-09. Indexed: 2026-04-25.
Download link:
https://datadryad.org/dataset/doi:10.5061/dryad.bk3j9kdrh
Description:
This dataset contains all next-generation sequencing (NGS) data generated to evaluate the OMEGA (Oligo-based Multiplexed Efficient Gene Assembly) platform for multiplexed gene library construction. It includes PacBio HiFi long-read sequencing (BAM with index and metadata), Illumina paired-end sequencing (FASTQ), and Oxford Nanopore long-read sequencing (FASTQ). The data cover three experiment types: assembly validation (Rubisco and Cas9 libraries), amplicon-based quality control (PCR and JSCAN), and functional screening of GFP variant libraries. Files are organized by sequencing platform and experiment type, and include replicate-level presort libraries and fluorescence-activated cell sorting (FACS) populations spanning multiple fluorescence bins and negative controls. These files enable reconstruction of full-length variants, quantification of assembly accuracy and uniformity, and analysis of sequence–function relationships. All data are in standard DNA sequencing formats compatible with common open-source tools and are suitable for benchmarking gene assembly methods, developing analysis pipelines, and training machine learning models. All sequences are synthetic or non-pathogenic research constructs; no human or clinical data are included, and there are no ethical or legal restrictions on reuse.
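As a rough illustration of how the FASTQ files could be used to quantify library uniformity, the sketch below counts identical read sequences and reports the fraction of variants whose count falls within a fold-range of the median. It is not the authors' pipeline: the function names, the 2-fold threshold, and the exact-match counting are assumptions made for illustration, and a real analysis would more likely align reads to the designed variants first. The inline records stand in for a real file, which would typically be opened with `gzip.open(path, "rt")`.

```python
import io
from collections import Counter
from statistics import median


def read_fastq(handle):
    """Yield (read_id, sequence, quality) from a 4-line-per-record FASTQ stream."""
    while True:
        header = handle.readline()
        if not header:
            return  # end of file
        if not header.strip():
            continue  # tolerate blank lines between records
        seq = handle.readline().strip()
        handle.readline()  # '+' separator line, ignored
        qual = handle.readline().strip()
        yield header.strip()[1:], seq, qual


def uniformity(counts, fold=2.0):
    """Fraction of variants whose count lies within `fold` of the median count.

    The fold threshold is a hypothetical choice, not one taken from the dataset.
    """
    values = list(counts.values())
    mid = median(values)
    within = [v for v in values if mid / fold <= v <= mid * fold]
    return len(within) / len(values)


# Tiny made-up example standing in for a real FASTQ file.
example = io.StringIO(
    "@r1\nACGT\n+\nIIII\n"
    "@r2\nACGT\n+\nIIII\n"
    "@r3\nTTGA\n+\nIIII\n"
    "@r4\nTTGA\n+\nIIII\n"
    "@r5\nGGCC\n+\nIIII\n"
)
counts = Counter(seq for _, seq, _ in read_fastq(example))
score = uniformity(counts)
print(counts.most_common(), score)
```

Exact-sequence counting is only meaningful for high-accuracy reads; for noisier long reads, grouping by alignment to a reference design would be the safer choice.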
Provider:
Dryad
Created:
2026-04-09