five

Supporting data for "Streamlining remote nanopore data access with slow5curl"

收藏
DataCite Commons2025-05-26 更新2024-07-13 收录
下载链接:
http://gigadb.org/dataset/102514
下载链接
链接失效反馈
官方服务:
资源简介:
As adoption of nanopore sequencing technology continues to advance, the need to maintain large volumes of raw current signal data for reanalysis with updated algorithms is a growing challenge. Here we introduce slow5curl, a software package designed to streamline nanopore data sharing, accessibility and reanalysis. <i>Slow5curl</i> allows a user to fetch a specified read or group of reads from a raw nanopore dataset stored on a remote server, such as a public data repository, without downloading the entire file. <i>Slow5curl</i> uses an index to quickly fetch specific reads from a large dataset in SLOW5/BLOW5 format and highly parallelised data access requests to maximise download speeds. Using all public nanopore data from the Human Pangenome Reference Consortium (&gt;22 TB), we demonstrate how <i>Slow5curl</i> can be used to quickly fetch and reanalyse raw signal reads corresponding to a set of target genes from each individual in large cohort dataset (<i>n</i> = 91), minimising the time, egress costs, and local storage requirements for their reanalysis. We provide <i>Slow5curl</i> as a free, open-source package that will reduce frictions in data sharing for the nanopore community.
提供机构:
GigaScience Database
创建时间:
2024-03-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作