five

Reads and pairwise distances from 10 samples of diatoms in Geneva lake

收藏
DataCite Commons2025-05-16 更新2024-07-13 收录
下载链接:
https://entrepot.recherche.data.gouv.fr/citation?persistentId=doi:10.57745/NKTRHO
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains 55 hdf5 files related to 10 samples (one per month) of benthic diatoms collected in Geneva lake at monthly interval in the same location (close to UMR Carrtel on the shore of the lake). For each sample, DNA has been extracted, a fragment amplified (a marker of 312 bp in rbcL fragment), and sequenced. Next, all pairwise distances between reads have been computed (from Smith-Waterman local alignment score), within and between samples. This has led to 55 hdf5 files organized each as follows as far as h5 datasets are concerned: sequence identifiers (seqid): one h5 dataset if within a sample, two if between samples sequences (word): one h5 dataset if within a sample, two if between samples pairwise distances between sequences (h5 dataset distances). Pairwise distances have been computed through DARI project i2015037360 (8 millions of hours, 2016, give, to AF) at IDRIS on Turing and Ada machines. As there are 10 samples, there are 10 files for within sample distances, and 45 files (n(n-1)/2 with n=10) for between samples istances. There are 55 hdf5 samples, labeled L1 to L10 within each sample, and Lx_Ly beween samples, with x y . (Note that the files are ordered according to lexicographic order of their names).
提供机构:
Recherche Data Gouv
创建时间:
2023-02-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作