five

Optimization of miRNA-seq Data Pre-Processing. Homo sapiens

收藏
NIAID Data Ecosystem2026-03-08 收录
下载链接:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA278977
下载链接
链接失效反馈
官方服务:
资源简介:
Next-generation sequencing is currently the platform of choice for the discovery and quantification of miRNAs. Despite this, there is no clear consensus on how the data should be pre-processed prior to conducting downstream analyses. Often overlooked, data pre-processing is an essential step in data analysis: the presence of unreliable features and noise can affect the conclusions drawn from downstream analyses. Using a spike-in dilution study, we evaluated the effects of several general-purpose aligners (BWA, Bowtie, Bowtie 2 and Novoalign), and normalization methods (counts-per-million, total count scaling, upper quartile scaling, Trimmed Mean of M, DESeq, linear regression, cyclic loess and quantile) with respect to the final miRNA count data distribution, variance, bias and accuracy of differential expression analysis. Overall design: A spike-in dataset was created using a 12x12 cyclic Latin Square design. Twelve miRNAs from the Arabidopsis Thaliana genome that are not present in the human genome were selected as spike-in sequences. RNA oligonucleotides were synthesized with phosphorylated 5’ ends and added at 12 different concentrations (0, 0.1, 0.2, 0.8, 1.6, 6.4, 12.8, 51.2, 102.4, 409.6, 819.2, 3276.8 amol) to 1μg Universal Human Reference RNA, with each concentration appearing once in each row and column of the design matrix.
创建时间:
2015-03-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作