Optimization of miRNA-seq Data Pre-Processing. Homo sapiens

NIAID Data Ecosystem2026-03-08 收录

下载链接：

https://www.ncbi.nlm.nih.gov/bioproject/PRJNA278977

下载链接

链接失效反馈

官方服务：

资源简介：

Next-generation sequencing is currently the platform of choice for the discovery and quantification of miRNAs. Despite this, there is no clear consensus on how the data should be pre-processed prior to conducting downstream analyses. Often overlooked, data pre-processing is an essential step in data analysis: the presence of unreliable features and noise can affect the conclusions drawn from downstream analyses. Using a spike-in dilution study, we evaluated the effects of several general-purpose aligners (BWA, Bowtie, Bowtie 2 and Novoalign), and normalization methods (counts-per-million, total count scaling, upper quartile scaling, Trimmed Mean of M, DESeq, linear regression, cyclic loess and quantile) with respect to the final miRNA count data distribution, variance, bias and accuracy of differential expression analysis. Overall design: A spike-in dataset was created using a 12x12 cyclic Latin Square design. Twelve miRNAs from the Arabidopsis Thaliana genome that are not present in the human genome were selected as spike-in sequences. RNA oligonucleotides were synthesized with phosphorylated 5’ ends and added at 12 different concentrations (0, 0.1, 0.2, 0.8, 1.6, 6.4, 12.8, 51.2, 102.4, 409.6, 819.2, 3276.8 amol) to 1μg Universal Human Reference RNA, with each concentration appearing once in each row and column of the design matrix.

创建时间：

2015-03-19

5,000+

优质数据集

54 个

任务类型

进入经典数据集