five

Supporting data for "BrumiR: A toolkit for de novo discovery of microRNAs from sRNA-seq data."

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/102250
下载链接
链接失效反馈
官方服务:
资源简介:
MicroRNAs (miRNAs) are small non-coding RNAs that are key players in the regulation of gene expression. In the last decade, with the increasing accessibility of high-throughput sequencing technologies, different methods have been developed to identify miRNAs, most of which rely on pre-existing reference genomes. However, when a reference genome is absent or is not of high quality, such identification becomes more difficult. In this context, we developed BrumiR, an algorithm that is able to discover miRNAs directly and exclusively from sRNA-seq data. We benchmarked BrumiR with datasets encompassing animal and plant species using real and simulated sRNA-seq experiments. The results demonstrate that BrumiR reaches the highest recall for miRNA discovery, while at the same time being much faster and more efficient than the state-of-the-art tools evaluated. The latter allows BrumiR to analyze a large number of sRNA-seq experiments, from plants or animal species. Moreover, BrumiR detects additional information regarding other expressed sequences (sRNAs, isomiRs, etc.), thus maximizing the biological insight gained from sRNA-seq experiments. Additionally, when a reference genome is available, BrumiR provides a new mapping tool (BrumiR2ref) that performs an <i>a posteriori</i> exhaustive search to identify the precursor sequences.<br>Finally, we also provide a machine learning classifier based on a Random Forest model that evaluates the sequence-derived features to further refine the prediction obtained from BrumiR-core. The code of BrumiR and all the algorithms that compose the BrumiR-toolkit are freely available in GitHub.
提供机构:
GigaScience Database
创建时间:
2022-08-30
二维码
社区交流群
二维码
科研交流群
商业服务