
Supporting data for "RNAProt: An efficient and feature-rich RNA binding protein binding site predictor"

Source: DataCite Commons. Updated 2025-05-26; indexed 2025-04-15.
Download link:
http://gigadb.org/dataset/100916
Description:
CLIP-seq is the state-of-the-art technique to experimentally determine transcriptome-wide binding sites of RNA-binding proteins (RBPs). However, it relies on gene expression, which can be highly variable between conditions, and thus cannot provide a complete picture of the RBP binding landscape. This creates a demand for computational methods to predict missing binding sites. Although various methods exist, using traditional machine learning and, more recently, deep learning, we encountered several problems: many are poorly documented or maintained, making them difficult to install and use, or are not available at all. In addition, there can be efficiency issues, as well as little flexibility regarding options or supported features.

Here we present RNAProt, an efficient and feature-rich computational RBP binding site prediction framework based on recurrent neural networks (RNNs). We compare RNAProt with one traditional machine learning approach and two deep learning methods, demonstrating its state-of-the-art predictive performance, while at the same time offering better runtime efficiency. We further show that its implemented visualizations capture known binding preferences and thus can help to understand what is learned. Since RNAProt supports various additional features (including user-defined ones, which no other tool offers), we also present their influence on benchmark set performance. Finally, we show the benefits of incorporating additional features, specifically structure information, when learning the binding sites of a hairpin loop binding RBP.

RNAProt provides a complete framework for RBP binding site predictions, from dataset generation through model training to the evaluation of binding preferences and prediction. It offers state-of-the-art predictive performance as well as superior runtime efficiency, while at the same time supporting more features and input types than any other tool available so far. RNAProt is easy to install and use, comes with comprehensive documentation, and is accompanied by informative statistics and visualizations. All this makes RNAProt a valuable tool to apply in future RBP binding site research.
Provided by:
GigaScience Database
Created:
2021-07-15