five

FLYNC: A Machine Learning-Driven Framework for Discovering Long Non-Coding RNAs in Drosophila melanogaster

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/SRP547743
下载链接
链接失效反馈
官方服务:
资源简介:
Non-coding RNAs have increasingly recognized roles in critical molecular mechanisms of disease. However, the non-coding genome of Drosophila melanogaster, one of the most powerful disease model organisms, has been understudied. Here, we present FLYNC – FLY Non-Coding RNA discovery and classification – a novel machine learning model that predicts the probability of a newly identified RNA transcript being a long non-coding RNA (lncRNA). Integrated into an end-to-end bioinformatics pipeline capable of processing single-cell or bulk RNA sequencing data, FLYNC outputs potential new non-coding RNA genes. FLYNC leverages large-scale genomic and transcriptomic datasets to identify patterns and features that distinguish non-coding genes from protein-coding genes, thereby facilitating lncRNA prediction. We demonstrate the application of FLYNC to publicly available Drosophila adult head bulk transcriptome and single-cell transcriptomic data from Drosophila neural stem cell lineages and identify several novel tissue- and cell-specific lncRNAs. We have further experimentally validated the existence of a set of FLYNC predicted lncRNAs by RT-PCR and RNA PolII binding. Overall, our findings demonstrate that FLYNC serves as a robust tool for identifying lncRNAs in Drosophila melanogaster, transcending current limitations in ncRNA identification and harnessing the potential of machine learning. Overall design: We performed targeted DamID experiments to determine RNA Polymerase II-subunit DNA association in neural stem cells in the Drosophila melanogaster developing brain.
创建时间:
2026-01-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作