
Data from: A foundational benchmarking workflow for transposable element discovery pipelines

Source: DataCite Commons. Updated: 2026-04-16. Indexed: 2026-04-25.
Download link:
https://idn.duke.edu/ark:/87924/r4m61sj94
Description:
Raw simulated sequence data (FASTAs, TE insert TXT files, and log files) from GARLIC (https://github.com/caballero/Garlic) simulations of five genomic models (A. thaliana, D. melanogaster, H. sapiens, M. lucifugus, and S. cerevisiae). Downstream GFF and CSV files for the five simulated FASTA sequences, produced with the associated publication's Snakemake workflow. TE annotation GFF files produced by Earl Grey (https://github.com/TobyBaril/EarlGrey), EDTA (https://github.com/oushujun/EDTA), and RepeatModeler2 (https://github.com/Dfam-consortium/RepeatModeler). TE library FASTA sequences produced by RepeatModeler2. Zipped archives of input test data for use with the associated Snakemake workflow (manuscript DOI not yet available; final GitHub link in progress).
Provider:
Duke Research Data Repository
Created:
2025-12-22