five

Data from: A foundational benchmarking workflow for transposable element discovery pipelines

收藏
DataCite Commons2026-04-16 更新2026-04-25 收录
下载链接:
https://research.repository.duke.edu/record/509
下载链接
链接失效反馈
官方服务:
资源简介:
Raw simulated sequence data (FASTAs TE insert TXT files and log files) from GARLIC (https://github.com/caballero/Garlic) simulations of five genomic models (A. thaliana D. melanogaster H. sapiens M. lucifugus &amp; S. cerevisiae).\ Downstream GFF and CSV files for the five simulated FASTA sequences produced with the associated publication's Snakemake workflow.\ TE annotation GFF produced by Earl Grey (https://github.com/TobyBaril/EarlGrey) EDTA (https://github.com/oushujun/EDTA) and RepeatModeler2 (https://github.com/Dfam-consortium/RepeatModeler). <br>TE library FASTA sequences produced by RepeatModeler2.\ Zipped archives of input test data for use with the associated Snakemake workflow (Manuscript DOI not yet available final GitHub link in progress).
提供机构:
Duke Research Data Repository
创建时间:
2026-04-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作