Local synthetic datasets generation - manuscript data
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/10607868
Description:
Datasets generated for the work "Semantically Rich Local Dataset Generation for Explainable AI in Genomics".
Relevant files and directories:
- datasets.tar.gz: Full copy of the GitHub repository that additionally contains the datasets generated in the manuscript.
The repository is organized as follows:
1_hyperparameter_search - Contains the Optuna output and the datasets generated from the top 5 trials of each strategy.
2_performanceComparison - Contains the datasets generated for the top trial of each strategy across more seeds.
3_ablation_studies - Contains the output of all experiments that evaluated the impact of selected hyperparameters on the evolutionary search.
4_generalization - Datasets generated from diverse input sequences.
data/cache - FASTA file of the human genome, along with the transcript cache used to extract exon triplets.
figures.ipynb - Notebook to generate the manuscript figures.
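After downloading datasets.tar.gz, it may be useful to confirm that the entries listed above are present before extracting the whole archive. The following is a minimal sketch, not part of the original record: the function name `missing_entries` and the constant `EXPECTED` are hypothetical, and the check assumes only that each listed entry appears somewhere in the archive, possibly under a wrapper directory whose name is not stated here.

```python
import tarfile
from pathlib import PurePosixPath

# Entries named in the description above. Whether they sit at the archive
# root or under a wrapper directory is an assumption to verify.
EXPECTED = [
    "1_hyperparameter_search",
    "2_performanceComparison",
    "3_ablation_studies",
    "4_generalization",
    "data/cache",
    "figures.ipynb",
]


def missing_entries(archive_path, expected=EXPECTED):
    """Return the expected entries that match no member path in the archive."""
    with tarfile.open(archive_path, "r:gz") as tar:
        # Reading member names scans the archive but extracts nothing.
        members = [PurePosixPath(name).parts for name in tar.getnames()]

    missing = []
    for entry in expected:
        wanted = PurePosixPath(entry).parts
        # An entry counts as present if its path components occur as a
        # contiguous run inside any member path.
        found = any(
            member[i:i + len(wanted)] == wanted
            for member in members
            for i in range(len(member))
        )
        if not found:
            missing.append(entry)
    return missing
```

Calling `missing_entries("datasets.tar.gz")` returns an empty list when every listed entry is found. Because the archive includes a copy of the human genome, listing its members can take a while even though no files are written to disk.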
Created:
2024-04-12



