five

Cross-Modality TFM Pipeline — Reproducibility Bundle (NeurIPS 2026 submission)

收藏
DataCite Commons2026-05-02 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.19872885
下载链接
链接失效反馈
官方服务:
资源简介:
Cross-Modality TFM Pipeline — Reproducibility Bundle This archive contains the data needed to reproduce the headline results of the NeurIPS 2026 ED Track submission "When Tabular Foundation Models Transfer Across Modalities: A Systematic Evaluation Across 95 Datasets". Contents data/ — extracted features (.npz) for the 35 Panel A datasets, one directory per dataset. Each features.npz contains the feature matrix (already passed through the relevant frozen encoder: DINOv2, CLIP, AST, Wav2Vec2, HuBERT, SBERT, ChemBERTa, Phikon, mel-stats, MFCC, or catch22) with train/val/test splits and labels. results/ — 24 result parquet files containing per-cell scores, baselines, cell verdicts, and aggregate panels. The headline numbers in the paper (94.3% / 77.1% / 96.6% / 91.5%) are computed from panel_a_canonical.parquet and panel_b_canonical.parquet. Reproducing the headline results The reproduction scripts and full pipeline code are in the anonymized code repository (URL provided in the supplementary material of the paper). With the code repo cloned and this archive extracted alongside, the headline numbers are reproduced by: python supplementary/reproduce_main_results.py --quick# Expects:#   Panel A oracle   33/35 = 94.3%#   Panel A deployed 27/35 = 77.1%#   Panel B oracle   57/59 = 96.6%#   Panel B deployed 54/59 = 91.5% For the full pipeline re-run (ETF training + TabICL inference per dataset), see the README in the code repo. License This archive is released under the Creative Commons Attribution 4.0 International License (CC-BY 4.0). Source datasets remain under their original licenses; users are responsible for complying with each upstream dataset's terms when using individual feature files. Authors Anonymous Authors (NeurIPS 2026 double-blind submission). Author identities will be disclosed after acceptance.
提供机构:
Zenodo
创建时间:
2026-04-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作