Multimodal AutoML Benchmark
Source: arXiv. Updated 2021-11-04; indexed 2024-08-06.
Download link:
http://arxiv.org/abs/2111.02705v1
Description:
This collection comprises 18 multimodal tabular datasets, each containing at least one text field and drawn from real-world business applications. The datasets differ considerably in sample size, problem type (a mix of classification and regression tasks), number of features (text columns range from 1 to 28 per dataset), and how the predictive signal is split between text and numeric/categorical features.
Created:
2021-11-04



