five

Hilbot-FI Dataset: A Low-Resource Financial Intent Classification Dataset

收藏
DataCite Commons2026-05-05 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.20019096
下载链接
链接失效反馈
官方服务:
资源简介:
The Hilbot-FI Dataset is a low-resource financial intent classification dataset designed for evaluating NLP models under realistic short-text and class-imbalanced conditions. The dataset contains 1,525 total samples across 33 intent labels, with predefined train/test splits of 1,220 training samples and 305 test samples. The processed vocabulary contains 542 unique tokens. The dataset combines structured financial-record-derived queries with conversational financial intent patterns, reflecting the mixed nature of real-world financial assistant inputs. It is intended for research on financial conversational systems, decision-support chatbots, class imbalance, short-text classification, and data-centric evaluation of classical, neural, hybrid, and transformer-based NLP models. The release includes the complete dataset, predefined train/test splits, intent patterns, a data dictionary, label definitions, citation metadata, and license information.
提供机构:
Zenodo
创建时间:
2026-05-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作