five

ArielPorath/vcrop-smoke-test

收藏
Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/ArielPorath/vcrop-smoke-test
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含三个配置:folds、ntv3_8m_smoke和raw,用于序列分类任务。folds配置提供四折交叉验证分割,每个折叠包含训练、评估和测试集,每个示例有序列字符串、整数标签和来源字符串特征。ntv3_8m_smoke配置在folds基础上增加了NLP预处理特征,包括输入ID列表、注意力掩码列表和嵌入列表,适用于基于Transformer模型的实验。raw配置包含原始训练数据,共100个示例。数据集总下载大小约为940KB,总数据集大小约为780KB。

This dataset includes three configurations: folds, ntv3_8m_smoke, and raw, designed for sequence classification tasks. The folds configuration provides four-fold cross-validation splits, each with training, evaluation, and test sets, featuring sequence strings, integer labels, and origin strings. The ntv3_8m_smoke configuration adds NLP preprocessed features such as input IDs lists, attention mask lists, and embedding lists, suitable for Transformer-based model experiments. The raw configuration contains raw training data with 100 examples. The total download size is approximately 940KB, and the total dataset size is about 780KB.
提供机构:
ArielPorath
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作