lingvenvist/animacy-nl-gold-standard-mid
收藏Hugging Face2024-07-14 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/lingvenvist/animacy-nl-gold-standard-mid
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含句子、标记、动画标签和目标索引等特征。数据集被分为训练集、测试集和验证集,其中训练集有10034个示例,测试集有2150个示例,验证集有2151个示例。总下载大小为2457408字节,数据集总大小为5009321字节。
The dataset includes four main features: sentences, tokens, anim_tags, and target-indexes. Sentences are of string type, tokens are sequences of strings, anim_tags are sequences of class labels with three categories (N, A, H), and target-indexes are sequences of integers. The dataset is divided into train, test, and validation sets, containing 10034, 2150, and 2151 samples respectively. The total download size of the dataset is 2457408 bytes, and the total size is 5009321 bytes. The dataset configuration is set to default, with data files stored in corresponding split directories.
提供机构:
lingvenvist
原始信息汇总
数据集概述
数据集特征
- sentences: 字符串类型
- tokens: 字符串序列
- anim_tags: 分类标签序列
- 标签名称:
- 0: N
- 1: A
- 2: H
- 标签名称:
- target-indexes: 整数序列
数据集分割
- train:
- 字节数: 3488455
- 样本数: 10034
- test:
- 字节数: 757650
- 样本数: 2150
- validation:
- 字节数: 763216
- 样本数: 2151
数据集大小
- 下载大小: 2457408 字节
- 数据集总大小: 5009321 字节
配置
- config_name: default
- 数据文件路径:
- train: data/train-*
- test: data/test-*
- validation: data/validation-*
- 数据文件路径:



