prakod/GLuecos_POS_EN_HI_UD
收藏Hugging Face2024-05-31 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/prakod/GLuecos_POS_EN_HI_UD
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: words
sequence: string
- name: label1
sequence: string
- name: label2
sequence: string
splits:
- name: dev_Devanagari
num_bytes: 78683
num_examples: 201
- name: train_Devanagari
num_bytes: 466424
num_examples: 1312
- name: test_Devanagari
num_bytes: 78612
num_examples: 225
download_size: 135888
dataset_size: 623719
configs:
- config_name: default
data_files:
- split: dev_Devanagari
path: data/dev_Devanagari-*
- split: train_Devanagari
path: data/train_Devanagari-*
- split: test_Devanagari
path: data/test_Devanagari-*
---
The dataset includes three main features: words, label1, and label2, all of which are string sequences. The dataset is divided into three parts: development set (dev_Devanagari), training set (train_Devanagari), and test set (test_Devanagari). Each part has specific byte counts and example counts. The download size of the dataset is 135888 bytes, and the total size is 623719 bytes. The dataset configuration is set to default, with data file paths specified according to different splits.
提供机构:
prakod
原始信息汇总
数据集概述
数据集特征
- words: 字符串序列
- label1: 字符串序列
- label2: 字符串序列
数据集分割
- dev_Devanagari:
- 示例数量: 201
- 数据大小: 78683 字节
- train_Devanagari:
- 示例数量: 1312
- 数据大小: 466424 字节
- test_Devanagari:
- 示例数量: 225
- 数据大小: 78612 字节
数据集大小
- 下载大小: 135888 字节
- 总数据集大小: 623719 字节
配置文件
- config_name: default
- data_files:
- dev_Devanagari: data/dev_Devanagari-*
- train_Devanagari: data/train_Devanagari-*
- test_Devanagari: data/test_Devanagari-*



