FanChen0116/bus_few4_40x_pvi
Source: Hugging Face. Updated 2023-09-26; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/FanChen0116/bus_few4_40x_pvi
Resource description:
---
dataset_info:
  features:
  - name: id
    dtype: int64
  - name: tokens
    sequence: string
  - name: labels
    sequence:
      class_label:
        names:
          '0': O
          '1': I-from_location
          '2': B-from_location
          '3': B-leaving_date
          '4': I-leaving_date
          '5': I-to_location
          '6': B-to_location
  - name: request_slot
    sequence: string
  splits:
  - name: train
    num_bytes: 345681
    num_examples: 1400
  - name: validation
    num_bytes: 6900
    num_examples: 35
  - name: test
    num_bytes: 70618
    num_examples: 377
  download_size: 45026
  dataset_size: 423199
---
# Dataset Card for "bus_few4_40x_pvi"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset has four features: id (int64), tokens (a sequence of strings), labels (a sequence of class labels: O plus B-/I- tags for the from_location, to_location, and leaving_date slots), and request_slot (a sequence of strings). It is divided into three splits: train (1400 examples), validation (35 examples), and test (377 examples). The download size is 45026 bytes, and the dataset size is 423199 bytes.
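The label IDs map to B-/I-/O tag names, so a tagged token sequence can be grouped back into slot values. The following is a minimal sketch of that decoding. It assumes the usual BIO convention (a B- tag opens a span, matching I- tags continue it), which the card itself does not spell out. The function name `decode_slots` and the example sentence are hypothetical and not taken from the dataset.

```python
from typing import List, Tuple

# Label names copied from the class_label mapping in the dataset card.
LABEL_NAMES = [
    "O",
    "I-from_location",
    "B-from_location",
    "B-leaving_date",
    "I-leaving_date",
    "I-to_location",
    "B-to_location",
]


def decode_slots(tokens: List[str], label_ids: List[int]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (slot_name, text) pairs (hypothetical helper)."""
    slots: List[Tuple[str, str]] = []
    current_name = None
    current_tokens: List[str] = []
    for token, label_id in zip(tokens, label_ids):
        prefix, _, name = LABEL_NAMES[label_id].partition("-")
        if prefix == "B":
            # A B- tag closes any open span and starts a new one.
            if current_name is not None:
                slots.append((current_name, " ".join(current_tokens)))
            current_name, current_tokens = name, [token]
        elif prefix == "I" and name == current_name:
            # A matching I- tag extends the open span.
            current_tokens.append(token)
        else:
            # "O" or a stray I- tag closes any open span.
            if current_name is not None:
                slots.append((current_name, " ".join(current_tokens)))
            current_name, current_tokens = None, []
    if current_name is not None:
        slots.append((current_name, " ".join(current_tokens)))
    return slots


# Hypothetical example sentence, not a row from the dataset.
tokens = ["bus", "from", "san", "jose", "to", "fresno", "on", "march", "3rd"]
label_ids = [0, 0, 2, 1, 0, 6, 0, 3, 4]
print(decode_slots(tokens, label_ids))
```

Stray I- tags without a preceding B- tag are dropped here; a different policy (for example, treating them as span starts) may suit some evaluation setups better.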
Provider:
FanChen0116
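Summary of original information
Dataset overview
Dataset information
Features
- id: data type int64.
- tokens: sequence of string.
- labels: sequence of class labels with the following names:
  - 0: O
  - 1: I-from_location
  - 2: B-from_location
  - 3: B-leaving_date
  - 4: I-leaving_date
  - 5: I-to_location
  - 6: B-to_location
- request_slot: sequence of string.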
Data splits
- train: 1400 examples, 345681 bytes.
- validation: 35 examples, 6900 bytes.
- test: 377 examples, 70618 bytes.
Data size
- Download size: 45026 bytes.
- Dataset size: 423199 bytes.
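As a quick consistency check on the figures above, the per-split byte counts can be summed and compared with the reported dataset size. The sketch below uses only the numbers listed in the card; the dictionary layout and variable names are illustrative.

```python
# Split statistics copied from the dataset card.
splits = {
    "train": {"num_bytes": 345681, "num_examples": 1400},
    "validation": {"num_bytes": 6900, "num_examples": 35},
    "test": {"num_bytes": 70618, "num_examples": 377},
}
REPORTED_DATASET_SIZE = 423199  # dataset_size from the card, in bytes

total_bytes = sum(info["num_bytes"] for info in splits.values())
total_examples = sum(info["num_examples"] for info in splits.values())

# The split byte counts should add up to the reported dataset size.
print(total_bytes == REPORTED_DATASET_SIZE)

# Share of examples per split.
for name, info in splits.items():
    share = info["num_examples"] / total_examples
    print(f"{name}: {info['num_examples']} examples ({share:.1%})")
```

The download size (45026 bytes) is much smaller than the dataset size, presumably because the hosted files are compressed.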



