asas-ai/Arabic-Dataset-for-Commonsense-Validationion
收藏Hugging Face2024-05-05 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/asas-ai/Arabic-Dataset-for-Commonsense-Validationion
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: id
dtype: int64
- name: sent1
dtype: string
- name: sent2
dtype: string
- name: label
dtype: int64
splits:
- name: train
num_bytes: 1421331
num_examples: 10000
- name: dev
num_bytes: 134510
num_examples: 1000
download_size: 840782
dataset_size: 1555841
license: cc0-1.0
task_categories:
- text-classification
language:
- ar
pretty_name: Commonsense-Validationion
size_categories:
- 10K<n<100K
tags:
- Commonsense Validationion
---
# Dataset Card for "Arabic-Dataset-for-Commonsense-Validationion"
## Paper:
Tawalbeh, Saja, and Mohammad Al-Smadi. "Is this sentence valid? an arabic dataset for commonsense validation." arXiv preprint arXiv:2008.10873 (2020).
提供机构:
asas-ai
原始信息汇总
数据集概述
基本信息
- 名称: Arabic-Dataset-for-Commonsense-Validationion
- 语言: 阿拉伯语 (ar)
- 许可证: CC0-1.0
数据集结构
- 特征:
- id: 整数 (int64)
- sent1: 字符串
- sent2: 字符串
- label: 整数 (int64)
数据集划分
- 训练集:
- 示例数量: 10000
- 字节数: 1421331
- 验证集:
- 示例数量: 1000
- 字节数: 134510
数据集大小
- 下载大小: 840782 字节
- 总大小: 1555841 字节
任务类别
- 文本分类
数据集标签
- Commonsense Validationion
数据集规模
- 10K<n<100K



