ai4bharat/IndicXParaphrase-Translated
收藏Hugging Face2024-01-03 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ai4bharat/IndicXParaphrase-Translated
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: english
dtype: string
- name: sentence1
dtype: string
- name: sentence2
dtype: string
- name: label
dtype:
class_label:
names:
'0': '0'
'1': '1'
- name: itv2 as sentence1
dtype: string
- name: itv2 as sentence2
dtype: string
- name: itv2 bn sentence1
dtype: string
- name: itv2 bn sentence2
dtype: string
- name: itv2 gu sentence1
dtype: string
- name: itv2 gu sentence2
dtype: string
- name: itv2 hi sentence1
dtype: string
- name: itv2 hi sentence2
dtype: string
- name: itv2 kn sentence1
dtype: string
- name: itv2 kn sentence2
dtype: string
- name: itv2 ml sentence1
dtype: string
- name: itv2 ml sentence2
dtype: string
- name: itv2 mr sentence1
dtype: string
- name: itv2 mr sentence2
dtype: string
- name: itv2 or sentence1
dtype: string
- name: itv2 or sentence2
dtype: string
- name: itv2 pa sentence1
dtype: string
- name: itv2 pa sentence2
dtype: string
- name: itv2 te sentence1
dtype: string
- name: itv2 te sentence2
dtype: string
splits:
- name: test
num_bytes: 5457704
num_examples: 2002
download_size: 2021259
dataset_size: 5457704
---
# Dataset Card for "indic-para"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
ai4bharat
原始信息汇总
数据集概述
特征信息
- english: 字符串类型
- sentence1: 字符串类型
- sentence2: 字符串类型
- label: 类别标签,包含两个类别:0 和 1
- itv2 as sentence1: 字符串类型
- itv2 as sentence2: 字符串类型
- itv2 bn sentence1: 字符串类型
- itv2 bn sentence2: 字符串类型
- itv2 gu sentence1: 字符串类型
- itv2 gu sentence2: 字符串类型
- itv2 hi sentence1: 字符串类型
- itv2 hi sentence2: 字符串类型
- itv2 kn sentence1: 字符串类型
- itv2 kn sentence2: 字符串类型
- itv2 ml sentence1: 字符串类型
- itv2 ml sentence2: 字符串类型
- itv2 mr sentence1: 字符串类型
- itv2 mr sentence2: 字符串类型
- itv2 or sentence1: 字符串类型
- itv2 or sentence2: 字符串类型
- itv2 pa sentence1: 字符串类型
- itv2 pa sentence2: 字符串类型
- itv2 te sentence1: 字符串类型
- itv2 te sentence2: 字符串类型
数据分割
- test: 包含 2002 个样本,数据大小为 5457704 字节
数据集大小
- 下载大小: 2021259 字节
- 数据集大小: 5457704 字节



