Vipplav/tel-input-target-pairs-dataset

Name: Vipplav/tel-input-target-pairs-dataset
Creator: Vipplav
Published: 2024-07-20 11:59:38
License: 暂无描述

Hugging Face2024-07-20 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/Vipplav/tel-input-target-pairs-dataset

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含泰卢固语（te）的文本生成任务数据。数据集的特征包括原始句子（original_sentence）、输入ID（input_ids）、目标ID（target_ids）、输入标记（input_tokens）和目标标记（target_tokens）。数据集的分割为训练集（train），包含6,857,784个示例，大小为4,856,609,836字节。数据集的下载大小为1,999,709,103字节，数据集大小为4,856,609,836字节。数据集的许可证为MIT，任务类别为文本生成，规模类别为1M<n<10M。

This dataset is primarily used for text generation tasks, featuring Telugu original sentences, input IDs, target IDs, input tokens, and target tokens. The dataset is divided into a training set, containing 6,857,784 samples and 4,856,609,836 bytes. The dataset size is between 1M and 10M, and it follows the MIT license.

提供机构：

Vipplav

5,000+

优质数据集

54 个

任务类型

进入经典数据集