Lots-of-LoRAs/task427_hindienglish_corpora_hi-en_language_identification

Name: Lots-of-LoRAs/task427_hindienglish_corpora_hi-en_language_identification
Creator: Lots-of-LoRAs
Published: 2024-12-30 23:29:54
License: No description provided

Source: Hugging Face; updated 2024-12-30; indexed 2025-04-12

Download link:

https://hf-mirror.com/datasets/Lots-of-LoRAs/task427_hindienglish_corpora_hi-en_language_identification

Description:

This is a crowdsourced text dataset for Hindi-English language identification. It includes training, validation, and test splits totaling 6,499 samples. The data is stored as plain text, and each sample has three fields: input text, output text, and a sample ID.
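As a rough illustration of working with the three fields described above, the sketch below checks records for the expected fields and tallies the output labels. The column names `input`, `output`, and `id` are assumptions based on the description, not confirmed here, and the helper names (`missing_fields`, `label_counts`, `load_splits`) are hypothetical. Loading relies on the Hugging Face `datasets` library and network access, so that call is kept in a separate function.

```python
from collections import Counter
from typing import Any, Iterable, List, Mapping

# Repository id as it appears in the download link.
DATASET_ID = "Lots-of-LoRAs/task427_hindienglish_corpora_hi-en_language_identification"

# Assumed column names for the three described fields; verify against the real data.
REQUIRED_FIELDS = ("input", "output", "id")


def missing_fields(record: Mapping[str, Any]) -> List[str]:
    """Return the assumed field names that are absent from a record."""
    return [name for name in REQUIRED_FIELDS if name not in record]


def normalize_label(value: Any) -> str:
    """Turn an output value into a string; a list is reduced to its first item."""
    if isinstance(value, (list, tuple)):
        return str(value[0]) if value else ""
    return str(value)


def label_counts(records: Iterable[Mapping[str, Any]]) -> Counter:
    """Count how often each output label occurs, skipping records without one."""
    return Counter(
        normalize_label(record["output"]) for record in records if "output" in record
    )


def load_splits():
    """Download all splits. Needs `pip install datasets` and network access."""
    from datasets import load_dataset  # imported lazily so the helpers work offline

    return load_dataset(DATASET_ID)
```

With the library installed, `load_splits()` should return a mapping from split name to dataset, and `label_counts(split)` can then be applied to each split to see how the labels are distributed.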

Provider:

Lots-of-LoRAs
