behbudiy/alpaca-cleaned-uz

Name: behbudiy/alpaca-cleaned-uz
Creator: behbudiy
Published: 2024-09-17 08:58:44
License: 暂无描述

Hugging Face2024-09-17 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/behbudiy/alpaca-cleaned-uz

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: instruction dtype: string - name: input dtype: string - name: output dtype: string splits: - name: train num_bytes: 44358425 num_examples: 51760 download_size: 25083635 dataset_size: 44358425 configs: - config_name: default data_files: - split: train path: data/train-* --- ### Dataset Summary This dataset is a translation of the yahma/alpaca-cleaned dataset into Uzbek, leveraging the Google Translate API. The original dataset is a cleaned version of the Stanford Alpaca dataset, which contains instruction-following data for fine-tuning large language models. The cleaned version improves upon the original Alpaca dataset by removing low-quality data and inconsistencies in formatting, which helps enhance the quality and robustness of models trained on it.

提供机构：

behbudiy

5,000+

优质数据集

54 个

任务类型

进入经典数据集