five

saillab/alpaca-english-cleaned

收藏
Hugging Face2024-09-20 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/saillab/alpaca-english-cleaned
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en pretty_name: English alpaca-52k size_categories: - 100K<n<1M --- This repository contains the dataset used for the TaCo paper. Please refer to the paper for more details: [OpenReview](https://openreview.net/forum?id=02MLWBj8HP) If you have used our dataset, please cite it as follows: **Citation** ``` @inproceedings{upadhayay2024taco, title={TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in {LLM}s through Translation-Assisted Chain-of-Thought Processes}, author={Bibek Upadhayay and Vahid Behzadan}, booktitle={5th Workshop on practical ML for limited/low resource settings, ICLR}, year={2024}, url={https://openreview.net/forum?id=02MLWBj8HP} } ``` The original dataset [(Alpaca-52K)](https://github.com/tatsu-lab/stanford_alpaca?tab=readme-ov-file#data-release) was translated using Google Translate. **Copyright and Intended Use** This dataset has been released under CC BY-NC, intended for academic and research purposes only. Please review the licenses and terms and conditions of Alpaca-52K, Dolly-15K, and Google Cloud Translation before using this dataset for any purpose other than research.

语言: - 英语 展示名称:English Alpaca-52K 规模分类: - 10万 < 样本量 < 100万 本仓库包含用于TaCo论文的数据集。 如需了解更多细节,请参阅该论文:[OpenReview](https://openreview.net/forum?id=02MLWBj8HP) 若您使用了本数据集,请按以下格式引用: **引用格式** @inproceedings{upadhayay2024taco, title={TaCo: 通过翻译辅助思维链流程提升大语言模型(LLM)在低资源语言上的跨语言迁移能力}, author={Bibek Upadhayay and Vahid Behzadan}, booktitle={第五届低资源/有限资源场景下实用机器学习研讨会,ICLR}, year={2024}, url={https://openreview.net/forum?id=02MLWBj8HP} } 原始数据集[(Alpaca-52K)](https://github.com/tatsu-lab/stanford_alpaca?tab=readme-ov-file#data-release) 已通过谷歌翻译(Google Translate)完成翻译。 **版权与使用规范** 本数据集采用CC BY-NC协议发布,仅可用于学术与研究用途。若您将本数据集用于研究以外的场景,请务必先查阅Alpaca-52K、Dolly-15K以及谷歌云翻译(Google Cloud Translation)的许可协议与相关条款。
提供机构:
saillab
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作