XeTute/Pakistan-China-Alpaca
收藏Hugging Face2025-02-06 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/XeTute/Pakistan-China-Alpaca
下载链接
链接失效反馈官方服务:
资源简介:
本数据集是一个经过精心挑选的文本样本集合,专注于亚洲的各种方面,包括科学、技术、工程和数学(STEM)、因果关系问答、巴基斯坦文化和中国文化。该数据集由131个样本组成,全部为人工生成,旨在用于微调基础语言模型。建议将该数据集与其他更广泛的语料库结合使用,以实现模型的全面适应。
This dataset provides a curated collection of text samples focused on various aspects of Asia, including Science, Technology, Engineering, and Mathematics (STEM), Causal Question & Answering, Pakistani Culture, and Chinese Culture. Comprising 131 artificially generated samples, this dataset is intended for fine-tuning foundational language models and has been used in the partial training of the Intellect V0.2 1.6B model. It is recommended to use this dataset in conjunction with broader corpora for well-rounded model adaptation.
提供机构:
XeTute



