Kasimyildirim/DataSynthesis
Source: Hugging Face. Updated 2024-12-13; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/Kasimyildirim/DataSynthesis
Overview:
This document summarizes the features and use cases of the following datasets:
1. anthracite-org/kalo-opus-instruct-22k-no-refusal: More than 22,000 instruction-response pairs for training and evaluation, provided in JSON or Parquet format and suited to natural language processing (NLP) and machine learning projects.
2. Magpie-Align/Magpie-Pro-300K-Filtered: More than 300,000 high-quality, filtered data samples optimized for language models, provided in Parquet format and suited to language model training and validation.
3. mlabonne/FineTome-100k: 100,000 text samples for training and testing, provided in Parquet format and suited to text classification and NLP projects.
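The three source repositories listed above can be inspected before committing to a full download. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that each repository exposes a split named `train` (the listing does not confirm split names); the `SOURCE_DATASETS` keys and the `preview` helper are hypothetical names introduced for illustration.

```python
from itertools import islice
from typing import Dict, Iterator

# Repository IDs exactly as given in the listing; the short keys are illustrative.
SOURCE_DATASETS: Dict[str, str] = {
    "kalo-opus": "anthracite-org/kalo-opus-instruct-22k-no-refusal",
    "magpie-pro": "Magpie-Align/Magpie-Pro-300K-Filtered",
    "finetome": "mlabonne/FineTome-100k",
}


def preview(repo_id: str, n: int = 3, split: str = "train") -> Iterator[dict]:
    """Yield the first n records of a Hub dataset using streaming mode.

    Assumption: the repository has a split called `split`; adjust if it does not.
    """
    # Imported lazily so this module loads even when `datasets` is not installed.
    from datasets import load_dataset

    # streaming=True returns an iterable dataset and avoids downloading all files.
    stream = load_dataset(repo_id, split=split, streaming=True)
    yield from islice(stream, n)
```

With network access, `next(preview(SOURCE_DATASETS["finetome"], n=1))` would return one record as a dictionary; the field names depend on each repository and are not stated in this listing.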
Provider:
Kasimyildirim



