mlabonne/smoltalk-flat
收藏Hugging Face2024-11-21 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/mlabonne/smoltalk-flat
下载链接
链接失效反馈官方服务:
资源简介:
smoltalk-flat数据集是HuggingFaceTB/smoltalk数据集的一个扁平化版本,旨在解决与大多数微调框架的兼容性问题。数据集包含messages特征,其中每个消息包含content和role字段,以及一个source字段。数据集分为训练集和测试集,分别包含大量数据和示例。
This dataset is a flattened version of the HuggingFaceTB/smoltalk dataset, primarily for compatibility issues with most fine-tuning frameworks. The dataset includes message content and role information, as well as the source of the data. It is divided into training and test sets, containing 1043917 and 54948 samples respectively. The dataset size is 4236145754 bytes, with a download size of 2110004410 bytes.
提供机构:
mlabonne



