Emm9625/textwork-00-dedupe-optimal_threshold
收藏Hugging Face2025-01-18 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Emm9625/textwork-00-dedupe-optimal_threshold
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了三个子数据集:smol-constraints、smol-rewrite和smol-summarize。每个子数据集都由训练集和测试集组成,包含文本内容和对应的角色信息。smol-constraints子数据集的训练集有18479个示例,测试集有1420个示例;smol-rewrite子数据集的训练集有39994个示例,测试集有2641个示例;smol-summarize子数据集的训练集有70404个示例,测试集有4635个示例。
The dataset consists of three sub-datasets: smol-constraints, smol-rewrite, and smol-summarize, each of which includes a training set and a test set with text content and corresponding role information. The smol-constraints sub-dataset has 18,479 examples in the training set and 1,420 examples in the test set; the smol-rewrite sub-dataset has 39,994 examples in the training set and 2,641 examples in the test set; the smol-summarize sub-dataset has 70,404 examples in the training set and 4,635 examples in the test set.
提供机构:
Emm9625



