argilla/synthetic-text-classification-news-multi-label
Source: Hugging Face. Updated 2024-12-11; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/argilla/synthetic-text-classification-news-multi-label
Resource summary:
This is a multi-label text classification dataset for news categorization. It contains 100 training examples, each consisting of a text and its corresponding labels. The label categories are sports, politics, science, world-news, tech, entertainment, and business. The dataset was generated with distilabel and ships with a pipeline.yaml file for reproducing the generation process.
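Because each example can carry several of the seven categories at once, a common first step is to turn the label list into a multi-hot vector. The following is a minimal sketch of that step, not part of the dataset itself: the column names `text` and `labels`, the label ordering, and the helper names `multi_hot` and `load_train_split` are assumptions made for illustration. The loader relies on the third-party `datasets` library and needs network access, so it is defined but not called here.

```python
from typing import Iterable, List

# Label names as listed in the summary above. Their order here is an
# arbitrary choice for this sketch, not something the dataset defines.
LABELS = [
    "sports",
    "politics",
    "science",
    "world-news",
    "tech",
    "entertainment",
    "business",
]
LABEL_TO_INDEX = {name: i for i, name in enumerate(LABELS)}


def multi_hot(labels: Iterable[str]) -> List[int]:
    """Turn a list of label names into a 0/1 vector with one slot per category."""
    vector = [0] * len(LABELS)
    for name in labels:
        vector[LABEL_TO_INDEX[name]] = 1  # raises KeyError on an unknown label
    return vector


def load_train_split():
    """Download the train split from the Hugging Face Hub (needs network access)."""
    from datasets import load_dataset  # third-party: pip install datasets

    return load_dataset(
        "argilla/synthetic-text-classification-news-multi-label",
        split="train",
    )


if __name__ == "__main__":
    # Hypothetical record; the column names "text" and "labels" are assumed.
    example = {
        "text": "Chipmaker shares jump after earnings beat.",
        "labels": ["tech", "business"],
    }
    print(multi_hot(example["labels"]))
```

If the real column names differ, only the dictionary keys in the usage part need to change; the encoding itself depends only on the seven label strings.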
Provided by:
argilla



