
freddiezhang/honordata

Source: Hugging Face. Updated: 2022-12-19. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/freddiezhang/honordata
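Summary:
This is an AutoTrain-processed dataset for the project "honor", intended mainly for text classification. The data is in English, and each record has a text field, a target label, and a metadata field. The dataset is split into a training set of 3212 samples and a validation set of 804 samples.
Provided by: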
freddiezhang
Original dataset card

AutoTrain Dataset for project: honor

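Dataset Description

This dataset has been automatically processed by AutoTrain for the project "honor".

Languages

The BCP-47 code for the dataset's language is en.

Dataset Structure

Data Instances

A sample from this dataset looks as follows: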

[
  {
    "text": "Kimchi (kimchee) is a Korean dish which is well known throughout the world. It is a spicy, tangy and pungent food that contains pickled vegetables. The word \"Kimchi\" comes from the Korean \"Kim\" meaning \"turn\" and \"Chi\" meaning \"sauce\".\n\nKimchi consists of vegetables which are salted, fermented and seasoned. It is an important part of the Korean diet. The two main methods of preparing Kimchi are fermentation and salting. Fermented Kimchi is made by mixing cabbage, radish and other vegetables with a specific kind of salt and sugar. Salted Kimchi is made by mixing cabbage, radish and other vegetables with a specific amount of salt and some vinegar.\n\nThe standard vegetables used in preparing Kimchi include cabbage, radish, turnip and Chinese cabbage. However, there are many different variations of Kimchi. Some of the variations include Kimchi with beef, Kimchi with fish and Kimchi with soybean paste.\n\nThe preparation of Kimchi is considered to be an important part of Korean culture. It is prepared in a ritualistic manner. The Korean culture also consider it as a \"doorway\" to a familys hearth.",
    "target": 1,
    "feat_meta.pile_set_name": "GPT-3"
  },
  {
    "text": "So how did you survive the terrible British summer of 2015? (Mine was miserable. There were too many weekends at home in the garden, thats all I can say.) Well, its a new year and a new season of Doctor Who, with Peter Capaldi as our time-travelling hero.\n\nHeres the first photo of Capaldi in costume:\n\nAnd heres how it all begins...\n\nThis story is called The Magicians Apprentice and features Missy (the Master, if you didnt know).\n\nAnd heres a trailer:\n\nAll we can say is: A spooky church? The Doctor having to answer questions about his mistakes? Yes, please! We cant wait to see more.\n\nDoctor Who series 9 begins on Saturday 19 September on BBC One.",
    "target": 1,
    "feat_meta.pile_set_name": "GPT-3"
  }
]

Dataset Fields

The dataset has the following fields:

{
  "text": "Value(dtype=string, id=None)",
  "target": "ClassLabel(names=[human, machine], id=None)",
  "feat_meta.pile_set_name": "Value(dtype=string, id=None)"
}
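The field listing implies that `target` is an integer index into the label names, so 0 would correspond to human and 1 to machine. A minimal sketch of decoding a record under that assumption follows; `decode_record`, `LABEL_NAMES`, and `EXPECTED_FIELDS` are hypothetical names introduced for illustration, and the sample is the first record above with its text shortened to one sentence.

```python
import json

# Label order taken from the card's ClassLabel(names=[human, machine]);
# assumed mapping: index 0 -> "human", index 1 -> "machine".
LABEL_NAMES = ["human", "machine"]

# Field names as listed in the card.
EXPECTED_FIELDS = {"text", "target", "feat_meta.pile_set_name"}


def decode_record(raw: str) -> dict:
    """Parse one JSON record and attach a readable label (hypothetical helper)."""
    record = json.loads(raw)
    missing = EXPECTED_FIELDS - record.keys()
    if missing:
        raise ValueError(f"record is missing fields: {sorted(missing)}")
    target = record["target"]
    if not isinstance(target, int) or not 0 <= target < len(LABEL_NAMES):
        raise ValueError(f"target out of range: {target!r}")
    # Return a copy of the record with the decoded label added.
    return {**record, "label": LABEL_NAMES[target]}


# Abbreviated version of the first sample record (text cut to its first sentence).
sample = json.dumps({
    "text": "Kimchi (kimchee) is a Korean dish which is well known throughout the world.",
    "target": 1,
    "feat_meta.pile_set_name": "GPT-3",
})

decoded = decode_record(sample)
print(decoded["label"])  # machine
```

Both sample records carry `target` 1, which this mapping reads as machine-written, in line with their `feat_meta.pile_set_name` value of "GPT-3".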

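Dataset Splits

The dataset is split into a training set and a validation set. The split sizes are as follows:

Split name    Number of samples
Training      3212
Validation    804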
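As a quick check on the split sizes, the sketch below computes the share of each split from the two counts given above. The dictionary keys `train` and `validation` are illustrative names, not necessarily the split identifiers used in the hosted dataset.

```python
# Sample counts as stated in the card; the key names are assumptions.
SPLIT_SIZES = {"train": 3212, "validation": 804}

total = sum(SPLIT_SIZES.values())
fractions = {name: count / total for name, count in SPLIT_SIZES.items()}

for name, count in SPLIT_SIZES.items():
    print(f"{name}: {count} samples ({fractions[name]:.1%})")
print(f"total: {total} samples")
```

The counts sum to 4016 samples, which works out to roughly an 80/20 split between training and validation.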