five

j1song/my-first-dataset

收藏
Hugging Face2024-07-15 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/j1song/my-first-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是通过Argilla创建的,包含一个配置文件`argilla.yaml`和与HuggingFace `datasets`库兼容的记录。数据集主要用于文本分类任务,包含文本字段和标签字段,标签字段的允许值为[joy, sadness]。数据集的结构包括字段、问题、建议、元数据和注释指南。数据集的加载可以通过Argilla或`datasets`库完成。

This dataset has been created with Argilla and contains a configuration file `argilla.yaml` and records compatible with HuggingFace `datasets`. The dataset is primarily used for text classification tasks, containing text fields and label fields with allowed values [joy, sadness]. The dataset structure includes fields, questions, suggestions, metadata, and annotation guidelines. The dataset can be loaded using Argilla or the `datasets` library.
提供机构:
j1song
原始信息汇总

Dataset Card for my-first-dataset

Dataset Description

Dataset Summary

  • Configuration File: argilla.yaml
  • Records Format: Compatible with HuggingFace datasets
  • Annotation Guidelines: Available if defined in Argilla

Supported Tasks and Leaderboards

  • Tasks: Multiple NLP tasks depending on configuration
  • Leaderboards: None

Languages

  • Languages: [More Information Needed]

Dataset Structure

Data in Argilla

  • Fields:
    • text: Type text, Required
  • Questions:
    • label: Type label_selection, Required, Values: [joy, sadness]
  • Suggestions:
    • label-suggestion: Type label_selection, Optional, Values: [joy, sadness]
  • Metadata: Optional, additional information about dataset records
  • Guidelines: Optional, instructions for annotators

Data Instances

  • Argilla Format: json { "external_id": null, "fields": { "text": "I am so happy today" }, "metadata": {}, "responses": [ { "status": "submitted", "user_id": "d57588b3-43f8-41a5-b38e-8961cc64b1fc", "values": { "label": { "value": "joy" } } } ], "suggestions": [], "vectors": {} }

  • HuggingFace datasets Format: json { "external_id": null, "label": [ { "status": "submitted", "user_id": "d57588b3-43f8-41a5-b38e-8961cc64b1fc", "value": "joy" } ], "label-suggestion": null, "label-suggestion-metadata": { "agent": null, "score": null, "type": null }, "metadata": "{}", "text": "I am so happy today" }

Data Fields

  • Fields:
    • text: Type text
  • Questions:
    • label: Type label_selection, Values: [joy, sadness]
  • Suggestions:
    • label-suggestion: Type label_selection, Values: [joy, sadness]
  • Metadata: Optional, additional information
  • external_id: Optional, external ID for dataset record

Data Splits

  • Splits: train

Dataset Creation

Curation Rationale

  • Curation Rationale: [More Information Needed]

Source Data

  • Initial Data Collection and Normalization: [More Information Needed]
  • Source Language Producers: [More Information Needed]

Annotations

  • Annotation Guidelines: Text classification dataset, classify texts by selecting the correct label.
  • Annotation Process: [More Information Needed]
  • Annotators: [More Information Needed]

Personal and Sensitive Information

  • Personal and Sensitive Information: [More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

  • Social Impact: [More Information Needed]

Discussion of Biases

  • Biases: [More Information Needed]

Other Known Limitations

  • Limitations: [More Information Needed]

Additional Information

Dataset Curators

  • Curators: [More Information Needed]

Licensing Information

  • Licensing: [More Information Needed]

Citation Information

  • Citation: [More Information Needed]

Contributions

  • Contributions: [More Information Needed]
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作