
xiyuez/red-dot-design-award-product-description

Hugging Face, updated 2023-07-07, indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/xiyuez/red-dot-design-award-product-description
Resource description:
---
license: odc-by
task_categories:
- text-generation
language:
- en
pretty_name: Red Dot Design Award Dataset
size_categories:
- 10k<n<100K
---

# Red Dot Design Award Dataset

This dataset contains information about the products that have won the Red Dot Design Award, a prestigious international design competition. The data was extracted from the official website of the award: <https://www.red-dot.org/>.

## Task

The task for this dataset is text generation, specifically product description generation. Given a product name and category, the goal is to generate a concise and informative description that highlights the features and benefits of the product.

## Limitations

The dataset may have some limitations, such as:

- The data may contain false or outdated information, as it reflects the information available on the website at the time of extraction.
- The data only covers the products that have won the award, which may introduce some selection bias or limit the diversity of the data.
- The data is only in English, although the website also has a German version that could be crawled in the future.
- The data does not include any images of the products, which could be useful for multimodal language models. Images are planned to be scraped in the future.

## License

This public extract is licensed under the Open Data Commons Attribution License: <http://opendatacommons.org/licenses/by/1.0/>.

## Data Format

The dataset consists of 21183 unique rows, each containing the following columns:

- `product`: The name of the product that won the award.
- `category`: The category of the product, such as "Video Camera", "Bathroom Shelf", or "Mobile Home".
- `description`: A short paragraph describing the product, its features, and its benefits.

There is no predefined train/test split for this dataset. Near-duplicates have been removed.

## Data Quality

The data quality may vary depending on the source and accuracy of the information on the website. We have not verified, filtered, or modified the data in any way. The data may contain content that is toxic, biased, copyrighted, or false. Use of this dataset is at your own risk. We do not provide any warranties or liability.

## Acknowledgements

We would like to acknowledge the Red Dot Design Award for hosting and maintaining the website that provided the data for this dataset. We do not claim any ownership or affiliation with the award or the website.
Provider:
xiyuez
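Summary of Original Information

Red Dot Design Award Dataset Overview

Basic Information

  • License: odc-by
  • Task category: text generation
  • Language: English
  • Dataset name: Red Dot Design Award Dataset
  • Dataset size: 10k<n<100K

Dataset Description

This dataset contains information about products that have won the Red Dot Design Award, an internationally renowned design competition. The data was extracted from the award's official website.

Task

The task for this dataset is text generation, specifically product description generation. Given a product name and category, the goal is to generate a concise and informative description that highlights the product's features and benefits.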
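
As an illustration of the task framing, the sketch below turns one row into a (prompt, target) pair for description generation. The prompt template and the function names `build_prompt` and `to_training_pair` are assumptions made for this example and are not part of the dataset; only the column names `product`, `category`, and `description` come from the dataset card.

```python
def build_prompt(product: str, category: str) -> str:
    """Format a row's inputs as a generation prompt.

    The template is a hypothetical choice for illustration; the dataset
    itself does not prescribe any prompt format.
    """
    return (
        f"Product: {product.strip()}\n"
        f"Category: {category.strip()}\n"
        "Description:"
    )


def to_training_pair(row: dict) -> tuple[str, str]:
    """Map one row (product, category, description) to (prompt, target)."""
    prompt = build_prompt(row["product"], row["category"])
    target = row["description"].strip()
    return prompt, target


if __name__ == "__main__":
    # Placeholder row invented for this example, not taken from the dataset.
    example_row = {
        "product": "Example Cam",
        "category": "Video Camera",
        "description": "A compact camera with a simple control layout.",
    }
    prompt, target = to_training_pair(example_row)
    print(prompt)
    print(target)
```

Keeping the prompt builder separate from the pairing step means the same template can be reused at inference time, when only the product name and category are available.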

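Data Format

  • Rows: 21183
  • Columns:
    • product: name of the award-winning product
    • category: product category
    • description: product description, including features and benefits
  • Data split: no predefined train/test split
  • Deduplication: near-duplicates have been removed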
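
Because the dataset ships without a predefined train/test split, users have to create their own. One possible approach, sketched below, assigns each row to a split by hashing its `product` and `category` values, so the assignment stays stable across runs and machines. The function name `assign_split`, the 10% test fraction, and the hashing scheme are assumptions for this example, not something the dataset authors specify.

```python
import hashlib


def assign_split(row: dict, test_fraction: float = 0.1) -> str:
    """Deterministically assign a row to "train" or "test".

    The row's product and category are hashed with SHA-256 and the first
    8 bytes are mapped to a number in [0, 1). Rows whose number falls
    below test_fraction go to the test split. This is an illustrative
    scheme, not a split defined by the dataset.
    """
    key = f"{row['product']}\x1f{row['category']}".encode("utf-8")
    digest = hashlib.sha256(key).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return "test" if bucket < test_fraction else "train"


if __name__ == "__main__":
    # Placeholder rows invented for this example.
    rows = [
        {"product": f"Example Product {i}", "category": "Bathroom Shelf"}
        for i in range(1000)
    ]
    counts = {"train": 0, "test": 0}
    for r in rows:
        counts[assign_split(r)] += 1
    print(counts)
```

Hashing the row content instead of shuffling with a random seed means a given product always lands in the same split, even if the rows are reordered or the dataset is re-downloaded.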

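Data Limitations

  • The data may contain false or outdated information.
  • The data covers only award-winning products, which may introduce selection bias.
  • The data is available only in English.
  • The data does not include product images.

Data Quality

Data quality may vary with the source and the accuracy of the information on the website. The data has not been verified, filtered, or modified, and may contain toxic, biased, copyrighted, or false content. Use of this dataset is at your own risk.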
