five

ptx0/not-a-hotdog

收藏
Hugging Face2024-05-14 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/ptx0/not-a-hotdog
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit dataset_info: features: - name: filename # Example: id dtype: string # Example: int32 - name: image # Example: text dtype: image # Example: string --- # Not a Hot-dog _(The name of this dataset is a reference to the show Silicon Valley.)_ ## Dataset Details ### Dataset Description This dataset is a small collection of user-submitted images that contain objects that are not hot dogs, but might be perceptibly shaped like one. - **Curated by:** Public users - **Language(s) (NLP):** None (no captions included) - **License:** MIT ### Dataset Sources The images provided were submitted by random internet users on Reddit. ## Uses This dataset may be used to train safety checking neural networks, or low-rank adaptation networks that might be useful for a funny joke or two. ### Direct Use This dataset does not have any captions or phrases supplied with it. The `image` column contains a byte string of the JPEG data as read from storage. ### Out-of-Scope Use This data should not be used to generate offensive content. ## Dataset Structure Fields: - filename (str) - image hash (str) - width (int) - height (int) - image (bytes, JPEG) ## Dataset Creation ### Curation Rationale A LoRA exists that seems to use a similar dataset, but the dataset was not provided or available at any point in time. This dataset is an attempt to reproduce the same results. ### Source Data - User-submitted photographs #### Personal and Sensitive Information Some images may contain faces or identities of individuals. By using this dataset, you agree not to attempt to discover the identity of these people. ## Bias, Risks, and Limitations This dataset's bias is that of its users', as all images were hand-selected for inclusion.
提供机构:
ptx0
原始信息汇总

Not a Hot-dog 数据集概述

数据集描述

该数据集是一个由用户提交的小型图像集合,包含的物体不是热狗,但可能看起来像热狗。

  • 数据集来源: 公共用户
  • 语言(NLP): 无(不包含标题)
  • 许可证: MIT

数据集来源

图像由Reddit上的随机互联网用户提交。

用途

该数据集可用于训练安全检查神经网络或低秩适应网络,可能用于一些有趣的笑话。

直接用途

该数据集不包含任何标题或短语。

image列包含从存储中读取的JPEG数据的字节字符串。

超出范围的用途

该数据不应被用于生成冒犯性内容。

数据集结构

字段:

  • filename (字符串)
  • image hash (字符串)
  • width (整数)
  • height (整数)
  • image (字节, JPEG)

数据集创建

筛选理由

存在一个似乎使用类似数据集的LoRA,但该数据集在任何时间点都不可用。此数据集旨在重现相同的结果。

源数据

  • 用户提交的照片

个人和敏感信息

一些图像可能包含个人面孔或身份。使用此数据集时,您同意不试图发现这些人的身份。

偏见、风险和限制

该数据集的偏见是其用户的选择偏见,因为所有图像都是手动选择以包含在内。

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作