ptx0/not-a-hotdog
收藏Hugging Face2024-05-14 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/ptx0/not-a-hotdog
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
features:
- name: filename # Example: id
dtype: string # Example: int32
- name: image # Example: text
dtype: image # Example: string
---
# Not a Hot-dog
_(The name of this dataset is a reference to the show Silicon Valley.)_
## Dataset Details
### Dataset Description
This dataset is a small collection of user-submitted images that contain objects that are not hot dogs, but might be perceptibly shaped like one.
- **Curated by:** Public users
- **Language(s) (NLP):** None (no captions included)
- **License:** MIT
### Dataset Sources
The images provided were submitted by random internet users on Reddit.
## Uses
This dataset may be used to train safety checking neural networks, or low-rank adaptation networks that might be useful for a funny joke or two.
### Direct Use
This dataset does not have any captions or phrases supplied with it.
The `image` column contains a byte string of the JPEG data as read from storage.
### Out-of-Scope Use
This data should not be used to generate offensive content.
## Dataset Structure
Fields:
- filename (str)
- image hash (str)
- width (int)
- height (int)
- image (bytes, JPEG)
## Dataset Creation
### Curation Rationale
A LoRA exists that seems to use a similar dataset, but the dataset was not provided or available at any point in time. This dataset is an attempt to reproduce the same results.
### Source Data
- User-submitted photographs
#### Personal and Sensitive Information
Some images may contain faces or identities of individuals. By using this dataset, you agree not to attempt to discover the identity of these people.
## Bias, Risks, and Limitations
This dataset's bias is that of its users', as all images were hand-selected for inclusion.
提供机构:
ptx0
原始信息汇总
Not a Hot-dog 数据集概述
数据集描述
该数据集是一个由用户提交的小型图像集合,包含的物体不是热狗,但可能看起来像热狗。
- 数据集来源: 公共用户
- 语言(NLP): 无(不包含标题)
- 许可证: MIT
数据集来源
图像由Reddit上的随机互联网用户提交。
用途
该数据集可用于训练安全检查神经网络或低秩适应网络,可能用于一些有趣的笑话。
直接用途
该数据集不包含任何标题或短语。
image列包含从存储中读取的JPEG数据的字节字符串。
超出范围的用途
该数据不应被用于生成冒犯性内容。
数据集结构
字段:
- filename (字符串)
- image hash (字符串)
- width (整数)
- height (整数)
- image (字节, JPEG)
数据集创建
筛选理由
存在一个似乎使用类似数据集的LoRA,但该数据集在任何时间点都不可用。此数据集旨在重现相同的结果。
源数据
- 用户提交的照片
个人和敏感信息
一些图像可能包含个人面孔或身份。使用此数据集时,您同意不试图发现这些人的身份。
偏见、风险和限制
该数据集的偏见是其用户的选择偏见,因为所有图像都是手动选择以包含在内。



