rocca/emojis
收藏Hugging Face2022-04-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/rocca/emojis
下载链接
链接失效反馈官方服务:
资源简介:
A collection of 38,176 emoji images from Facebook, Google, Apple, WhatsApp, Samsung, [JoyPixels](https://www.joypixels.com/), Twitter, [emojidex](https://www.emojidex.com/), LG, [OpenMoji](https://openmoji.org/), and Microsoft. It includes all the emojis for these apps/platforms as of early 2022.
* Counts: Facebook=3664, Google=3664, Apple=3961, WhatsApp=3519, Samsung=3752, JoyPixels=3538, Twitter=3544, emojidex=2040, LG=3051, OpenMoji=3512, Microsoft=3931.
* Sizes: Facebook=144x144, Google=144x144, Apple=144x144, WhatsApp=144x144, Samsung=108x108, JoyPixels=144x144, Twitter=144x144, emojidex=144x144, LG=136x128, OpenMoji=144x144, Microsoft=144x144.
* The tar files directly contain the image files (they're not inside a parent folder).
* The emoji code points are at the end of the filename, but there are some adjustments needed to parse them into the Unicode character consistently across all sets of emojis in this dataset. Here's some JavaScript code to convert the file name of an emoji image into the actual Unicode emoji character:
```js
let filename = ...;
let fixedFilename = filename.replace(/(no|light|medium|medium-light|medium-dark|dark)-skin-tone/, "").replace(/__/, "_").replace(/--/, "-");
let emoji = String.fromCodePoint(...fixedFilename.split("_")[1].split(".")[0].split("-").map(hex => parseInt(hex, 16)));
```
## Facebook examples:

## Google examples:

## Apple examples:

## WhatsApp examples:

## Samsung examples:

## JoyPixels examples:

## Twitter examples:

## emojidex examples:

## LG examples:

## OpenMoji examples:

## Microsoft examples:

提供机构:
rocca
原始信息汇总
数据集概述
数据集内容
- 来源:Facebook, Google, Apple, WhatsApp, Samsung, JoyPixels, Twitter, emojidex, LG, OpenMoji, Microsoft
- 总数:38,176个emoji图像
- 时间:截至2022年初
详细信息
- 图像数量与尺寸:
- Facebook: 3664个, 144x144
- Google: 3664个, 144x144
- Apple: 3961个, 144x144
- WhatsApp: 3519个, 144x144
- Samsung: 3752个, 108x108
- JoyPixels: 3538个, 144x144
- Twitter: 3544个, 144x144
- emojidex: 2040个, 144x144
- LG: 3051个, 136x128
- OpenMoji: 3512个, 144x144
- Microsoft: 3931个, 144x144
文件结构
- 存储方式:tar文件直接包含图像文件,无父文件夹
文件名解析
- 文件名格式:文件名末尾包含emoji代码点,需通过特定JavaScript代码转换为Unicode emoji字符
- 转换代码示例: js let filename = ...; let fixedFilename = filename.replace(/(no|light|medium|medium-light|medium-dark|dark)-skin-tone/, "").replace(/__/, "").replace(/--/, "-"); let emoji = String.fromCodePoint(...fixedFilename.split("")[1].split(".")[0].split("-").map(hex => parseInt(hex, 16)));



