iapp/thai_famous_people_images_dataset
收藏Hugging Face2024-10-11 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/iapp/thai_famous_people_images_dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: image
dtype: image
- name: text
dtype: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 1237076986.168
num_examples: 4222
download_size: 1108712631
dataset_size: 1237076986.168
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
license: apache-2.0
task_categories:
- question-answering
language:
- th
- en
tags:
- image
pretty_name: Thai Famous People Images Dataset
size_categories:
- 1K<n<10K
---
# Thai Famous People Image Dataset
## Dataset Description
The Thai Famous People Image Dataset is a collection of images and descriptions of famous Thai personalities. This dataset is designed to provide a comprehensive resource for researchers, developers, and enthusiasts interested in Thai culture, history, and notable figures. The data was extracted from the Thai Wikipedia dump in September 2024, ensuring up-to-date and relevant information.
## Maintainer
Kobkrit Viriyayudhakorn (kobkrit@iapp.co.th)
## License
Apache 2.0
### Key Features:
- Images of famous Thai people
- Textual descriptions of each personality
- Titles or names of the individuals
## Dataset Creation
This dataset was created using a custom script that processes information extracted from the Thai Wikipedia dump of September 2024. The script performs the following steps:
1. Extracts relevant articles and images from the Wikipedia dump
2. Processes and cleans the extracted data
3. Retrieves images from either local files or URLs
4. Combines images with corresponding textual descriptions and titles
5. Creates a Hugging Face Dataset object
6. Pushes the dataset to the Hugging Face Hub
## Dataset Structure
The dataset consists of the following fields:
- `image`: An image of the famous Thai person
- `text`: A textual description of the person extracted from their Wikipedia article
- `title`: The name or title of the person
## Usage
You can use this dataset for various tasks, including but not limited to:
- Image classification of Thai personalities
- Text-to-image or image-to-text generation tasks
- Cultural and historical research on Thai public figures
- Analysis of Thai Wikipedia content
To use the dataset, you can load it using the Hugging Face `datasets` library:
提供机构:
iapp



