
iapp/thai_famous_people_images_dataset

Source: Hugging Face. Updated 2024-10-11. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/iapp/thai_famous_people_images_dataset
Resource description:
---
dataset_info:
  features:
  - name: image
    dtype: image
  - name: text
    dtype: string
  - name: title
    dtype: string
  splits:
  - name: train
    num_bytes: 1237076986.168
    num_examples: 4222
  download_size: 1108712631
  dataset_size: 1237076986.168
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
license: apache-2.0
task_categories:
- question-answering
language:
- th
- en
tags:
- image
pretty_name: Thai Famous People Images Dataset
size_categories:
- 1K<n<10K
---

# Thai Famous People Image Dataset

## Dataset Description

The Thai Famous People Image Dataset is a collection of images and descriptions of famous Thai personalities. This dataset is designed to provide a comprehensive resource for researchers, developers, and enthusiasts interested in Thai culture, history, and notable figures. The data was extracted from the Thai Wikipedia dump in September 2024, ensuring up-to-date and relevant information.

## Maintainer

Kobkrit Viriyayudhakorn (kobkrit@iapp.co.th)

## License

Apache 2.0

### Key Features:

- Images of famous Thai people
- Textual descriptions of each personality
- Titles or names of the individuals

## Dataset Creation

This dataset was created using a custom script that processes information extracted from the Thai Wikipedia dump of September 2024. The script performs the following steps:

1. Extracts relevant articles and images from the Wikipedia dump
2. Processes and cleans the extracted data
3. Retrieves images from either local files or URLs
4. Combines images with corresponding textual descriptions and titles
5. Creates a Hugging Face Dataset object
6. Pushes the dataset to the Hugging Face Hub

## Dataset Structure

The dataset consists of the following fields:

- `image`: An image of the famous Thai person
- `text`: A textual description of the person extracted from their Wikipedia article
- `title`: The name or title of the person

## Usage

You can use this dataset for various tasks, including but not limited to:

- Image classification of Thai personalities
- Text-to-image or image-to-text generation tasks
- Cultural and historical research on Thai public figures
- Analysis of Thai Wikipedia content

To use the dataset, you can load it using the Hugging Face `datasets` library:
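The loading snippet from the original card is not preserved in this copy. The following is a minimal sketch rather than the maintainer's own example: it assumes the repository ID shown in the download link and the `image`, `text`, and `title` fields listed under Dataset Structure. The `summarize_example` and `load_train_split` helpers are hypothetical names introduced here, and `streaming=True` is an illustrative choice to avoid downloading the full archive (about 1.1 GB) up front.

```python
# Repository ID taken from the download link on this page.
REPO_ID = "iapp/thai_famous_people_images_dataset"


def summarize_example(example, max_chars=80):
    """Hypothetical helper: one-line summary built from `title` and `text`."""
    title = example["title"]
    # Collapse newlines and repeated spaces so the summary fits on one line.
    text = " ".join(example["text"].split())
    if len(text) > max_chars:
        text = text[:max_chars] + "..."
    return f"{title}: {text}"


def load_train_split(streaming=True):
    """Load the `train` split (the only split listed in the card metadata)."""
    # Imported here so the helper above works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train", streaming=streaming)


if __name__ == "__main__":
    dataset = load_train_split()
    for index, example in enumerate(dataset):
        # `example["image"]` holds the decoded image for this record.
        print(summarize_example(example))
        if index >= 2:
            break
```

Passing `streaming=False` to `load_train_split` downloads and caches the whole split instead, which is more convenient when random access to all 4,222 examples is needed.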
Provider:
iapp