five

intro/flickr8k

收藏
Hugging Face2026-03-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/intro/flickr8k
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: cc0-1.0 task_categories: - image-to-text - text-generation pretty_name: Flickr8k Captions With Splits size_categories: - 1K<n<10K configs: - config_name: default data_files: - split: train path: train/** - split: dev path: dev/** - split: test path: test/** dataset_info: features: - name: image dtype: image - name: file_name dtype: string - name: split dtype: string - name: caption_0 dtype: string - name: caption_1 dtype: string - name: caption_2 dtype: string - name: caption_3 dtype: string - name: caption_4 dtype: string --- # Flickr8k Captions With Splits This dataset repackages the local Flickr8k image-caption corpus into a Hugging Face `imagefolder` layout with per-split `metadata.csv` files. ## Structure - `train/`, `dev/`, and `test/` each contain image files and a `metadata.csv`. - Each row represents one image with five captions. - The `image` feature is inferred from `file_name` in each `metadata.csv`. ## Columns - `image`: image file loaded by the Hugging Face Datasets library - `file_name`: image filename stored as a string and used by `imagefolder` to map rows to image files - `split`: one of `train`, `dev`, `test` - `caption_0` to `caption_4`: the five raw Flickr8k captions for the image ## License - CC0: Public Domain ## Counts - Total included images: 8000 - Train images: 6000 - Dev images: 1000 - Test images: 1000 - Excluded caption entries without split or image file: 92 ## Source Mapping - Split assignments come from `Flickr_8k.trainImages.txt`, `Flickr_8k.devImages.txt`, and `Flickr_8k.testImages.txt`. - Captions come from `Flickr8k.token.txt`, using `#0` through `#4` as `caption_0` through `caption_4`. ## Notes - The original caption file contains 8,092 image-caption groups. - Only the 8,000 images with explicit split assignments are included in this Hugging Face-ready dataset. - Excluded rows are recorded in `excluded_rows.csv`. - Missing local image files are recorded in `missing_image_rows.csv`. ## Loading Example ```python from datasets import load_dataset dataset = load_dataset("intro/flickr8k") print(dataset["train"][0]) ``` ## Citation If you use this dataset, please cite: ```bibtex @article{hodosh2013framing, title={Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics}, author={Hodosh, Micah and Young, Peter and Hockenmaier, Julia}, journal={Journal of Artificial Intelligence Research}, volume={47}, pages={853--899}, year={2013}, url={http://www.jair.org/papers/paper3994.html} } ```
提供机构:
intro
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作