WSDMCup2023
收藏魔搭社区2026-01-06 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/toloka/WSDMCup2023
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Card for WSDMCup2023
## Dataset Description
- **Homepage:** [Toloka Visual Question Answering Challenge](https://toloka.ai/challenges/wsdm2023)
- **Repository:** [WSDM Cup 2023 Starter Pack](https://github.com/Toloka/WSDMCup2023)
- **Paper:** <https://arxiv.org/abs/2309.16511>
- **Leaderboard:** [CodaLab Competition Leaderboard](https://codalab.lisn.upsaclay.fr/competitions/7434#results)
- **Point of Contact:** research@toloka.ai
| Question | Image and Answer |
| --- | --- |
| What do you use to hit the ball? | <img src="https://tlkfrontprod.azureedge.net/portal-production/static/uploaded/images/KUsGAc_eqdMcNxkBXzzl/KUsGAc_eqdMcNxkBXzzl_webp_1280_x2.webp" width="228" alt="What do you use to hit the ball?"> |
| What do people use for cutting? | <img src="https://tlkfrontprod.azureedge.net/portal-production/static/uploaded/images/brXEVYckNLfQKcfNu4DF/brXEVYckNLfQKcfNu4DF_webp_1280_x2.webp" width="228" alt="What do people use for cutting?"> |
| What do we use to support the immune system and get vitamin C? | <img src="https://tlkfrontprod.azureedge.net/portal-production/static/uploaded/images/HQ0A-ZvZCGCmYfTs83K7/HQ0A-ZvZCGCmYfTs83K7_webp_1280_x2.webp" width="228" alt="What do we use to support the immune system and get vitamin C?"> |
### Dataset Summary
The WSDMCup2023 Dataset consists of images associated with textual questions.
One entry (instance) in our dataset is a question-image pair labeled with the ground truth coordinates of a bounding box containing
the visual answer to the given question. The images were obtained from a CC BY-licensed subset of the Microsoft Common Objects in
Context dataset, [MS COCO](https://cocodataset.org/). All data labeling was performed on the [Toloka crowdsourcing platform](https://toloka.ai/).
Our dataset has 45,199 instances split among three subsets: train (38,990 instances), public test (1,705 instances),
and private test (4,504 instances). The entire train dataset was available for everyone since the start of the challenge.
The public test dataset was available since the evaluation phase of the competition but without any ground truth labels.
After the end of the competition, public and private sets were released.
## Dataset Citation
Please cite the challenge results or dataset description as follows.
- Ustalov D., Pavlichenko N., Koshelev S., Likhobaba D., and Smirnova A. [Toloka Visual Question Answering Benchmark](https://arxiv.org/abs/2309.16511). 2023. arXiv: [2309.16511 [cs.CV]](https://arxiv.org/abs/2309.16511).
```bibtex
@inproceedings{TolokaWSDMCup2023,
author = {Ustalov, Dmitry and Pavlichenko, Nikita and Koshelev, Sergey and Likhobaba, Daniil and Smirnova, Alisa},
title = {{Toloka Visual Question Answering Benchmark}},
year = {2023},
eprint = {2309.16511},
eprinttype = {arxiv},
eprintclass = {cs.CV},
language = {english},
}
```
### Supported Tasks and Leaderboards
Grounding Visual Question Answering
### Language
English
## Dataset Structure
### Data Instances
A data instance contains a URL to the picture, information about the image size - width and height, information about the ground truth bounding box - left top and right bottom dots, and contains the question related to the picture.
```
{'image': https://toloka-cdn.azureedge.net/wsdmcup2023/000000000013.jpg,
'width': 640,
'height': 427,
'left': 129,
'top': 192,
'right': 155,
'bottom': 212,
'question': What does it use to breath?}
```
### Data Fields
* image: contains URL to the image
* width: value in pixels of image width
* height: value in pixels of image height
* left: the x coordinate in pixels to determine the left-top dot of the bounding box
* top: the y coordinate in pixels to determine the left-top dot of the bounding box
* right: the x coordinate in pixels to determine the right-bottom dot of the bounding box
* bottom: the y coordinate in pixels to determine the right-bottom dot of the bounding box
* question: a question related to the picture
### Data Splits
There are four splits in the data: train, train_sample, test_public, and test_private. 'train' split contains the full pull for model training.
The 'train-sample' split contains the part of the 'train' split. The 'test_public' split contains public data to test the model.
The 'test_private' split contains private data for the final model test.
### Source Data
The images were obtained from a CC BY-licensed subset of the Microsoft Common Objects in
Context dataset, [MS COCO](https://cocodataset.org/).
### Annotations
All data labeling was performed on the [Toloka crowdsourcing platform](https://toloka.ai/).
Only annotators who self-reported the knowledge of English had access to the annotation task.
### Citation Information
* Competition: https://toloka.ai/challenges/wsdm2023
* CodaLab: https://codalab.lisn.upsaclay.fr/competitions/7434
* Dataset: https://doi.org/10.5281/zenodo.7057740
# WSDMCup2023 数据集卡片
## 数据集描述
- **主页:** [Toloka视觉问答挑战赛](https://toloka.ai/challenges/wsdm2023)
- **代码仓库:** [WSDM Cup 2023 入门包](https://github.com/Toloka/WSDMCup2023)
- **论文:** <https://arxiv.org/abs/2309.16511>
- **排行榜:** [CodaLab竞赛排行榜](https://codalab.lisn.upsaclay.fr/competitions/7434#results)
- **联络人:** research@toloka.ai
| 问题 | 图像与答案 |
| --- | --- |
| 你用什么击打棒球? | <img src="https://tlkfrontprod.azureedge.net/portal-production/static/uploaded/images/KUsGAc_eqdMcNxkBXzzl/KUsGAc_eqdMcNxkBXzzl_webp_1280_x2.webp" width="228" alt="What do you use to hit the ball?"> |
| 人们用什么进行切割? | <img src="https://tlkfrontprod.azureedge.net/portal-production/static/uploaded/images/brXEVYckNLfQKcfNu4DF/brXEVYckNLfQKcfNu4DF_webp_1280_x2.webp" width="228" alt="What do people use for cutting?"> |
| 我们用什么来支持免疫系统并获取维生素C? | <img src="https://tlkfrontprod.azureedge.net/portal-production/static/uploaded/images/HQ0A-ZvZCGCmYfTs83K7/HQ0A-ZvZCGCmYfTs83K7_webp_1280_x2.webp" width="228" alt="What do we use to support the immune system and get vitamin C?"> |
### 数据集概述
WSDMCup2023数据集由与文本问题相关联的图像组成。本数据集的每条条目(实例)均为一个问题-图像对,并标注有包含给定问题的视觉答案的边界框真实坐标。图像源自微软通用物体在上下文(Microsoft Common Objects in Context, [MS COCO](https://cocodataset.org/))数据集的CC BY许可子集。所有数据标注均在[Toloka众包平台](https://toloka.ai/)上完成。
本数据集共包含45199个实例,分为三个子集:训练集(38990个实例)、公开测试集(1705个实例)与私有测试集(4504个实例)。竞赛启动之初,完整训练集即对所有参赛者开放。公开测试集于竞赛评估阶段开放,但不提供任何真实标注。竞赛结束后,公开测试集与私有测试集均已公开。
## 数据集引用
请按照以下方式引用该竞赛成果或数据集描述:
- Ustalov D., Pavlichenko N., Koshelev S., Likhobaba D., Smirnova A. [Toloka视觉问答基准](https://arxiv.org/abs/2309.16511). 2023. arXiv: [2309.16511 [cs.CV]](https://arxiv.org/abs/2309.16511).
bibtex
@inproceedings{TolokaWSDMCup2023,
author = {Ustalov, Dmitry and Pavlichenko, Nikita and Koshelev, Sergey and Likhobaba, Daniil and Smirnova, Alisa},
title = {{Toloka Visual Question Answering Benchmark}},
year = {2023},
eprint = {2309.16511},
eprinttype = {arxiv},
eprintclass = {cs.CV},
language = {english},
}
### 支持任务与排行榜
视觉问答接地(Grounding Visual Question Answering)
### 语言
英语
## 数据集结构
### 数据实例
一个数据实例包含图像的URL链接、图像尺寸信息(宽度与高度)、真实边界框信息(左上角与右下角坐标),以及与该图像相关的问题。
{'image': https://toloka-cdn.azureedge.net/wsdmcup2023/000000000013.jpg,
'width': 640,
'height': 427,
'left': 129,
'top': 192,
'right': 155,
'bottom': 212,
'question': 它用什么呼吸?}
### 数据字段
* **image:** 图像的URL链接
* **width:** 图像宽度的像素值
* **height:** 图像高度的像素值
* **left:** 用于确定边界框左上角的x轴像素坐标
* **top:** 用于确定边界框左上角的y轴像素坐标
* **right:** 用于确定边界框右下角的x轴像素坐标
* **bottom:** 用于确定边界框右下角的y轴像素坐标
* **question:** 与该图像相关的问题
### 数据划分
数据集共包含四个划分:train(训练集)、train_sample(训练采样集)、test_public(公开测试集)与test_private(私有测试集)。其中`train`划分包含用于模型训练的完整数据集;`train_sample`划分包含`train`划分的部分子集;`test_public`划分包含用于模型测试的公开数据;`test_private`划分包含用于最终模型测试的私有数据。
### 源数据
图像源自微软通用物体在上下文(Microsoft Common Objects in Context, [MS COCO](https://cocodataset.org/))数据集的CC BY许可子集。
### 标注信息
所有数据标注均在[Toloka众包平台](https://toloka.ai/)上完成。仅自述掌握英语的标注者可参与标注任务。
### 引用信息
* 竞赛:https://toloka.ai/challenges/wsdm2023
* CodaLab:https://codalab.lisn.upsaclay.fr/competitions/7434
* 数据集:https://doi.org/10.5281/zenodo.7057740
提供机构:
maas
创建时间:
2025-09-15



