VisRAG-Ret-Test-PlotQA
收藏魔搭社区2025-12-18 更新2025-05-17 收录
下载链接:
https://modelscope.cn/datasets/OpenBMB/VisRAG-Ret-Test-PlotQA
下载链接
链接失效反馈官方服务:
资源简介:
## Dataset Description
This is a VQA dataset based on Scientific Plots from PlotQA dataset from [PlotQA](https://arxiv.org/abs/1909.00997).
### Load the dataset
```python
from datasets import load_dataset
import csv
def load_beir_qrels(qrels_file):
qrels = {}
with open(qrels_file) as f:
tsvreader = csv.DictReader(f, delimiter="\t")
for row in tsvreader:
qid = row["query-id"]
pid = row["corpus-id"]
rel = int(row["score"])
if qid in qrels:
qrels[qid][pid] = rel
else:
qrels[qid] = {pid: rel}
return qrels
corpus_ds = load_dataset("openbmb/VisRAG-Ret-Test-PlotQA", name="corpus", split="train")
queries_ds = load_dataset("openbmb/VisRAG-Ret-Test-PlotQA", name="queries", split="train")
qrels_path = "xxxx" # path to qrels file which can be found under qrels folder in the repo.
qrels = load_beir_qrels(qrels_path)
```
## 数据集描述
本数据集为基于PlotQA数据集所收录的科学图表构建的视觉问答(Visual Question Answering, VQA)数据集,相关来源可参考[PlotQA](https://arxiv.org/abs/1909.00997)。
### 数据集加载
python
from datasets import load_dataset
import csv
def load_beir_qrels(qrels_file):
qrels = {}
with open(qrels_file) as f:
tsvreader = csv.DictReader(f, delimiter=" ")
for row in tsvreader:
qid = row["query-id"]
pid = row["corpus-id"]
rel = int(row["score"])
if qid in qrels:
qrels[qid][pid] = rel
else:
qrels[qid] = {pid: rel}
return qrels
corpus_ds = load_dataset("openbmb/VisRAG-Ret-Test-PlotQA", name="corpus", split="train")
queries_ds = load_dataset("openbmb/VisRAG-Ret-Test-PlotQA", name="queries", split="train")
qrels_path = "xxxx" # 请填入qrels文件路径,该文件可在仓库的qrels文件夹中获取。
qrels = load_beir_qrels(qrels_path)
提供机构:
maas
创建时间:
2025-05-15



