happy8825/ecva_zeroshot_thinking
Hugging Face · Updated 2025-12-16 · Indexed 2025-12-20
Download link:
https://hf-mirror.com/datasets/happy8825/ecva_zeroshot_thinking
Resource description:
---
pretty_name: "Qwen/Qwen3-VL-2B-Thinking · happy8825/valid_ecva_clean results"
language:
- en
tags:
- video-retrieval
- evaluation
- vllm
---
# Qwen/Qwen3-VL-2B-Thinking · happy8825/valid_ecva_clean results
- **Model**: `Qwen/Qwen3-VL-2B-Thinking`
- **Dataset**: `happy8825/valid_ecva_clean`
- **Generated**: `2025-12-16 01:28:46Z`
## Metrics
| Metric | Value |
| --- | --- |
| Total samples | 924 |
| With GT | 0 |
| Parsed answers | 0 |
| Top-1 accuracy | 0 |
| Recall@5 | 0 |
| MRR | 0 |
The uploaded `ecva_zeroshot_thinking.jsonl` file contains the full per-sample predictions, produced via `t3_infer_with_vllm.bash`.
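For readers unfamiliar with the retrieval metrics in the table above, the following is a minimal sketch of how top-1 accuracy, Recall@5, and MRR are conventionally computed. It is not the scoring code behind `t3_infer_with_vllm.bash`, which this card does not show. The function name `retrieval_metrics` and its inputs (a ranked candidate list and an optional ground-truth ID per sample) are hypothetical. Under this convention, a run in which no sample carries a retrieval ground truth (`with_gt` = 0) reports 0 for all three metrics, which would be consistent with the values above.

```python
from typing import Optional, Sequence


def retrieval_metrics(
    rankings: Sequence[Sequence[str]],
    ground_truth: Sequence[Optional[str]],
    k: int = 5,
) -> dict:
    """Top-1 accuracy, Recall@k and MRR over samples that have a ground truth.

    Hypothetical helper: `rankings[i]` is the ranked candidate list for
    sample i, `ground_truth[i]` is the correct candidate ID or None.
    """
    with_gt = 0
    hits_at_1 = 0
    hits_at_k = 0
    reciprocal_rank_sum = 0.0

    for ranked, gt in zip(rankings, ground_truth):
        if gt is None:
            continue  # no ground truth: the sample cannot be scored
        with_gt += 1
        if gt in ranked:
            rank = list(ranked).index(gt) + 1  # 1-based rank of the correct item
            hits_at_1 += rank == 1
            hits_at_k += rank <= k
            reciprocal_rank_sum += 1.0 / rank

    denom = with_gt or 1  # avoids division by zero; metrics stay 0.0
    return {
        "total": len(rankings),
        "with_gt": with_gt,
        "top1_acc": hits_at_1 / denom,
        f"recall_at_{k}": hits_at_k / denom,
        "mrr": reciprocal_rank_sum / denom,
    }


if __name__ == "__main__":
    # Toy data, not taken from the uploaded predictions.
    print(retrieval_metrics([["a", "b"], ["b", "a"]], ["a", "a"]))
```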
### EVQA/ECVA Metrics
| Metric | Value |
| --- | --- |
| EVQA total | 924 |
| EVQA with GT label | 924 |
| EVQA accuracy | 0.544372 |
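The EVQA accuracy is consistent with 503 correct answers out of 924 labeled samples (503 / 924 ≈ 0.544372). The sketch below shows one way such an accuracy could be computed from per-sample records. The field names `pred_label` and `gt_label` are assumptions made for illustration; the actual keys in the uploaded file may differ.

```python
def evqa_accuracy(records, pred_key="pred_label", gt_key="gt_label"):
    """Accuracy over records that carry a ground-truth label.

    `pred_key` and `gt_key` are hypothetical field names, not confirmed
    by the dataset card.
    """
    labeled = [r for r in records if r.get(gt_key) is not None]
    correct = sum(r.get(pred_key) == r[gt_key] for r in labeled)
    return {
        "evqa_total": len(records),
        "evqa_with_gt_label": len(labeled),
        "evqa_acc": correct / len(labeled) if labeled else 0.0,
    }


if __name__ == "__main__":
    # Arithmetic check against the reported value.
    print(round(503 / 924, 6))  # 0.544372
```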
## Run Summary
```
Saved 924 results to /home/seohyun/vid_understanding/video_retrieval/video_retrieval/output_ecva_zeroshot_thinking/ecva_zeroshot_thinking.json
Metrics: {
"total": 924,
"with_gt": 0,
"with_parsed_answer": 0,
"top1_acc": 0.0,
"recall_at_5": 0.0,
"mrr": 0.0,
"num_shards": 1,
"shard_index": 0,
"evqa_total": 924,
"evqa_with_gt_label": 924,
"evqa_acc": 0.5443722943722944
}
Pushed ecva_zeroshot_thinking.jsonl and README to https://huggingface.co/datasets/happy8825/ecva_zeroshot_thinking
```
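To inspect the pushed predictions locally, a JSON Lines file can be read with the standard library alone. This is a minimal sketch that assumes `ecva_zeroshot_thinking.jsonl` has already been downloaded from the repository and holds one JSON object per line; `read_jsonl` is a hypothetical helper, and the record schema is not documented in this card.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Return one parsed object per non-empty line of a JSON Lines file."""
    records = []
    with Path(path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records


# Example (assumes the file is in the current directory):
# records = read_jsonl("ecva_zeroshot_thinking.jsonl")
# print(len(records))
```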
Provider:
happy8825



