happy8825/ecva_zeroshot_thinking

Name: happy8825/ecva_zeroshot_thinking
Creator: happy8825
Published: 2025-12-16 01:28:50
License: 暂无描述

Hugging Face2025-12-16 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/happy8825/ecva_zeroshot_thinking

下载链接

链接失效反馈

官方服务：

资源简介：

--- pretty_name: "Qwen/Qwen3-VL-2B-Thinking · happy8825/valid_ecva_clean results" language: - en tags: - video-retrieval - evaluation - vllm --- # Qwen/Qwen3-VL-2B-Thinking · happy8825/valid_ecva_clean results - **Model**: `Qwen/Qwen3-VL-2B-Thinking` - **Dataset**: `happy8825/valid_ecva_clean` - **Generated**: `2025-12-16 01:28:46Z` ## Metrics | Metric | Value | | --- | --- | | Total samples | 924 | | With GT | 0 | | Parsed answers | 0 | | Top-1 accuracy | 0 | | Recall@5 | 0 | | MRR | 0 | The uploaded JSON contains full per-sample predictions produced via `t3_infer_with_vllm.bash`. ### EVQA/ECVA Metrics | Metric | Value | | --- | --- | | EVQA total | 924 | | EVQA with GT label | 924 | | EVQA accuracy | 0.544372 | ## Run Summary ``` Saved 924 results to /home/seohyun/vid_understanding/video_retrieval/video_retrieval/output_ecva_zeroshot_thinking/ecva_zeroshot_thinking.json Metrics: { "total": 924, "with_gt": 0, "with_parsed_answer": 0, "top1_acc": 0.0, "recall_at_5": 0.0, "mrr": 0.0, "num_shards": 1, "shard_index": 0, "evqa_total": 924, "evqa_with_gt_label": 924, "evqa_acc": 0.5443722943722944 } Pushed ecva_zeroshot_thinking.jsonl and README to https://huggingface.co/datasets/happy8825/ecva_zeroshot_thinking ```

提供机构：

happy8825

5,000+

优质数据集

54 个

任务类型

进入经典数据集