five

happy8825/ecva_zeroshot_thinking

收藏
Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/happy8825/ecva_zeroshot_thinking
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: "Qwen/Qwen3-VL-2B-Thinking · happy8825/valid_ecva_clean results" language: - en tags: - video-retrieval - evaluation - vllm --- # Qwen/Qwen3-VL-2B-Thinking · happy8825/valid_ecva_clean results - **Model**: `Qwen/Qwen3-VL-2B-Thinking` - **Dataset**: `happy8825/valid_ecva_clean` - **Generated**: `2025-12-16 01:28:46Z` ## Metrics | Metric | Value | | --- | --- | | Total samples | 924 | | With GT | 0 | | Parsed answers | 0 | | Top-1 accuracy | 0 | | Recall@5 | 0 | | MRR | 0 | The uploaded JSON contains full per-sample predictions produced via `t3_infer_with_vllm.bash`. ### EVQA/ECVA Metrics | Metric | Value | | --- | --- | | EVQA total | 924 | | EVQA with GT label | 924 | | EVQA accuracy | 0.544372 | ## Run Summary ``` Saved 924 results to /home/seohyun/vid_understanding/video_retrieval/video_retrieval/output_ecva_zeroshot_thinking/ecva_zeroshot_thinking.json Metrics: { "total": 924, "with_gt": 0, "with_parsed_answer": 0, "top1_acc": 0.0, "recall_at_5": 0.0, "mrr": 0.0, "num_shards": 1, "shard_index": 0, "evqa_total": 924, "evqa_with_gt_label": 924, "evqa_acc": 0.5443722943722944 } Pushed ecva_zeroshot_thinking.jsonl and README to https://huggingface.co/datasets/happy8825/ecva_zeroshot_thinking ```
提供机构:
happy8825
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作