five

happy8825/ecva_instruct_ver2

收藏
Hugging Face2025-12-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/happy8825/ecva_instruct_ver2
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: "/hub_data4/seohyun/saves/ecva_instruct/full/sft/checkpoint-175 · happy8825/valid_ecva_clean results" language: - en tags: - video-retrieval - evaluation - vllm --- # /hub_data4/seohyun/saves/ecva_instruct/full/sft/checkpoint-175 · happy8825/valid_ecva_clean results - **Model**: `/hub_data4/seohyun/saves/ecva_instruct/full/sft/checkpoint-175` - **Dataset**: `happy8825/valid_ecva_clean` - **Generated**: `2025-12-18 10:50:16Z` ## Metrics | Metric | Value | | --- | --- | | Total samples | 924 | | With GT | 0 | | Parsed answers | 0 | | Top-1 accuracy | 0 | | Recall@5 | 0 | | MRR | 0 | The uploaded JSON contains full per-sample predictions produced via `t3_infer_with_vllm.bash`. ### EVQA/ECVA Metrics | Metric | Value | | --- | --- | | EVQA total | 924 | | EVQA with GT label | 924 | | EVQA accuracy | 0.569264 | ## Run Summary ``` Saved 924 results to /home/seohyun/vid_understanding/video_retrieval/video_retrieval/output_ecva_instruct/ecva_instruct_ver2.json Metrics: { "total": 924, "with_gt": 0, "with_parsed_answer": 0, "top1_acc": 0.0, "recall_at_5": 0.0, "mrr": 0.0, "num_shards": 1, "shard_index": 0, "evqa_total": 924, "evqa_with_gt_label": 924, "evqa_acc": 0.5692640692640693 } Pushed ecva_instruct_ver2.jsonl and README to https://huggingface.co/datasets/happy8825/ecva_instruct_ver2 ```

pretty_name: "/hub_data4/seohyun/saves/ecva_instruct/full/sft/checkpoint-175 · happy8825/valid_ecva_clean 结果集 language: - 英语 tags: - 视频检索(video-retrieval) - 评估(evaluation) - vllm # `/hub_data4/seohyun/saves/ecva_instruct/full/sft/checkpoint-175 · happy8825/valid_ecva_clean 结果集 - **模型**:`/hub_data4/seohyun/saves/ecva_instruct/full/sft/checkpoint-175` - **数据集**:`happy8825/valid_ecva_clean` - **生成时间**:`2025-12-18 10:50:16Z` ## 评估指标 | 指标 | 数值 | | --- | --- | | 总样本数 | 924 | | 含真实标签(Ground Truth)样本数 | 0 | | 已解析答案数 | 0 | | Top-1准确率 | 0 | | 召回率@5(Recall@5) | 0 | | 平均倒数排名(Mean Reciprocal Rank) | 0 | 本次上传的JSON文件包含通过`t3_infer_with_vllm.bash脚本生成的完整逐样本预测结果。 ### EVQA/ECVA 评估指标 | 指标 | 数值 | | --- | --- | | EVQA总样本数 | 924 | | EVQA含真实标签样本数 | 924 | | EVQA准确率 | 0.569264 | ## 运行摘要 已将924条结果保存至 /home/seohyun/vid_understanding/video_retrieval/video_retrieval/output_ecva_instruct/ecva_instruct_ver2.json 评估指标:{ "total": 924, "with_gt": 0, "with_parsed_answer": 0, "top1_acc": 0.0, "recall_at_5": 0.0, "mrr": 0.0, "num_shards": 1, "shard_index": 0, "evqa_total": 924, "evqa_with_gt_label": 924, "evqa_acc": 0.5692640692640693 } 已将ecva_instruct_ver2.jsonl与README推送至 https://huggingface.co/datasets/happy8825/ecva_instruct_ver2
提供机构:
happy8825
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作