five

DigiGreen/AgricultureVideosQnA2

收藏
Hugging Face2024-11-07 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/DigiGreen/AgricultureVideosQnA2
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - question-answering language: - or - hi tags: - agriculture - videoqna - videos size_categories: - 1K<n<10K --- The dataset is in XLS format with multiple sheets named for different languages. The dataset is primarily used for training and ground truth of answers that can be generated for agriculture related queries from the videos. Each sheet has list of video urls (youtube links) and the question that can be asked, corresponding answers that can be generated from the videos, source of information in the answer and time stamps. The sources of information could be: Transcript: based on what one hears Object: Based on an object shown Scene description: based on what is described Text overlay: based on text over lay shown in video Corresponding time stamps are also provided. The videos are in the following languages: Hindi Oriya

--- license: Apache-2.0 task_categories: - 问答(question-answering) language: - 奥里亚语(or) - 印地语(hi) tags: - 农业(agriculture) - 视频问答(videoqna) - 视频(videos) size_categories: - 1K<n<10K --- 本数据集采用XLS格式,包含多个以不同语言命名的工作表。本数据集主要用于农业相关视频查询的答案生成模型训练,以及对应答案的基准真值(ground truth)构建。 每个工作表均包含视频链接(YouTube链接)列表、可提出的查询问题、可从对应视频生成的答案、答案的信息来源,以及时间戳。 信息来源类型包括: - 字幕(Transcript):基于视频音频内容 - 目标(Object):基于视频中展示的实体 - 场景描述(Scene description):基于视频中的场景描述内容 - 文字叠加(Text overlay):基于视频中显示的叠加文字 同时还提供了对应的时间戳。 本数据集涉及的视频使用以下两种语言: 印地语 奥里亚语
提供机构:
DigiGreen
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作