Naniee/meetingbank
Source: Hugging Face. Updated: 2026-04-26. Indexed: 2026-05-03.
Download link:
https://hf-mirror.com/datasets/Naniee/meetingbank
Description:
MeetingBank is a benchmark dataset created from the city council meetings of 6 major U.S. cities to supplement existing datasets. It contains 1,366 meetings with over 3,579 hours of video, as well as transcripts, PDF documents of meeting minutes, agendas, and other metadata. On average, a council meeting is 2.6 hours long and its transcript contains over 28k tokens, making it a valuable testbed for meeting summarizers and for extracting structure from meeting videos. The dataset contains 6,892 segment-level summarization instances for training and evaluation.
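As a quick consistency check on the figures above, the per-meeting averages can be recomputed from the stated totals. This is a minimal sketch: the variable names are illustrative, the segments-per-meeting value is derived here and is not stated in the description, and because the video total is given as "over 3,579 hours", the computed duration is a lower bound.

```python
# Totals as stated in the dataset description.
num_meetings = 1366
total_video_hours = 3579      # stated as "over 3,579 hours", so a lower bound
num_summary_instances = 6892  # segment-level summarization instances

# Average meeting length implied by the totals.
hours_per_meeting = total_video_hours / num_meetings

# Derived figure (not stated in the description): summarization
# instances per meeting, on average.
segments_per_meeting = num_summary_instances / num_meetings

print(f"Average meeting length: {hours_per_meeting:.2f} hours")
print(f"Average segments per meeting: {segments_per_meeting:.2f}")
```

The recomputed average rounds to the 2.6 hours quoted in the description, and the totals imply roughly five summarization segments per meeting.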
Provider:
Naniee



