eduhk-compling/GroupG_Project
收藏Hugging Face2026-04-24 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/eduhk-compling/GroupG_Project
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个多模态标注数据集,包含来自王者荣耀职业联赛(KPL)三场动态比赛(2021秋季半决赛、2025总决赛和2025夏季总决赛)的65分24秒解说内容。数据集通过从BiliBili平台录制视频中提取AI生成的时间对齐转录文本,并结合必要的人工修改以确保准确性。它具有全面的人工标注,包括说话人识别、屏幕事件描述、领域特定术语(OOV)提取和情感分级。数据集以CSV格式提供,解决了高强度领域特定口语资源稀缺的问题,为多模态情感分析和电竞相关自然语言处理(NLP)研究提供了重要潜力。
This dataset introduces a multimodal labeled dataset including 65 minutes and 24 seconds of commentary from three dynamic matches in the King Pro League (KPL) of the game Honor of Kings: the 2021 Autumn Semifinals, 2025 Finals, and 2025 Summer Finals. This dataset was collected by extracting AI-generated, time-aligned transcripts from recording videos on BiliBili Platform, which combines essential manually modification to ensure accuracy. It features comprehensive manual annotations, including speaker recognition, screen event descriptions, domain-specific terminology (OOV) extraction, and emotion grading. Provided in CSV format, this dataset addresses the scarcity of high-intensity, domain-specific spoken language resources, offering significant potential for multimodal sentiment analysis and esports-related Natural Language Processing (NLP) research.
提供机构:
eduhk-compling



