
eduhk-compling/GroupG_Project

Source: Hugging Face | Updated: 2026-04-24 | Indexed: 2026-04-26
Download link:
https://hf-mirror.com/datasets/eduhk-compling/GroupG_Project
Description:
This multimodal annotated dataset contains 65 minutes and 24 seconds of commentary from three dynamic matches in the King Pro League (KPL), the professional league of the game Honor of Kings: the 2021 Autumn Semifinals, the 2025 Finals, and the 2025 Summer Finals. The transcripts are AI-generated and time-aligned, extracted from video recordings on the BiliBili platform and manually corrected where necessary to ensure accuracy. The dataset features comprehensive manual annotations: speaker identification, on-screen event descriptions, extraction of domain-specific out-of-vocabulary (OOV) terminology, and emotion grading. Provided in CSV format, it addresses the scarcity of high-intensity, domain-specific spoken-language resources and offers significant potential for multimodal sentiment analysis and esports-related natural language processing (NLP) research.
Provided by:
eduhk-compling