five

Chinese Classical Opera Script Dataset

收藏
科学数据银行2025-12-17 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=affb050c25584d5d8405f15c763313b0
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset of classical opera scripts was built with the support of Professor Cheng Yun's team from the Cultural Heritage Intelligent Computing Laboratory of Wuhan University. It is a phased achievement of the "Classical Opera Text Analysis and Intelligent Reconstruction Platform" and currently mainly consists of text-based data. The "Classical Opera Text Analysis and Intelligent Reconstruction Platform" focuses on classical opera and aims to build a multimodal intelligent data platform that gathers opera literature and cultural relics. It plans to integrate and apply multimodal data with scripts as the center. Over the past two years of platform construction, 331 yuan drama scripts have been proofread, totaling 2.52 million words; Construct a multi granularity knowledge annotation system for the revitalization and academic research of traditional Chinese opera scripts; On the basis of the nested annotation function installed on the platform, the sentence group level data annotation of 4 scripts was completed, accumulating 17000 pieces of data and realizing the visual conversion of scripts between the "Qupai Lianzuo" and "Banqiang" reading modes. At present, the research ideas and methods related to the project have been applied to the intelligent sorting of the script of "The Peony Pavilion" and the construction of a knowledge base for classical opera lyrics at the Capital Library. This platform includes text data for 50 miscellaneous drama scripts and annotated data for three versions of the legendary "Peony Pavilion" script, in Excel/XML/JSON formats.
提供机构:
Weng Mengjuan; zhang ya jing; Wuhan University
创建时间:
2025-12-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作