Chinese Classical Opera Script Dataset

Name: Chinese Classical Opera Script Dataset
Creator: Science Data Bank
Published: 2025-12-18 03:06:31
License: No description available

Source: DataCite Commons (updated 2025-12-18; indexed 2026-05-05)

Download link:

https://www.scidb.cn/detail?dataSetId=affb050c25584d5d8405f15c763313b0

Description:

This dataset of classical opera scripts was built with the support of Professor Cheng Yun's team at the Cultural Heritage Intelligent Computing Laboratory of Wuhan University. It is an interim result of the "Classical Opera Text Analysis and Intelligent Reconstruction Platform" and currently consists mainly of text data. The platform focuses on classical opera and aims to build a multimodal intelligent data platform that brings together opera literature and cultural relics. It plans to integrate and apply multimodal data centered on scripts. Over the past two years of platform construction, the team has proofread 331 Yuan drama scripts totaling 2.52 million words; constructed a multi-granularity knowledge annotation system to support the revitalization of, and academic research on, traditional Chinese opera scripts; and, using the nested annotation function built into the platform, completed sentence-group-level annotation of 4 scripts, accumulating 17,000 data records and enabling visual conversion of scripts between the "Qupai Lianzuo" and "Banqiang" reading modes. The project's research ideas and methods have so far been applied to the intelligent collation of the script of "The Peony Pavilion" and to the construction of a knowledge base for classical opera lyrics at the Capital Library. The dataset includes text data for 50 zaju (miscellaneous drama) scripts and annotated data for three versions of the chuanqi play "The Peony Pavilion", in Excel, XML, and JSON formats.

Provider:

Science Data Bank

Created:

2025-12-18
