四川话方言朗读数据库

Name: 四川话方言朗读数据库
Creator: 苏州核数聚信息科技有限公司
Published: 2025-06-13 16:49:04
License: 暂无描述

江苏数据交易所2025-06-13 更新2026-01-30 收录

下载链接：

https://exchange.jsdataex.com/trade-home/#/project/tradingMarket/productDetail?productId=1000

下载链接

链接失效反馈

官方服务：

资源简介：

一、基本信息【产品名称】四川话方言朗读数据库【系列名称】CDT-ASR-SC_DIALECT-2022-0005【供方名称】苏州核数聚信息科技有限公司【应用领域】数字金融、数字医疗、工业互联、能源互联、智慧出行、信息技术等【来源行业】人工智能【数据主题】方言数据【产品类型】语音数据包【产品描述】该数据为四川方言朗读数据，可用于语音识别系统训练、测试、语音分析、口音研究等多种用途。主要采集人群为：主要采集人群为：重庆、成都、乐山、南充、广元等地。字正：98%以上。【关键词】方言、四川话、朗读数据二、物理信息【更新频率】一年【覆盖范围】重庆、成都、乐山、南充、广元【存储大小】 34G【存储格式】语音(采样率 16k Hz，采样精度是 16bit，单通道，未压缩 wav）【底层数据维度】共有2个标签字段，分别为：语料、标注序号参数名称字段名称字段描述示例值1语料ref音频采集语料2标注lab语音内容的实际标注结果三、内容说明【样例地址】https://pan.baidu.com/s/1IDaBOK2j8iaKaR1umYrHgQ?pwd=apsj【有效时长】 300小时【录制人数】 890人【音频格式】采样率 16k Hz，采样精度 16bit，单通道，未压缩 wav【人群比例】男女比例（6：4）【采集设备】智能手机，基于 Android 、IOS  系统的多种品牌型号手机进行录制【录音人地域分布】重庆、成都、乐山、南充、广元【数据字典】序号参数名称显示值描述1四川方言注音每个字词对应相应的注音，如：巴适 ba si输出各具体字段内容及其含义说明【数据名称】四环方言朗读数据库【样例数据】序号参数名称字段名称字段描述示例值1角色id角色序号音频名第一列为角色序号2音频id音频音频名第二列为音频id，同一角色不同id【数据条数】 291249条四、使用案例家电项目1、语音识别模型训练：这批音频数据可以用于训练语音识别模型，以使产品能够准确地识别和理解四川话的语音指令和对话。通过收集大量的四川话音频数据，并使用机器学习和深度学习技术进行模型训练，可以提高语音识别的准确性和稳定性。2、语音命令和控制：利用四川话指令音频数据，家电产品可以开发语音命令和控制功能。这意味着用户可以使用方言话与电视进行对话，并通过语音指令控制电视、音箱等的各种功能，如调整音量、切换频道、打开特定应用程序等。这种语音控制的功能提供了一种更直观和便捷的方式来操作电视。3、语音搜索和推荐：四川餐饮、银行等客户服务可以利用这批音频数据实现语音搜索和推荐功能。用户用方言进行内容等搜索，并获得相关的结果。基于这些数据，还可以分析用户的喜好和兴趣，提供个性化的推荐内容，以改善用户的观看体验。五、使用说明【交付方式】：方式一：云端介质（网盘、ftp等）；【使用方式】：可直接使用；六、使用条件【使用目的】：智能产品训练验证【使用主体】：无【使用者资质】：无【时间限制】：无【转让限制】：限制客户二次销售本数据库【其它限制】：无七、来源描述【来源方式】：自行生产，300小时有效数据【采集说明】：音频格式采样率16k Hz，采样精度是16bit，单通道，未压缩wav人群比例男女比例（6：4）采集设备智能手机，基于 Android、IOS 系统的多种品牌型号手机进行录制环境相对安静室内环境，无回声语料领域涵盖售车、时政、直播、课堂、餐饮录音人地域分布重庆、成都、乐山、南充、广元标签无【敏感数据脱敏】：是八、产品价格购买方式价格/计费方式使用次数按小时数据计算购买价格600元/小时不限

I. Basic Information 【Product Name】 Sichuan Dialect Reading Database 【Series Name】 CDT-ASR-SC_DIALECT-2022-0005 【Supplier】 Suzhou Heshuju Information Technology Co., Ltd. 【Application Fields】 Digital Finance, Digital Healthcare, Industrial Internet, Energy Internet, Smart Mobility, Information Technology, etc. 【Source Industry】 Artificial Intelligence 【Data Theme】 Dialect Data 【Product Type】 Voice Data Package 【Product Description】 This dataset comprises Sichuan dialect reading audio, applicable to multiple scenarios including training and testing speech recognition systems, speech analysis, accent research and more. The main data collection population comes from regions including Chongqing, Chengdu, Leshan, Nanchong, Guangyuan and other areas. The pronunciation accuracy exceeds 98%. 【Keywords】 dialect, Sichuan dialect, reading data II. Physical Information 【Update Frequency】 Annually 【Coverage】 Chongqing, Chengdu, Leshan, Nanchong, Guangyuan 【Storage Size】 34 GB 【Storage Format】 Audio (sample rate: 16 kHz, sample precision: 16-bit, mono channel, uncompressed WAV) 【Underlying Data Dimensions】 There are 2 label fields, specifically: 1. Corpus (ref): Audio collection corpus 2. Label (lab): Actual annotation result of speech content III. Content Description 【Sample Address】 https://pan.baidu.com/s/1IDaBOK2j8iaKaR1umYrHgQ?pwd=apsj 【Total Valid Duration】 300 hours 【Number of Recording Speakers】 890 individuals 【Audio Format】 Sample rate: 16 kHz, sample precision: 16-bit, mono channel, uncompressed WAV 【Gender Ratio】 Male:Female = 6:4 【Collection Equipment】 Smartphones of various brands and models based on Android and iOS systems 【Speaker Geographical Distribution】 Chongqing, Chengdu, Leshan, Nanchong, Guangyuan 【Data Dictionary】 | No. | Parameter Name | Display Value | Description | Example | | --- | --- | --- | --- | --- | | 1 | Sichuan Dialect Phonetic Notation | Phonetic notation corresponding to each word, e.g., "巴适" → "ba si" | Output specific field content and its meaning explanation | N/A 【Data Name】 Sichuan Dialect Reading Database 【Sample Data】 | No. | Parameter Name | Field Name | Description | Example | | --- | --- | --- | --- | --- | | 1 | Role ID | Role Serial Number | The first column of the audio filename is the role serial number | N/A | 2 | Audio ID | Audio ID | The second column of the audio filename is the audio ID, with different IDs for the same role | N/A 【Total Number of Data Entries】 291,249 entries IV. Use Cases: Home Appliance Applications 1. Speech Recognition Model Training: This batch of audio data can be used to train speech recognition models, enabling products to accurately recognize and understand Sichuan dialect voice commands and dialogues. By collecting large-scale Sichuan dialect audio data and training models with machine learning and deep learning technologies, the accuracy and stability of speech recognition can be enhanced. 2. Voice Command and Control: Leveraging Sichuan dialect command audio data, home appliance products can develop voice command and control functions. Users can converse with TVs in dialect and control various functions of devices such as TVs and speakers via voice commands, including adjusting volume, switching channels, launching specific applications and more. This voice control feature offers a more intuitive and convenient way to operate home appliances. 3. Voice Search and Recommendation: Customer service scenarios like Sichuan catering and banking can utilize this dataset to implement voice search and recommendation functions. Users can search for content and other information in dialect and obtain relevant results. Additionally, user preferences and interests can be analyzed based on these data to deliver personalized recommended content, improving user experience. V. Usage Instructions 【Delivery Methods】 Method 1: Cloud media (network disk, FTP, etc.); 【Usage Methods】 Directly available for use; VI. Usage Terms 【Purpose of Use】 Intelligent product training and verification 【User Subject】 None 【User Qualifications】 None 【Time Limit】 None 【Transfer Restrictions】 Customers are prohibited from reselling this database 【Other Restrictions】 None VII. Source Description 【Source Method】 Self-produced, 300 hours of valid data 【Collection Instructions】 Audio format: sample rate 16 kHz, sample precision: 16-bit, mono channel, uncompressed WAV. Speaker gender ratio: Male:Female = 6:4. Collection equipment: Smartphones of various brands and models running Android and iOS systems. Recording environment: Quiet indoor environment with no echo. Corpus coverage areas include vehicle sales, current affairs, live streaming, classes and catering. No attached labels. 【Sensitive Data Desensitization】 Yes VIII. Product Pricing Purchase Method: Pricing is calculated based on hourly data usage. Purchase price: 600 CNY per hour, with unlimited usage times.

提供机构：

苏州核数聚信息科技有限公司

创建时间：

2025-06-13

搜集汇总

背景与挑战

背景概述

该数据集是一个四川话方言朗读数据库，包含300小时的有效音频数据，由890名来自重庆、成都、乐山、南充和广元等地的录制者使用智能手机采集，音频格式为16k Hz采样率的未压缩wav文件。数据适用于语音识别系统训练、口音研究和智能产品开发，覆盖数字金融、医疗等多个应用领域，并附带使用限制如禁止二次销售。

以上内容由遇见数据集搜集并总结生成