ASCEND
收藏魔搭社区2025-09-14 更新2025-03-01 收录
下载链接:
https://modelscope.cn/datasets/OpenDataLab/ASCEND
下载链接
链接失效反馈官方服务:
资源简介:
displayName: ASCEND
labelTypes:
- Chinese Corpus
license:
- CC BY-SA 4.0
mediaTypes:
- Speech
paperUrl: https://arxiv.org/pdf/2112.06223v6.pdf
publishDate: "2022"
publishUrl: https://github.com/HLTCHKUST/ASCEND
publisher:
- Hong Kong University of Science and Technology
tags:
- Voice
taskTypes: []
---
# 数据集介绍
## 简介
ASCEND(A Spontaneous Chinese-English Dataset)引入了在香港收集的自发多轮会话对话中文语码转换语料库的优质资源。 ASCEND 包括 23 名中英文流利的双语者,由 10.62 小时的干净语音语料库组成。
## 类定义
null
## 引文
```
@article{lovenia2021ascend,
title={ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation},
author={Lovenia, Holy and Cahyawijaya, Samuel and Winata, Genta Indra and Xu, Peng and Yan, Xu and Liu, Zihan and Frieske, Rita and Yu, Tiezheng and Dai, Wenliang and Barezi, Elham J and others},
journal={arXiv preprint arXiv:2112.06223},
year={2021}
}
```
## Download dataset
:modelscope-code[]{type="git"}
显示名称:ASCEND
标签类型:
- 中文语料库
许可协议:
- 知识共享署名-相同方式共享4.0协议(CC BY-SA 4.0)
媒体类型:
- 语音(Speech)
论文链接:https://arxiv.org/pdf/2112.06223v6.pdf
发布日期:"2022"
发布地址:https://github.com/HLTCHKUST/ASCEND
发布机构:
- 香港科技大学(Hong Kong University of Science and Technology)
标签:
- 语音
任务类型:[]
---
# 数据集介绍
## 简介
ASCEND(全称A Spontaneous Chinese-English Dataset,即自发式中英双语语料库)收录了在香港采集的高质量自发多轮会话中英双语码转换语料资源。该数据集涵盖23名精通中英双语的双语者语音数据,总时长达10.62小时的纯净语音语料库。
## 类定义
无
## 引用文献
@article{lovenia2021ascend,
title={ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation},
author={Lovenia, Holy and Cahyawijaya, Samuel and Winata, Genta Indra and Xu, Peng and Yan, Xu and Liu, Zihan and Frieske, Rita and Yu, Tiezheng and Dai, Wenliang and Barezi, Elham J and others},
journal={arXiv preprint arXiv:2112.06223},
year={2021}
}
## 数据集下载
`modelscope-code`(类型:Git)
提供机构:
maas
创建时间:
2024-07-29



