Localizing Fake Segments in Speech

Name: Localizing Fake Segments in Speech
Creator: National University of Singapore
Published: 2022-06-24 08:40:22
License: 暂无描述

DataCite Commons2022-06-24 更新2024-07-13 收录

下载链接：

https://scholarbank.nus.edu.sg/handle/10635/227398

下载链接

链接失效反馈

官方服务：

资源简介：

Partial Synthetic Detection (Psynd) dataset is a multi-speaker English corpus of 2294 utterances, approximately 13 hours English speech at 24kHz sampling rate. It is derived from LibriTTS , a read English speech corpus (all real voices) designed for TTS research. The data samples are real utterances injected with voice cloning synthetic speech. The fake parts are generated by state-of-art multi-speaker text-to-speech method and have high similarity with target speakers characterized by Global Style Token (GST) and X-Vector.

提供机构：

National University of Singapore

创建时间：

2022-06-24

5,000+

优质数据集

54 个

任务类型

进入经典数据集