swdq/asr_correct

Name: swdq/asr_correct
Creator: swdq
Published: 2024-12-11 16:42:08
License: Not specified

Source: Hugging Face
Updated: 2024-12-11
Indexed: 2024-12-14

Download link:

https://hf-mirror.com/datasets/swdq/asr_correct

Description:

This dataset is intended primarily for automatic speech recognition (ASR), with a focus on Japanese-language content. It carries NSFW (Not Safe For Work) and visual-related tags, and its size falls in the 100K to 1M range. It is assembled from several source datasets, including OOPPEENN/Galgame_Dataset, grider-withourai/nekopara-speech, and litagin/Galgame_Speech_ASR_16kHz. Its purpose is to address the Whisper model's inaccuracy when transcribing NSFW content by pairing Whisper's transcription output with the correct labels.
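As a rough illustration of how Whisper transcripts paired with correct labels can be used, the sketch below measures how far a set of transcripts is from their labels using character error rate (CER), a common metric for Japanese ASR. It is only a sketch under stated assumptions: the `(whisper_output, correct_label)` pair layout and the sample sentences are invented for illustration and are not taken from the dataset, whose actual column names are not given here.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(pairs):
    """Total edit distance divided by total reference length."""
    total_edits = sum(edit_distance(ref, hyp) for hyp, ref in pairs)
    total_chars = sum(len(ref) for _, ref in pairs)
    return total_edits / total_chars if total_chars else 0.0


# Hypothetical (whisper_output, correct_label) pairs, invented for illustration.
pairs = [
    ("今日は元気がいいですね", "今日は天気がいいですね"),
    ("ありがとうございます", "ありがとうございます"),
]
print(f"CER: {character_error_rate(pairs):.3f}")
```

The same pairs could also serve as input/target examples for training a text correction model; CER before and after correction would then give a simple measure of improvement.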

Provider:

swdq
