noxwano/ASMR-Archive-Processed-SFW
Source: Hugging Face. Updated 2026-04-11. Indexed 2026-04-26.
Download link:
https://hf-mirror.com/datasets/noxwano/ASMR-Archive-Processed-SFW
Description:
---
license: agpl-3.0
task_categories:
- automatic-speech-recognition
- text-to-speech
language:
- ja
tags:
- speech
- audio
- japanese
- asmr
- anime
- voice
pretty_name: ASMR-Archive-Processed-SFW
size_categories:
- 1M<n<10M
---
# ASMR-Archive-Processed-SFW
## Overview
This dataset is an “educational” subset of the original [OmniAICreator/ASMR-Archive-Processed](https://huggingface.co/datasets/OmniAICreator/ASMR-Archive-Processed) dataset.
We filtered the original dataset to include only records where the `nsfw` metadata flag is `false`.
To randomize the order of entries and keep them anonymous, records from multiple directories were combined and shuffled.
> The `nsfw` tag in the original dataset is inherited from the tags of the original audio works before they were passed through the pipeline. **Therefore, this subset does not exhaustively cover all inherently SFW entries included in the original data.**
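The filtering and shuffling described above can be illustrated with a minimal sketch. This is not the script used to build the dataset: the record layout, the `id` field, the `build_sfw_subset` helper, and the choice to drop records that lack an `nsfw` flag are all assumptions made for illustration. Only the `nsfw` field name comes from the description above.

```python
import random
from typing import Dict, Iterable, List


def build_sfw_subset(directories: Iterable[List[Dict]], seed: int = 0) -> List[Dict]:
    """Pool records from several directories, keep those flagged nsfw == False, and shuffle.

    Hypothetical helper: the real pipeline and metadata schema may differ.
    """
    pooled = [
        record
        for records in directories
        for record in records
        # Assumption: a record with a missing `nsfw` flag is excluded, not kept.
        if record.get("nsfw") is False
    ]
    # A seeded generator makes the shuffle reproducible for a given seed.
    random.Random(seed).shuffle(pooled)
    return pooled


# Hypothetical metadata records standing in for two source directories.
dir_a = [{"id": "a1", "nsfw": False}, {"id": "a2", "nsfw": True}]
dir_b = [{"id": "b1", "nsfw": False}, {"id": "b2"}]

subset = build_sfw_subset([dir_a, dir_b], seed=42)
print(sorted(record["id"] for record in subset))
```

Because the flag is inherited from the tags of the original works, a filter of this kind only removes entries that were explicitly tagged; it says nothing about the content of the individual audio segments.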
## Dataset Contents & Preprocessing
For detailed information regarding the specific contents of the data and the original preprocessing pipelines, please refer to the original [OmniAICreator/ASMR-Archive-Processed](https://huggingface.co/datasets/OmniAICreator/ASMR-Archive-Processed) dataset.
## Biases and Limitations
Users should be aware of the following limitations inherited from the original dataset:
* **Gender Bias**: Due to the nature of the source material, the voices are heavily skewed towards female speakers.
* **Audio Artifacts**: Some segments might still contain overlapping speakers or residual ASMR sound effects despite the vocal isolation process.
* **Transcription Inaccuracies**: The text transcriptions are entirely AI-generated and lack manual verification; therefore, they may contain errors.
## License & Usage
This dataset inherits the **AGPL-3.0 license** from the source dataset.
**Intended Use**: This dataset is intended strictly for educational and academic research purposes.
**Disclaimer**: Use is at your own risk. You must ensure compliance with applicable laws. The dataset is provided "as is" with absolutely no express or implied warranty.
Provider:
noxwano



