five

noxwano/ASMR-Archive-Processed-SFW

收藏
Hugging Face2026-04-11 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/noxwano/ASMR-Archive-Processed-SFW
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: agpl-3.0 task_categories: - automatic-speech-recognition - text-to-speech language: - ja tags: - speech - audio - japanese - asmr - anime - voice pretty_name: ASMR-Archive-Processed-SFW size_categories: - 1M<n<10M --- # ASMR-Archive-Processed-SFW ## Overview This dataset is an “educational” subset of the original [OmniAICreator/ASMR-Archive-Processed](https://huggingface.co/datasets/OmniAICreator/ASMR-Archive-Processed) dataset. We filtered the original dataset to include only records where the `nsfw` metadata flag is `false`. To maintain the randomness and anonymity of the entries, multiple directories were combined and shuffled. > The `nsfw` tag in the original dataset is inherited from the tags of the original audio works before they were passed through the pipeline. **Therefore, this subset does not exhaustively cover all inherently SFW entries included in the original data.** ## Dataset Contents & Preprocessing For detailed information regarding the specific contents of the data and the original preprocessing pipelines, please refer to the original [OmniAICreator/ASMR-Archive-Processed](https://huggingface.co/datasets/OmniAICreator/ASMR-Archive-Processed) dataset. ## Biases and Limitations Users should be aware of the following limitations inherited from the original dataset: * **Gender Bias**: Due to the nature of the source material, the voices are heavily skewed towards females. * **Audio Artifacts**: Some segments might still contain overlapping speakers or residual ASMR sound effects despite the vocal isolation process. * **Transcription Inaccuracies**: The text transcriptions are entirely AI-generated and lack manual verification; therefore, they may contain errors. ## License & Usage This dataset inherits the **AGPL-3.0 license** from the source datasets. **Intended Use**: This dataset is intended strictly for educational and academic research purposes. **Disclaimer**: Use is at your own risk. You must ensure compliance with applicable laws. The dataset is provided "as is" with absolutely no express or implied warranty.
提供机构:
noxwano
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作