five

richiejp/dns5-16k

收藏
Hugging Face2026-03-22 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/richiejp/dns5-16k
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - audio-classification tags: - speech - noise - room-impulse-response - acoustic-echo-cancellation - dns-challenge pretty_name: DNS5 16kHz (resampled) --- # DNS5 16kHz Resampled subset of the [ICASSP 2022 DNS Challenge](https://github.com/microsoft/DNS-Challenge) dataset. All audio files resampled from 48kHz to **16kHz** and stored as **FLAC** (lossless compression), packed into tar shards. ## Structure ``` clean/shard_0000.tar # Clean speech (VCTK and other corpora) clean/shard_0001.tar ... noise/shard_0000.tar # Environmental noise (AudioSet, Freesound) ... impulse_responses/shard_0000.tar # Room impulse responses ... ``` Each tar contains FLAC files with their original directory structure preserved. ## Usage ```python from huggingface_hub import snapshot_download import tarfile from pathlib import Path # Download local = snapshot_download("richiejp/dns5-16k", local_dir="/data/dns5", repo_type="dataset") # Extract all shards for tar_path in sorted(Path(local).rglob("*.tar")): with tarfile.open(tar_path) as tf: tf.extractall(tar_path.parent) ``` ## Source Original data from Microsoft's DNS Challenge: - https://github.com/microsoft/DNS-Challenge - License: CC-BY-4.0 (see original repo for details)
提供机构:
richiejp
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作