richiejp/dns5-16k
Hugging Face · Updated 2026-03-22 · Indexed 2026-03-29
Download link:
https://hf-mirror.com/datasets/richiejp/dns5-16k
Description:
---
license: cc-by-4.0
task_categories:
- audio-classification
tags:
- speech
- noise
- room-impulse-response
- acoustic-echo-cancellation
- dns-challenge
pretty_name: DNS5 16kHz (resampled)
---
# DNS5 16kHz
Resampled subset of the [ICASSP 2022 DNS Challenge](https://github.com/microsoft/DNS-Challenge) dataset.
All audio files are resampled from 48kHz to **16kHz**, stored as **FLAC** (lossless compression),
and packed into tar shards.
## Structure
```
clean/shard_0000.tar # Clean speech (VCTK and other corpora)
clean/shard_0001.tar
...
noise/shard_0000.tar # Environmental noise (AudioSet, Freesound)
...
impulse_responses/shard_0000.tar # Room impulse responses
...
```
Each tar contains FLAC files with their original directory structure preserved.
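To check what a shard holds before unpacking it, the member list can be read straight from the tar. The following is a minimal sketch using only the standard library; `list_flac_members` is a hypothetical helper name, not part of this dataset or of any library, and it assumes the audio files carry a `.flac` extension.

```python
import tarfile


def list_flac_members(tar_path):
    """Return the sorted paths of FLAC files inside one tar shard, without extracting."""
    with tarfile.open(tar_path) as tf:
        return sorted(
            member.name
            for member in tf.getmembers()
            # Skip directories and any non-audio entries.
            if member.isfile() and member.name.lower().endswith(".flac")
        )
```

For example, `list_flac_members("/data/dns5/clean/shard_0000.tar")` would return the relative paths stored in that shard, including whatever directory structure was preserved.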
## Usage
```python
from pathlib import Path
import tarfile

from huggingface_hub import snapshot_download

# Download the full dataset repository into a local directory
local = snapshot_download("richiejp/dns5-16k", local_dir="/data/dns5", repo_type="dataset")

# Extract every shard next to its tar file
for tar_path in sorted(Path(local).rglob("*.tar")):
    with tarfile.open(tar_path) as tf:
        tf.extractall(tar_path.parent)
```
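If only one part of the dataset is needed (for example just the noise shards), the download can probably be narrowed with the `allow_patterns` argument of `snapshot_download`. The sketch below assumes the shards sit directly under the three top-level directories listed in the Structure section; `shard_patterns` and `download_subset` are hypothetical helper names introduced here for illustration.

```python
# Top-level directories as listed in the Structure section (assumed complete).
CATEGORIES = ("clean", "noise", "impulse_responses")


def shard_patterns(categories):
    """Build glob patterns matching only the tar shards of the given categories."""
    unknown = set(categories) - set(CATEGORIES)
    if unknown:
        raise ValueError(f"unknown categories: {sorted(unknown)}")
    return [f"{name}/*.tar" for name in categories]


def download_subset(categories, local_dir):
    """Download only the selected categories and return the local path."""
    # Imported lazily so shard_patterns() works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        "richiejp/dns5-16k",
        repo_type="dataset",
        local_dir=local_dir,
        allow_patterns=shard_patterns(categories),
    )
```

Calling `download_subset(["noise", "impulse_responses"], "/data/dns5")` would then skip the clean-speech shards; the extraction loop above works unchanged on the partial download.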
## Source
Original data from Microsoft's DNS Challenge:
- https://github.com/microsoft/DNS-Challenge
- License: CC-BY-4.0 (see original repo for details)
Provider:
richiejp



