freesound-laion-640k
Hosted on the ModelScope community. Updated 2025-12-05; indexed 2025-03-22.
Download link: https://modelscope.cn/datasets/benjamin-paine/freesound-laion-640k
Description:
# About this Repository
This repository is a re-upload of [the FreeSound.org dataset](https://huggingface.co/datasets/Meranti/CLAP_freesound) as curated by LAION for [the larger LAION-Audio-630k dataset](https://github.com/LAION-AI/audio-dataset/blob/main/laion-audio-630k/), with the following changes:
1. Limited columns to only the audio and basic metadata.
2. Incorporated necessary information for licensing and attribution.
3. Removed ambiguously licensed samples, amounting to around 1,000 total samples.
## What about download links?
Links were omitted to save space, as they can be constructed from the data already present. To reconstruct a link, use the following format:
`https://freesound.org/people/{username}/sound/{id}`
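As a minimal sketch, the format above can be filled in with a small Python helper. The function name `freesound_url` is hypothetical, and it assumes each record exposes the uploader's name and the sound's numeric ID under fields matching the `{username}` and `{id}` placeholders. Percent-encoding the username is also an assumption, added as a precaution for names containing spaces or other special characters.

```python
from typing import Union
from urllib.parse import quote

# URL format taken from the README text above.
FREESOUND_URL_TEMPLATE = "https://freesound.org/people/{username}/sound/{id}"


def freesound_url(username: str, sound_id: Union[int, str]) -> str:
    """Build the FreeSound page URL for one sample (hypothetical helper)."""
    # Assumption: percent-encode the username so that spaces or other
    # special characters do not break the URL path.
    safe_username = quote(str(username), safe="")
    return FREESOUND_URL_TEMPLATE.format(username=safe_username, id=sound_id)


# Illustrative values only; these are not real records from the dataset.
print(freesound_url("example_user", 12345))
# → https://freesound.org/people/example_user/sound/12345
```

For a whole table of records, the same helper could be applied row by row, which avoids storing a redundant link column.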
# About this Dataset
> LAION-Audio-630K is a large-scale audio-text dataset consisting of 633,526 pairs with the total duration of 4,325.39 hours. It contains audios of human activities, natural sounds and audio effects, consisting of 8 data sources (see the data source table below) from publicly available websites. We collect these datasets by downloading audios and relevant text descriptions. Based on our current knowledge, LAION-Audio-630K is the largest audio-text dataset publicly available and a magnitude larger than previous audio-text datasets (by 2022-11-05).
>
> [LAION-AI, github.com](https://github.com/LAION-AI/audio-dataset/blob/main/laion-audio-630k/)
## Acknowledgment
The entire collection process, as well as all usage of LAION-Audio-630K, is conducted by LAION, a German non-profit, pure-research organization. All contributors and collectors of the dataset are considered open-source contributors affiliated with LAION. These community contributors (Discord IDs) include, but are not limited to: @marianna13#7139, @Chr0my#0173, @PiEquals4#1909, @Yuchen Hui#8574, @Antoniooooo#4758, @IYWO#9072, krishna#1648, @dicknascarsixtynine#3885, and @turian#1607. We would like to thank all of them for their efforts on the LAION-Audio-630k dataset.
## License
- LAION dataset metadata is released under [The MIT License.](https://mit-license.org/)
- Audio is released under one of six licenses:
| License | URL |
| ------- | --- |
| CC0-1.0 | https://creativecommons.org/publicdomain/zero/1.0/ |
| CC-BY-NC 4.0 | https://creativecommons.org/licenses/by-nc/4.0/ |
| CC-BY-NC 3.0 | https://creativecommons.org/licenses/by-nc/3.0/ |
| CC-BY 4.0 | https://creativecommons.org/licenses/by/4.0/ |
| CC-BY 3.0 | https://creativecommons.org/licenses/by/3.0/ |
| CC-Sampling+ | https://creativecommons.org/licenses/sampling+/1.0/ |
**Please read the entirety of these licenses before deciding if you can use the audio for your project.** Two important caveats of each license, whether the piece requires attribution and whether the piece can be used in commercial works, are included in the dataset itself to help inform these decisions.
Provider: maas
Created: 2025-03-18



