SUNO-XX1
Source: ModelScope community. Updated 2025-12-05; indexed 2025-07-26.
Download link:
https://modelscope.cn/datasets/sleeping-ai/SUNO-XX1
Description:
<h1 align='center'>SUNO XX1</h1>
<h3 align='center'>You can find the raw database here</h3>
Introducing SUNO XX1, a dataset we built in response to community requests for the entire SUNO catalogue and for a more comprehensive 1:1 mirror of the SUNO site's database. It expands on SUNO-1M, which we released earlier.
We are releasing an estimated 2M songs as a 1:1 mirror of the SUNO database.
*This repo will be updated continuously over the next two weeks.*
### LICENCE
We release this dataset under CC-BY-NC-ND 4.0 and do not permit the following:
1. Sharing the dataset from personal storage.
2. Any commercial use, any derivative of this dataset, or any attempt to re-upload it.
Please follow these rules so that we can keep releasing research datasets legally and push state-of-the-art research forward.
# Data Field Descriptions
- ``id``: Unique identifier for the song entry.
- ``user_id``: Unique identifier for the user who created the song.
- ``audio_url``: URL to the song's audio file.
- ``video_url``: URL to the song's video file.
- ``allow_comments``: Boolean indicating if comments are allowed.
- ``artist_clip_id``: Identifier for the artist's clip, if applicable.
- ``avatar_image_url``: URL to the user's avatar image.
- ``can_publish_with_vocal``: Boolean indicating if publishing with vocals is allowed.
- ``can_remix``: Boolean indicating if the song can be remixed.
- ``caption``: Optional caption for the song.
- ``comment_count``: Number of comments on the song.
- ``concat_history``: History of concatenated clips, if any.
- ``continue_at``: Timestamp for resuming song creation.
- ``cover_clip_id``: Identifier for the cover clip, if applicable.
- ``created_at``: Timestamp of when the song was created.
- ``display_name``: User's display name.
- ``display_tags``: Tags displayed for the song.
- ``duration``: Duration of the song in seconds.
- ``edit_session_id``: Identifier for the editing session.
- ``edited_clip_id``: Identifier for the edited clip, if applicable.
- ``entity_type``: Type of entity (e.g., song_schema).
- ``explicit``: Boolean indicating if the song contains explicit content.
- ``flag_count``: Number of flags reported on the song.
- ``from_index``: Starting index for pagination, if applicable.
- ``handle``: User's unique handle.
- ``has_vocal``: Boolean indicating if the song includes vocals.
- ``image_large_url``: URL to the large version of the song's image.
- ``image_s3_id``: S3 identifier for the song's image.
- ``image_url``: URL to the song's image.
- ``include_future_s``: Future settings inclusion flag.
- ``include_history_s``: History settings inclusion flag.
- ``infill``: Infill data for song generation.
- ``infill_context_end_s``: End time for infill context.
- ``infill_context_start_s``: Start time for infill context.
- ``infill_dur_s``: Duration of infill segment.
- ``infill_end_s``: End time for infill segment.
- ``infill_start_s``: Start time for infill segment.
- ``is_contest_clip``: Boolean indicating if the song is part of a contest.
- ``is_handle_updated``: Boolean indicating if the handle has been updated.
- ``is_liked``: Boolean indicating if the song is liked by the user.
- ``is_owned``: Boolean indicating if the song is owned by the user.
- ``is_public``: Boolean indicating if the song is publicly accessible.
- ``is_remix``: Boolean indicating if the song is a remix.
- ``is_trashed``: Boolean indicating if the song is marked as trashed.
- ``major_model_version``: Major version of the model used to generate the song.
- ``metadata``: Additional metadata about the song (e.g., tags, prompt).
- ``model_name``: Name of the model used for song generation.
- ``name``: Optional name of the song.
- ``negative_tags``: Tags indicating undesirable attributes.
- ``page_size``: Size of pagination page, if applicable.
- ``persona``: Persona associated with the song.
- ``persona_id``: Identifier for the persona.
- ``play_count``: Number of times the song has been played.
- ``preview_url``: URL to the song's preview.
- ``priority``: Priority level for song generation.
- ``prompt``: Text prompt used for song generation.
- ``public_songhot``: Indicator of song popularity.
- ``refund_credits``: Boolean indicating if credits were refunded.
- ``result``: Result data from song generation.
- ``root_clip_id``: Identifier for the root clip, if applicable.
- ``source``: Source of the song.
- ``status``: Current status of the song (e.g., complete).
- ``stream``: Boolean indicating if the song is streamable.
- ``tags``: Tags associated with the song.
- ``task``: Task associated with song generation.
- ``title``: Title of the song.
- ``total_hits``: Total number of hits/views for the song.
- ``type``: Type of song or generation.
- ``upsample_clip_id``: Identifier for the upsampled clip, if applicable.
- ``upvote_count``: Number of upvotes the song has received.
- ``user_display_name``: Display name of the user.
- ``user_handle``: Handle of the user.
- ``user_image_url``: URL to the user's image.
- ``video_cover_url``: URL to the video cover image.
- ``video_upload_height``: Height of the uploaded video.
- ``video_upload_width``: Width of the uploaded video.
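
As a rough illustration of how the fields above might be used, the sketch below filters records down to public, completed songs that have an audio URL and a minimum duration. The storage format is not stated in this description, so the JSON Lines layout, the sample values, and the helper names `iter_records` and `is_usable` are all assumptions made for this example; only the field names come from the list above.

```python
import json
from typing import Any, Dict, Iterable, Iterator

# Hypothetical sample records: the values are invented for illustration,
# and the one-JSON-object-per-line layout is an assumption.
SAMPLE_JSONL = """\
{"id": "a1", "title": "Demo One", "is_public": true, "status": "complete", "duration": 92.5, "audio_url": "https://example.com/a1.mp3"}
{"id": "b2", "title": "Demo Two", "is_public": false, "status": "complete", "duration": 130.0, "audio_url": "https://example.com/b2.mp3"}
{"id": "c3", "title": "Demo Three", "is_public": true, "status": "complete", "duration": null, "audio_url": null}
"""


def iter_records(lines: Iterable[str]) -> Iterator[Dict[str, Any]]:
    """Parse one JSON object per non-empty line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def is_usable(record: Dict[str, Any], min_duration: float = 0.0) -> bool:
    """Keep public, completed songs with an audio URL and a known duration."""
    duration = record.get("duration")
    return (
        record.get("is_public") is True
        and record.get("status") == "complete"
        and bool(record.get("audio_url"))
        and isinstance(duration, (int, float))
        and duration >= min_duration
    )


usable = [
    r for r in iter_records(SAMPLE_JSONL.splitlines())
    if is_usable(r, min_duration=60.0)
]
print([r["id"] for r in usable])  # prints ['a1']
```

Using `record.get(...)` rather than direct indexing is a defensive choice here: several fields are described as optional or "if applicable", so they may be missing or null in real records.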
Provider:
maas
Created:
2025-07-19



