five

SUNO-XX1

收藏
魔搭社区2025-12-05 更新2025-07-26 收录
下载链接:
https://modelscope.cn/datasets/sleeping-ai/SUNO-XX1
下载链接
链接失效反馈
官方服务:
资源简介:
<h1 align='center'>SUNO XX1</h1> <h3 align='center'>You can find the raw database here</h3> Introducing SUNO XX1, a dataset we constructed after community request to receive the entire SUNO load and having a better and comprehensive 1:1 mirror database of the SUNO site. It is an expansion on-top of SUNO-1M we released earlier. We are releasing estimated 2M songs and a 1:1 mirror of SUNO database. *this repo will be constantly uploaded over the course of next 2 weeks.* ### LICENCE We release this under CC-by-nc-nd 4.0, preventing people from below acts: 1. Sharing the dataset from personal storage 2. We don't allow any commercial use + any derivative of this dataset + any re-upload attempt. Please, follow this rules for making sure we can release more research datasets legally and push state-of-art research. # Data Field Descriptions - ``id``: Unique identifier for the song entry. - ``user_id``: Unique identifier for the user who created the song. - ``audio_url``: URL to the song's audio file. - ``video_url``: URL to the song's video file. - ``allow_comments``: Boolean indicating if comments are allowed. - ``artist_clip_id``: Identifier for the artist's clip, if applicable. - ``avatar_image_url``: URL to the user's avatar image. - ``can_publish_with_vocal``: Boolean indicating if publishing with vocals is allowed. - ``can_remix``: Boolean indicating if the song can be remixed. - ``caption``: Optional caption for the song. - ``comment_count``: Number of comments on the song. - ``concat_history``: History of concatenated clips, if any. - ``continue_at``: Timestamp for resuming song creation. - ``cover_clip_id``: Identifier for the cover clip, if applicable. - ``created_at``: Timestamp of when the song was created. - ``display_name``: User's display name. - ``display_tags``: Tags displayed for the song. - ``duration``: Duration of the song in seconds. - ``edit_session_id``: Identifier for the editing session. - ``edited_clip_id``: Identifier for the edited clip, if applicable. - ``entity_type``: Type of entity (e.g., song_schema). - ``explicit``: Boolean indicating if the song contains explicit content. - ``flag_count``: Number of flags reported on the song. - ``from_index``: Starting index for pagination, if applicable. - ``handle``: User's unique handle. - ``has_vocal``: Boolean indicating if the song includes vocals. - ``image_large_url``: URL to the large version of the song's image. - ``image_s3_id``: S3 identifier for the song's image. - ``image_url``: URL to the song's image. - ``include_future_s``: Future settings inclusion flag. - ``include_history_s``: History settings inclusion flag. - ``infill``: Infill data for song generation. - ``infill_context_end_s``: End time for infill context. - ``infill_context_start_s``: Start time for infill context. - ``infill_dur_s``: Duration of infill segment. - ``infill_end_s``: End time for infill segment. - ``infill_start_s``: Start time for infill segment. - ``is_contest_clip``: Boolean indicating if the song is part of a contest. - ``is_handle_updated``: Boolean indicating if the handle has been updated. - ``is_liked``: Boolean indicating if the song is liked by the user. - ``is_owned``: Boolean indicating if the song is owned by the user. - ``is_public``: Boolean indicating if the song is publicly accessible. - ``is_remix``: Boolean indicating if the song is a remix. - ``is_trashed``: Boolean indicating if the song is marked as trashed. - ``major_model_version``: Major version of the model used to generate the song. - ``metadata``: Additional metadata about the song (e.g., tags, prompt). - ``model_name``: Name of the model used for song generation. - ``name``: Optional name of the song. - ``negative_tags``: Tags indicating undesirable attributes. - ``page_size``: Size of pagination page, if applicable. - ``persona``: Persona associated with the song. - ``persona_id``: Identifier for the persona. - ``play_count``: Number of times the song has been played. - ``preview_url``: URL to the song's preview. - ``priority``: Priority level for song generation. - ``prompt``: Text prompt used for song generation. - ``public_songhot``: Indicator of song popularity. - ``refund_credits``: Boolean indicating if credits were refunded. - ``result``: Result data from song generation. - ``root_clip_id``: Identifier for the root clip, if applicable. - ``source``: Source of the song. - ``status``: Current status of the song (e.g., complete). - ``stream``: Boolean indicating if the song is streamable. - ``tags``: Tags associated with the song. - ``task``: Task associated with song generation. - ``title``: Title of the song. - ``total_hits``: Total number of hits/views for the song. - ``type``: Type of song or generation. - ``upsample_clip_id``: Identifier for the upsampled clip, if applicable. - ``upvote_count``: Number of upvotes the song has received. - ``user_display_name``: Display name of the user. - ``user_handle``: Handle of the user. - ``user_image_url``: URL to the user's image. - ``video_cover_url``: URL to the video cover image. - ``video_upload_height``: Height of the uploaded video. - ``video_upload_width``: Width of the uploaded video.

# 居中展示:SUNO XX1 ## 居中展示:您可在此获取原始数据集 本次推出SUNO XX1数据集,本数据集是应社区需求打造而成,旨在收录完整的SUNO平台内容,并构建与SUNO官网完全一致的1:1镜像数据库。本数据集是此前发布的SUNO-1M的扩展版本。 本次将发布约200万首歌曲及SUNO数据库的1:1完整镜像。 *本仓库将在未来两周内持续完成上传工作。* ### 授权协议 本数据集采用CC-by-nc-nd 4.0协议发布,禁止以下行为: 1. 从个人存储渠道分享本数据集 2. 禁止任何商业使用、基于本数据集的衍生创作,以及二次上传行为。 请遵守本协议条款,以便我们能够合法发布更多研究数据集,推动前沿学术研究的发展。 # 数据字段说明 - ``id``:歌曲条目的唯一标识符 - ``user_id``:创建该歌曲的用户的唯一标识符 - ``audio_url``:歌曲音频文件的下载链接 - ``video_url``:歌曲视频文件的下载链接 - ``allow_comments``:布尔值,用于标识是否允许对该歌曲进行评论 - ``artist_clip_id``:艺术家剪辑的标识符(如适用) - ``avatar_image_url``:用户头像图片的链接 - ``can_publish_with_vocal``:布尔值,用于标识是否允许发布带人声的版本 - ``can_remix``:布尔值,用于标识该歌曲是否可被二次创作(混音) - ``caption``:歌曲的可选说明文字 - ``comment_count``:该歌曲的评论总数 - ``concat_history``:拼接剪辑的历史记录(如适用) - ``continue_at``:歌曲创作的恢复时间戳 - ``cover_clip_id``:封面剪辑的标识符(如适用) - ``created_at``:歌曲创建的时间戳 - ``display_name``:用户的展示名称 - ``display_tags``:该歌曲展示所用的标签 - ``duration``:歌曲时长,单位为秒 - ``edit_session_id``:编辑会话的标识符 - ``edited_clip_id``:已编辑剪辑的标识符(如适用) - ``entity_type``:实体类型(例如:song_schema) - ``explicit``:布尔值,用于标识该歌曲是否包含敏感/低俗内容 - ``flag_count``:该歌曲被举报的总次数 - ``from_index``:分页起始索引(如适用) - ``handle``:用户的唯一用户名(Handle) - ``has_vocal``:布尔值,用于标识该歌曲是否包含人声 - ``image_large_url``:歌曲高清图片的链接 - ``image_s3_id``:歌曲图片的S3标识符 - ``image_url``:歌曲图片的链接 - ``include_future_s``:未来设置的包含标识 - ``include_history_s``:历史设置的包含标识 - ``infill``:歌曲生成的补全数据 - ``infill_context_end_s``:补全上下文的结束时间点 - ``infill_context_start_s``:补全上下文的起始时间点 - ``infill_dur_s``:补全片段的时长 - ``infill_end_s``:补全片段的结束时间点 - ``infill_start_s``:补全片段的起始时间点 - ``is_contest_clip``:布尔值,用于标识该歌曲是否属于赛事内容 - ``is_handle_updated``:布尔值,用于标识用户的Handle是否已更新 - ``is_liked``:布尔值,用于标识当前用户是否点赞了该歌曲 - ``is_owned``:布尔值,用于标识该歌曲是否归当前用户所有 - ``is_public``:布尔值,用于标识该歌曲是否可公开访问 - ``is_remix``:布尔值,用于标识该歌曲是否为混音作品 - ``is_trashed``:布尔值,用于标识该歌曲是否已被标记为删除 - ``major_model_version``:用于生成该歌曲的模型的主版本号 - ``metadata``:歌曲的附加元数据(例如:标签、提示词) - ``model_name``:用于生成该歌曲的模型名称 - ``name``:歌曲的可选名称 - ``negative_tags``:标识不良属性的标签 - ``page_size``:分页页面的大小(如适用) - ``persona``:与该歌曲关联的角色人设 - ``persona_id``:角色人设的标识符 - ``play_count``:该歌曲的播放次数 - ``preview_url``:歌曲预览文件的链接 - ``priority``:歌曲生成的优先级等级 - ``prompt``:用于生成该歌曲的文本提示词 - ``public_songhot``:歌曲热度的标识指标 - ``refund_credits``:布尔值,用于标识是否已退还创作积分 - ``result``:歌曲生成的结果数据 - ``root_clip_id``:根剪辑的标识符(如适用) - ``source``:该歌曲的来源 - ``status``:该歌曲的当前状态(例如:已完成) - ``stream``:布尔值,用于标识该歌曲是否可流式播放 - ``tags``:与该歌曲关联的标签 - ``task``:与歌曲生成相关的任务 - ``title``:该歌曲的标题 - ``total_hits``:该歌曲的总访问/浏览量 - ``type``:歌曲或生成任务的类型 - ``upsample_clip_id``:超采样剪辑的标识符(如适用) - ``upvote_count``:该歌曲获得的点赞数 - ``user_display_name``:用户的展示名称 - ``user_handle``:用户的唯一用户名(Handle) - ``user_image_url``:用户图片的链接 - ``video_cover_url``:视频封面图片的链接 - ``video_upload_height``:上传视频的高度 - ``video_upload_width``:上传视频的宽度
提供机构:
maas
创建时间:
2025-07-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作