Rapidata/text-2-video-human-preferences-sora-2-pro

Name: Rapidata/text-2-video-human-preferences-sora-2-pro
Creator: Rapidata
Published: 2025-10-30 15:26:25
License: 暂无描述

Hugging Face2025-10-30 更新2026-02-07 收录

下载链接：

https://hf-mirror.com/datasets/Rapidata/text-2-video-human-preferences-sora-2-pro

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: prompt dtype: string - name: video1 dtype: string - name: video2 dtype: string - name: weighted_results1_Alignment dtype: float64 - name: weighted_results2_Alignment dtype: float64 - name: detailedResults_Alignment list: - name: userDetails struct: - name: age dtype: string - name: country dtype: string - name: gender dtype: string - name: language dtype: string - name: occupation dtype: string - name: userScores struct: - name: global dtype: float64 - name: votedFor dtype: string - name: weighted_results1_Coherence dtype: float64 - name: weighted_results2_Coherence dtype: float64 - name: detailedResults_Coherence list: - name: userDetails struct: - name: age dtype: string - name: country dtype: string - name: gender dtype: string - name: language dtype: string - name: occupation dtype: string - name: userScores struct: - name: global dtype: float64 - name: votedFor dtype: string - name: weighted_results1_Preference dtype: float64 - name: weighted_results2_Preference dtype: float64 - name: detailedResults_Preference list: - name: userDetails struct: - name: age dtype: string - name: country dtype: string - name: gender dtype: string - name: language dtype: string - name: occupation dtype: string - name: userScores struct: - name: global dtype: float64 - name: votedFor dtype: string - name: file_name1 dtype: string - name: file_name2 dtype: string - name: model1 dtype: string - name: model2 dtype: string splits: - name: train num_bytes: 6495323 num_examples: 1485 download_size: 632907 dataset_size: 6495323 configs: - config_name: default data_files: - split: train path: data/train-* license: apache-2.0 task_categories: - video-classification - text-to-video - text-classification language: - en tags: - videos - t2v - text-2-video - text2video - text-to-video - human - annotations - preferences - likert - coherence - alignment - wan - wan 2.1 - veo2 - veo - pikka - alpha - sora - hunyuan - veo3 - mochi-1 - seedance-1-pro - seedance - seedance 1 - Marey - moonvalley - sora2 - openai - sora2 pro pretty_name: Sora 2 Pro Human Preferences size_categories: - 1K<n<10K --- <style> .vertical-container { display: flex; flex-direction: column; gap: 60px; } .image-container img { height: 150px; /* Set the desired height */ margin:0; object-fit: contain; /* Ensures the aspect ratio is maintained */ width: auto; /* Adjust width automatically based on height */ } .image-container { display: flex; /* Aligns images side by side */ justify-content: space-around; /* Space them evenly */ align-items: center; /* Align them vertically */ } .container { width: 90%; margin: 0 auto; } .text-center { text-align: center; } .score-amount { margin: 0; margin-top: 10px; } .score-percentage { font-size: 12px; font-weight: semi-bold; } </style> # Rapidata Video Generation Sora 2 Pro Human Preference <a href="https://www.rapidata.ai"> <img src="https://cdn-uploads.huggingface.co/production/uploads/66f5624c42b853e73e0738eb/jfxR79bOztqaC6_yNNnGU.jpeg" width="300" alt="Dataset visualization"> </a> <a href="https://huggingface.co/datasets/Rapidata/text-2-image-Rich-Human-Feedback"> </a> In this dataset, ~75k human responses from ~15k human annotators were collected to evaluate the Sora 2 Pro video generation model on our benchmark. This dataset was collected in roughtly 30 min using the [Rapidata Python API](https://docs.rapidata.ai), accessible to anyone and ideal for large scale data annotation. Explore our latest model rankings on our [website](https://www.rapidata.ai/benchmark). If you get value from this dataset and would like to see more in the future, please consider liking it ❤️ # Overview In this dataset, ~75k human responses from ~15k human annotators were collected to evaluate the Sora 2 Pro video generation model on our benchmark. This dataset was collected in roughtly 30 min using the [Rapidata Python API](https://docs.rapidata.ai), accessible to anyone and ideal for large scale data annotation. The benchmark data is accessible on [huggingface](https://huggingface.co/datasets/Rapidata/text-2-video-human-preferences) directly. # Explanation of the colums The dataset contains paired video comparisons. Each entry includes 'video1' and 'video2' fields, which contain links to downscaled GIFs for easy viewing. The full-resolution videos can be found [here](https://huggingface.co/datasets/Rapidata/text-2-video-human-preferences-moonvalley-marey/tree/main/Videos) The weighted_results column contains scores ranging from 0 to 1, representing aggregated user responses. Individual user responses can be found in the detailedResults column. # Alignment The alignment score quantifies how well an video matches its prompt. Users were asked: "Which video fits the description better?". ## Examples <div class="vertical-container"> <div class="container"> <div class="text-center"> <q>A bustling city marketplace from dawn to dusk, capturing vendors setting up, colorful goods being sold, diverse crowds interacting, lights illuminating as night falls, and the vibrant energy transitioning between day and night.</q> </div> <div class="image-container"> <div> <h3 class="score-amount">Sora2 Pro</h3> <div class="score-percentage">(Score: 100%)</div> <img style="border: 5px solid #18c54f;" src="https://assets.rapidata.ai/sora2-pro_9-10-25_0_0.gif" width=500> </div> <div> <h3 class="score-amount">Ray 2 </h3> <div class="score-percentage">(Score: 0%)</div> <img src="https://assets.rapidata.ai/ray2_0000_2.gif" width=500> </div> </div> </div> <div class="container"> <div class="text-center"> <q>A serene mermaid glides through vibrant coral reefs, sunlight filtering through the water. Fish of every color swim around her gracefully as she dances with the ocean currents.</q> </div> <div class="image-container"> <div> <h3 class="score-amount">Sora2 Pro</h3> <div class="score-percentage">(Score: 9.47%)</div> <img src="https://assets.rapidata.ai/sora2-pro_9-10-25_43_0.gif " width=500> </div> <div> <h3 class="score-amount">Veo 2 </h3> <div class="score-percentage">(Score: 90.53%)</div> <img style="border: 5px solid #18c54f;" src="https://assets.rapidata.ai/veo2_0043_0.gif" width=500> </div> </div> </div> </div> # Coherence The coherence score measures whether the generated video is logically consistent and free from artifacts or visual glitches. Without seeing the original prompt, users were asked: "Which video has more glitches and is more likely to be AI generated?" ## Examples <div class="vertical-container"> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Sora2 Pro</h3> <div class="score-percentage">(Glitch Rating: 0%)</div> <img style="border: 5px solid #18c54f;" src="https://assets.rapidata.ai/sora2-pro_9-10-25_75_0.gif" width="500" alt="Dataset visualization"> </div> <div> <h3 class="score-amount">Alpha </h3> <div class="score-percentage">(Glitch Rating: 100%)</div> <img src="https://assets.rapidata.ai/alpha_0075_359978639.gif " width="500" alt="Dataset visualization"> </div> </div> </div> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Sora2 Pro</h3> <div class="score-percentage">(Glitch Rating: 88.95%)</div> <img src="https://assets.rapidata.ai/sora2-pro_9-10-25_24_0.gif" width="500" alt="Dataset visualization"> </div> <div> <h3 class="score-amount">Marey </h3> <div class="score-percentage">(Glitch Rating: 11.05%)</div> <img style="border: 5px solid #18c54f;" src="https://assets.rapidata.ai/marey-11-8-25_24_0.gif" width="500" alt="Dataset visualization"> </div> </div> </div> </div> # Preference The preference score reflects how visually appealing participants found each video, independent of the prompt. Users were asked: "Which video do you prefer aesthetically?" ## Examples <div class="vertical-container"> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Sora2 Pro</h3> <div class="score-percentage">(Score: 94.11%)</div> <img style="border: 5px solid #18c54f;" src="https://assets.rapidata.ai/sora2-pro_9-10-25_39_0.gif" width="500" alt="Dataset visualization"> </div> <div> <h3 class="score-amount">Pika 2.2 </h3> <div class="score-percentage">(Score: 5.89%)</div> <img src="https://assets.rapidata.ai/pika2.2_0039_1.gif" width="500" alt="Dataset visualization"> </div> </div> </div> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Sora2 Pro </h3> <div class="score-percentage">(Score: 0%)</div> <img src="https://assets.rapidata.ai/sora2-pro_9-10-25_64_0.gif" width="500" alt="Dataset visualization"> </div> <div> <h3 class="score-amount">Veo 3 </h3> <div class="score-percentage">(Score: 100%)</div> <img style="border: 5px solid #18c54f;" src="https://assets.rapidata.ai/veo3_0064_0.gif" width="500" alt="Dataset visualization"> </div> </div> </div> </div> </br> # About Rapidata Rapidata's technology makes collecting human feedback at scale faster and more accessible than ever before. Visit [rapidata.ai](https://www.rapidata.ai/) to learn more about how we're revolutionizing human feedback collection for AI development. # Other Datasets We run a benchmark of the major video generation models, the results can be found on our [website](https://www.rapidata.ai/leaderboard/video-models). We rank the models according to their coherence/plausiblity, their aligment with the given prompt and style prefernce. The underlying 2M+ annotations can be found here: - Link to the [Rich Video Annotation dataset](https://huggingface.co/datasets/Rapidata/text-2-video-Rich-Human-Feedback) - Link to the [Coherence dataset](https://huggingface.co/datasets/Rapidata/Flux_SD3_MJ_Dalle_Human_Coherence_Dataset) - Link to the [Text-2-Image Alignment dataset](https://huggingface.co/datasets/Rapidata/Flux_SD3_MJ_Dalle_Human_Alignment_Dataset) - Link to the [Preference dataset](https://huggingface.co/datasets/Rapidata/700k_Human_Preference_Dataset_FLUX_SD3_MJ_DALLE3)

提供机构：

Rapidata

5,000+

优质数据集

54 个

任务类型

进入经典数据集