five

davidguzmanr/CulturalGround-dpo

收藏
Hugging Face2026-04-09 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/davidguzmanr/CulturalGround-dpo
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: all features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 1695821010 num_examples: 10500 download_size: 1695690527 dataset_size: 1695821010 - config_name: bangladesh features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 47244801 num_examples: 250 download_size: 47248283 dataset_size: 47244801 - config_name: brazil features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 39201231 num_examples: 250 download_size: 39204768 dataset_size: 39201231 - config_name: bulgaria features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 38339436 num_examples: 250 download_size: 38343913 dataset_size: 38339436 - config_name: china features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 43121790 num_examples: 250 download_size: 43125018 dataset_size: 43121790 - config_name: czechia features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 46773585 num_examples: 250 download_size: 46776684 dataset_size: 46773585 - config_name: egypt features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 55388176 num_examples: 250 download_size: 55393207 dataset_size: 55388176 - config_name: ethiopia features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 38586064 num_examples: 250 download_size: 38588981 dataset_size: 38586064 - config_name: france features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 41685407 num_examples: 250 download_size: 41687607 dataset_size: 41685407 - config_name: germany features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 41746228 num_examples: 250 download_size: 41748589 dataset_size: 41746228 - config_name: greece features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 38280555 num_examples: 250 download_size: 38282686 dataset_size: 38280555 - config_name: india features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 40256688 num_examples: 250 download_size: 40259616 dataset_size: 40256688 - config_name: indonesia features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 37527831 num_examples: 250 download_size: 37532150 dataset_size: 37527831 - config_name: iran features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 34050722 num_examples: 250 download_size: 34053517 dataset_size: 34050722 - config_name: ireland features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 37074942 num_examples: 250 download_size: 37077998 dataset_size: 37074942 - config_name: israel features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 33247088 num_examples: 250 download_size: 33249036 dataset_size: 33247088 - config_name: italy features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 39367882 num_examples: 250 download_size: 39371435 dataset_size: 39367882 - config_name: japan features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 40937793 num_examples: 250 download_size: 40940667 dataset_size: 40937793 - config_name: kenya features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 38796918 num_examples: 250 download_size: 38800168 dataset_size: 38796918 - config_name: malaysia features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 42127035 num_examples: 250 download_size: 42130837 dataset_size: 42127035 - config_name: mexico features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 55789938 num_examples: 250 download_size: 55793720 dataset_size: 55789938 - config_name: mongolia features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 35996134 num_examples: 250 download_size: 35998024 dataset_size: 35996134 - config_name: netherlands features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 38460243 num_examples: 250 download_size: 38463576 dataset_size: 38460243 - config_name: nigeria features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 35785652 num_examples: 250 download_size: 35788419 dataset_size: 35785652 - config_name: norway features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 37137641 num_examples: 250 download_size: 37140567 dataset_size: 37137641 - config_name: pakistan features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 35584374 num_examples: 250 download_size: 35587874 dataset_size: 35584374 - config_name: poland features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 38685866 num_examples: 250 download_size: 38687905 dataset_size: 38685866 - config_name: portugal features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 39893995 num_examples: 250 download_size: 39899270 dataset_size: 39893995 - config_name: romania features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 35741388 num_examples: 250 download_size: 35743512 dataset_size: 35741388 - config_name: russia features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 46020851 num_examples: 250 download_size: 46024196 dataset_size: 46020851 - config_name: rwanda features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 53147178 num_examples: 250 download_size: 53151167 dataset_size: 53147178 - config_name: saudi_arabia features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 31025200 num_examples: 250 download_size: 31027707 dataset_size: 31025200 - config_name: singapore features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 56133593 num_examples: 250 download_size: 56137370 dataset_size: 56133593 - config_name: south_korea features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 34784583 num_examples: 250 download_size: 34786905 dataset_size: 34784583 - config_name: spain features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 39479791 num_examples: 250 download_size: 39483989 dataset_size: 39479791 - config_name: sri_lanka features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 38492378 num_examples: 250 download_size: 38495628 dataset_size: 38492378 - config_name: taiwan features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 40810930 num_examples: 250 download_size: 40813496 dataset_size: 40810930 - config_name: tanzania features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 37264677 num_examples: 250 download_size: 37268368 dataset_size: 37264677 - config_name: thailand features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 42631461 num_examples: 250 download_size: 42634710 dataset_size: 42631461 - config_name: turkey features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 34951187 num_examples: 250 download_size: 34953371 dataset_size: 34951187 - config_name: ukraine features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 41505272 num_examples: 250 download_size: 41507775 dataset_size: 41505272 - config_name: united_kingdom features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 36647728 num_examples: 250 download_size: 36650710 dataset_size: 36647728 - config_name: vietnam features: - name: prompt list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: chosen list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: rejected list: - name: content list: - name: text dtype: string - name: type dtype: string - name: role dtype: string - name: images list: image splits: - name: train num_bytes: 36096736 num_examples: 250 download_size: 36099887 dataset_size: 36096736 configs: - config_name: all data_files: - split: train path: all/train-* - config_name: bangladesh data_files: - split: train path: bangladesh/train-* - config_name: brazil data_files: - split: train path: brazil/train-* - config_name: bulgaria data_files: - split: train path: bulgaria/train-* - config_name: china data_files: - split: train path: china/train-* - config_name: czechia data_files: - split: train path: czechia/train-* - config_name: egypt data_files: - split: train path: egypt/train-* - config_name: ethiopia data_files: - split: train path: ethiopia/train-* - config_name: france data_files: - split: train path: france/train-* - config_name: germany data_files: - split: train path: germany/train-* - config_name: greece data_files: - split: train path: greece/train-* - config_name: india data_files: - split: train path: india/train-* - config_name: indonesia data_files: - split: train path: indonesia/train-* - config_name: iran data_files: - split: train path: iran/train-* - config_name: ireland data_files: - split: train path: ireland/train-* - config_name: israel data_files: - split: train path: israel/train-* - config_name: italy data_files: - split: train path: italy/train-* - config_name: japan data_files: - split: train path: japan/train-* - config_name: kenya data_files: - split: train path: kenya/train-* - config_name: malaysia data_files: - split: train path: malaysia/train-* - config_name: mexico data_files: - split: train path: mexico/train-* - config_name: mongolia data_files: - split: train path: mongolia/train-* - config_name: netherlands data_files: - split: train path: netherlands/train-* - config_name: nigeria data_files: - split: train path: nigeria/train-* - config_name: norway data_files: - split: train path: norway/train-* - config_name: pakistan data_files: - split: train path: pakistan/train-* - config_name: poland data_files: - split: train path: poland/train-* - config_name: portugal data_files: - split: train path: portugal/train-* - config_name: romania data_files: - split: train path: romania/train-* - config_name: russia data_files: - split: train path: russia/train-* - config_name: rwanda data_files: - split: train path: rwanda/train-* - config_name: saudi_arabia data_files: - split: train path: saudi_arabia/train-* - config_name: singapore data_files: - split: train path: singapore/train-* - config_name: south_korea data_files: - split: train path: south_korea/train-* - config_name: spain data_files: - split: train path: spain/train-* - config_name: sri_lanka data_files: - split: train path: sri_lanka/train-* - config_name: taiwan data_files: - split: train path: taiwan/train-* - config_name: tanzania data_files: - split: train path: tanzania/train-* - config_name: thailand data_files: - split: train path: thailand/train-* - config_name: turkey data_files: - split: train path: turkey/train-* - config_name: ukraine data_files: - split: train path: ukraine/train-* - config_name: united_kingdom data_files: - split: train path: united_kingdom/train-* - config_name: vietnam data_files: - split: train path: vietnam/train-* ---
提供机构:
davidguzmanr
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作