FineVisionShuffle
Source: ModelScope community. Updated: 2026-01-06. Indexed: 2025-09-13.
Download link:
https://modelscope.cn/datasets/moondream/FineVisionShuffle
Resource description:
# FineVision Filtered
A filtered version of the FineVision dataset. Samples containing Chinese, Japanese, Korean, Russian/Cyrillic, or Vietnamese text have been removed.
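
The card does not say how the language filter was implemented. As an illustration only, a script-based check along the following lines could flag such samples. The character ranges, the function names `contains_filtered_script` and `keep_sample`, and the idea of passing a sample's text fields as a list of strings are all assumptions made for this sketch, not the dataset authors' method.

```python
import re

# Hypothetical character classes: the dataset card does not document the real filter.
_FILTERED_SCRIPTS = re.compile(
    "["
    "\u4e00-\u9fff"  # CJK Unified Ideographs (Chinese, Japanese kanji)
    "\u3040-\u30ff"  # Hiragana and Katakana
    "\uac00-\ud7af"  # Hangul syllables
    "\u0400-\u04ff"  # Cyrillic
    "\u1ea0-\u1ef9"  # Latin Extended Additional letters used by Vietnamese
    "ăĂđĐơƠưƯ"       # Vietnamese base letters (some also occur in other languages)
    "]"
)


def contains_filtered_script(text: str) -> bool:
    """Return True if the text has at least one character from the ranges above."""
    return _FILTERED_SCRIPTS.search(text) is not None


def keep_sample(texts: list[str]) -> bool:
    """Keep a sample only if none of its text fields matches a filtered script."""
    return not any(contains_filtered_script(t) for t in texts)
```

A character-range check like this is cheap but coarse: it cannot tell Chinese from Japanese kanji, and a letter such as `ă` also appears in Romanian, so a real pipeline might pair it with a language-identification model.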
## Subsets
- CoSyn_400k_chemical
- CoSyn_400k_circuit
- CoSyn_400k_diagram
- CoSyn_400k_document
- CoSyn_400k_graphic
- CoSyn_400k_math
- CoSyn_400k_music
- CoSyn_400k_nutrition
- CoSyn_400k_table
- SynthFormulaNet
- a_okvqa
- aguvis-stage-1
- ai2d_merged
- alfworldgpt
- allava_laion
- allava_vflan
- art
- arxivqa
- bentham
- blockdiagramcomputerized
- blockdiagramhandwritten
- cambrian(filtered)_processed
- captcha
- chrome_writting
- clevr
- clevr_math
- clevr_math(mathv360k)
- coco_colors
- cocoqa
- cocotext
- datikz
- diagram_image_to_text
- face_emotion
- figureqa
- figureqa(mathv360k)
- geo170k(align)
- geo170k(qa)
- geo3k
- geometry3k(mathv360k)
- geomverse
- geos(mathv360k)
- google_landmarks
- groundui
- handwriting_forms
- hateful_memes
- hitab
- hw_squad
- iam
- iconqa
- iconqa(mathv360k)
- idk
- iiit5k
- image_textualization(filtered)
- imgur5k
- indoor_qa
- infographic_vqa
- intergps
- invoices_receipts
- latex_handwritten
- latexformulas
- llavar_gpt4_20k
- lnqa
- lrv_chart
- lrv_normal(filtered)
- lvis_instruct4v
- mapqa
- mapqa(mathv360k)
- maptext
- mathwriting-google
- mavis_math_metagen
- mavis_math_rule_geo
- memotion
- mimic_cgd
- mmc_instruct
- mmevol
- mmra
- mmsoc_memotion
- nlvr2
- ocrvqa
- oodvqa
- orand_car_a
- pathvqa
- pdfvqa
- raven
- rendered_text
- robut_sqa
- robut_wikisql
- robut_wtq
- scienceqa
- screen2words
- screenqa
- sketchyvqa
- spark
- spatialsense
- spot_the_diff
- sujet_finance
- super_clevr(mathv360k)
- synthdog
- tabmwp(mathv360k)
- tqa
- ureader_cap
- ureader_ie
- vision_flan(filtered)
- visualmrc
- visualwebinstruct(filtered)
- vizwiz(mathv360k)
- vqaonbd
- vqarad
- vsr
- websight
- wildvision
- wordart
- yesbut
Provider:
maas
Created:
2025-09-09



