MultiUI
收藏huggingface.co2025-01-22 收录
下载链接:
https://huggingface.co/datasets/neulab/MultiUI
下载链接
链接失效反馈官方服务:
资源简介:
MulitUI
Dataset for the paper: Harnessing Webpage Uis For Text Rich Visual Understanding
🌐 Homepage | 🐍 GitHub | 📖 arXiv
Introduction
We introduce MultiUI, a dataset containing 7.3 million samples from 1 million websites, covering diverse multi- modal tasks and UI layouts. Models trained on MultiUI not only excel in web UI tasks—achieving up to a 48% improvement on VisualWebBench and a 19.1% boost in action accuracy on a web agent dataset… See the full description on the dataset page: https://huggingface.co/datasets/neulab/MultiUI.
本数据集名为 MulitUI,源自论文《Harnessing Webpage Uis For Text Rich Visual Understanding》。该数据集汇聚了来自一百万个网站、共计七百三十万个样本,涵盖了多种多模态任务和用户界面布局。基于 MultiUI 训练的模型不仅在网页用户界面任务上表现出色——在 VisualWebBench 上实现了高达 48% 的性能提升,同时在网页智能体数据集上的动作准确性上提升了 19.1%……欲了解更多详细信息,请参阅数据集页面:https://huggingface.co/datasets/neulab/MultiUI。
提供机构:
huggingface.co



