Website Screenshots Dataset
收藏universe.roboflow.com2021-08-21 更新2025-03-22 收录
下载链接:
https://universe.roboflow.com/rajat-raina/website-screenshots-yt4tg
下载链接
链接失效反馈官方服务:
资源简介:
# About This Dataset
The Roboflow `Website Screenshots` dataset is a synthetically generated dataset composed of screenshots from over 1000 of the world's top websites. They have been automatically annotated to label the following classes:
:fa-spacer:
* `button` - navigation links, tabs, etc.
* `heading` - text that was enclosed in `<h1>` to `<h6>` tags.
* `link` - inline, textual `<a>` tags.
* `label` - text labeling form fields.
* `text` - all other text.
* `image` - `<img>`, `<svg>`, or `<video>` tags, and icons.
* `iframe` - ads and 3rd party content.
## Example
This is an example image and annotation from the dataset:

## Usage
Annotated screenshots are very useful in Robotic Process Automation. But they can be expensive to label. This dataset would cost over $4000 for humans to label on popular labeling services. We hope this dataset provides a good starting point for your project. Try it with [a model from our model library](https://models.roboflow.ai).
## Collecting Custom Data
Roboflow is happy to provide a custom screenshots dataset to meet your particular needs. We can crawl public or internal web applications. Just [reach out](https://roboflow.ai/contact) and we'll be happy to provide a quote!
# About Roboflow
[Roboflow](https://roboflow.ai) makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.
:fa-spacer:
Developers reduce 50% of their boilerplate code when using Roboflow's workflow, save training time, and increase model reproducibility.
:fa-spacer:
#### [](https://roboflow.ai)
关于此数据集
Roboflow '网站截图' 数据集为一组合成生成数据集,包含来自全球超过1000家顶级网站的截图。这些截图已自动标注,以识别以下类别:
* 'button' - 导航链接、标签页等。
* 'heading' - 被包裹在 <h1> 至 <h6> 标签内的文本。
* 'link' - 行内文本的 <a> 标签。
* 'label' - 标记表单字段的文本。
* 'text' - 所有其他文本。
* 'image' - <img>、<svg> 或 <video> 标签,以及图标。
* 'iframe' - 广告和第三方内容。
## 示例
以下是该数据集中的一个示例图像及其标注:
## 用途
标注的截图在机器人流程自动化领域极为有用。然而,人工标注成本高昂。使用流行的标注服务,此数据集的标注费用将超过4000美元。我们希望此数据集能为您的项目提供一个良好的起点。您可以尝试使用我们模型库中的[模型](https://models.roboflow.ai)。
## 自定义数据收集
Roboflow乐于为您提供满足特定需求的自定义截图数据集。我们可以爬取公共或内部网络应用程序。只需[联系我们](https://roboflow.ai/contact),我们将很高兴为您提供报价!
# 关于Roboflow
[Roboflow](https://roboflow.ai)致力于实现计算机视觉数据集的管理、预处理、增强和版本控制的无缝化。
开发者在使用Roboflow的工作流程时,可以减少50%的样板代码,节省训练时间,并提高模型的可重复性。
#### [](https://roboflow.ai)
提供机构:
Roboflow



