Ot & Sien, a dataset to help the development of object detection in children's book illustrations
Source: 4TU.ResearchData | Updated: 2023-06-23 | Indexed: 2026-04-23
Download link:
https://data.4tu.nl/datasets/d1f3ca5c-f1e4-48f5-9a04-0564572d2b9c/1
Description:
This dataset is derived from the original Ot & Sien Dataset (https://lab.kb.nl/dataset/ot-sien-dataset). We corrected mistakes and made it ML-ready. Its purpose is to help the development of automatic visual object detection in children's book illustrations. Its properties can be summarized as follows. The dataset consists of illustrations rather than standard photos. It contains 1452 images with 8241 annotated objects (about 5.7 per image); each annotation includes a category and a bounding box. All images are resized to 416 x 416, with black fitting edges added to suit the training procedure. Category frequencies follow a natural long-tail distribution, so the categories are imbalanced and some object categories are rare.
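The resizing step described above can be illustrated with a small sketch. It assumes the "black fitting edges" are a conventional letterbox: the aspect ratio is preserved, the longer side is scaled to 416, and the black padding is split evenly on both sides. The listing does not confirm these details, and the function names and the corner-style box format (x_min, y_min, x_max, y_max in pixels) are hypothetical, chosen only for illustration.

```python
def letterbox_geometry(width, height, target=416):
    """Compute scale and padding for fitting an image into a target square.

    Assumption: aspect ratio is kept and padding is centered; the dataset
    description only says that black fitting edges are added.
    """
    scale = target / max(width, height)   # longer side becomes `target`
    new_w = round(width * scale)
    new_h = round(height * scale)
    pad_x = (target - new_w) // 2         # black border on the left
    pad_y = (target - new_h) // 2         # black border on the top
    return scale, new_w, new_h, pad_x, pad_y


def map_box(box, width, height, target=416):
    """Map a pixel box (x_min, y_min, x_max, y_max) into the padded image."""
    x_min, y_min, x_max, y_max = box
    scale, _, _, pad_x, pad_y = letterbox_geometry(width, height, target)
    return (
        x_min * scale + pad_x,
        y_min * scale + pad_y,
        x_max * scale + pad_x,
        y_max * scale + pad_y,
    )


if __name__ == "__main__":
    # A hypothetical 832 x 416 illustration with one annotated object.
    print(letterbox_geometry(832, 416))
    print(map_box((100, 50, 300, 250), 832, 416))
```

Under these assumptions, a wide image gets black bands above and below, and every bounding box shifts down by the height of the top band. If the published files use a different padding layout or box format, the offsets would need to be adjusted accordingly.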
Provided by:
Wang, Haoran; Khademi, Seyran
Created:
2023-06-23



