
A dataset for the detection of mathematical expressions in camera captured document images acquired in Vietnam

Source: Mendeley Data (indexed 2026-04-18)
Download link:
https://data.mendeley.com/datasets/rd5x9vz4y6
Description:
The dataset consists of 6,000 Vietnamese camera-captured document images that contain mathematical expressions. It is divided into a training set of 5,000 document images and a testing set of 1,000 document images. The dataset can be used to develop and evaluate methods for detecting mathematical expressions in camera-captured document images. This is the first dataset for the development and evaluation of detection algorithms for mathematical expressions in Vietnamese camera-captured document images. The annotation files (in .json format) provide the positions of the mathematical expressions. The dataset also provides LaTeX strings of the mathematical expressions, which can be used to evaluate recognition algorithms. Researchers can use the Intersection over Union (IoU) metric to determine whether a detection is correct.

Please refer to the following articles when using the dataset:
[1] Bui Hai Phong et al., "Mathematical Expression Detection in Camera Captured Document Images", Lecture Notes on Data Engineering and Communications Technologies, 148, pp. 98–109, 2022.
[2] Bui Hai Phong et al., "An end-to-end framework for the detection of mathematical expressions in scientific document images", Expert Systems, 39(1), e12800, 2022.
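
The IoU-based correctness check mentioned in the description can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the box layout `(x_min, y_min, x_max, y_max)`, the function names `iou` and `is_correct_detection`, and the 0.5 threshold are all assumptions, and the actual field names in the .json annotation files are not described here, so the sketch takes plain tuples.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes.

    Boxes are assumed to be (x_min, y_min, x_max, y_max) tuples; the
    dataset's own annotation layout may differ and would need converting.
    """
    # Corners of the overlapping rectangle (if any).
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    # Clamp to zero so that disjoint boxes give an empty intersection.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


def is_correct_detection(predicted, ground_truth, threshold=0.5):
    """Count a detection as correct when IoU meets the threshold.

    The default of 0.5 is an assumed, commonly used value, not one
    specified by the dataset description.
    """
    return iou(predicted, ground_truth) >= threshold
```

For example, `is_correct_detection((0, 0, 10, 10), (0, 0, 10, 8))` compares a predicted box against a slightly shorter ground-truth box. Stricter or looser thresholds can be passed through the `threshold` argument.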
Created:
2023-05-11