Korean Dataset

Name: Korean Dataset
Creator: ElementAI
Published: 2025-09-30T13:43:26+08:00

arXiv2025-09-30 收录

韩文字符识别

机器学习

数据链接：

https://github.com/ElementAI/synbols-benchmarks 数据链接链接失效反馈

官方服务：

资源简介：

该数据集是通过从Unicode标准的前1000个符号中均匀采样生成的，特别关注韩语。令人惊讶的是，观察到了较高的准确度，这可能是因为字体多样性较低所致。该数据集的任务是对字体进行分类，并评估学习算法的效果。

This dataset is generated via uniform sampling from the first 1000 symbols of the Unicode Standard, with a particular focus on Korean language symbols. Surprisingly, a relatively high classification accuracy was observed, which is potentially attributed to the low font diversity. The core task of this dataset is font classification, which is used to evaluate the performance of learning algorithms.

提供机构：

ElementAI

Korean Dataset

资源简介：

相关数据集