five

Korean_Travel_Boards_Image_Dataset

收藏
魔搭社区2025-11-27 更新2025-11-29 收录
下载链接:
https://modelscope.cn/datasets/Kratos-AI/Korean_Travel_Boards_Image_Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
# Korean Travel Boards Image Dataset *This dataset contains high-resolution images of Korean travel and tourism boards, including city maps, attraction guides, directional signs, transportation boards, and public tourism information displays. It supports OCR, multilingual translation, and computer vision research in travel-tech applications.* ## Contact For queries or collaborations related to this dataset, contact: - anoushka@kgen.io - abhishek.vadapalli@kgen.io ## Supported Tasks - **Task Categories**: - Image Classification - Text Recognition (OCR) - Scene Understanding - **Supported Tasks**: - Korean text detection and transcription from travel-related signs - Translation benchmarking for multilingual (Korean-English-Chinese-Japanese) tourism boards - Scene text recognition in outdoor environments - Visual understanding of travel information design and layout - AI modeling for navigation and smart tourism applications ## Languages - **Primary Language**: Korean - **Secondary Presence**: English, Chinese, and Japanese (on multilingual tourism boards) ## Dataset Creation ### Curation Rationale This dataset was developed to enable AI systems to interpret public travel-related signage and boards in Korea. It aids in building models for multilingual OCR, contextual translation, and navigation assistance for travelers, enhancing accessibility and smart tourism solutions. ### Source Data - **Contributors**: Field photographers, tourism content collectors, and public dataset contributors - **Collection Process**: Images were captured from public travel boards across airports, railway stations, tourist attractions, city centers, and cultural heritage sites. All identifiable personal information was removed before inclusion. ### Other Known Limitations - **Bias**: Urban and major tourist destinations may be overrepresented - **Environmental Variability**: Lighting conditions, reflections, or obstructions may affect image clarity - **Context Limitations**: Does not include dynamic or digital displays (e.g., airport flight info screens) ## Intended Uses ### ✅ Direct Use - OCR and multilingual text translation research for travel signs - Training computer vision models for wayfinding and navigation - Cross-lingual accessibility and tourism information apps - Visual design research in public signage and information systems ### ❌ Out-of-Scope Use - Commercial replication of signage designs without permission - Geolocation inference or tracking of specific sites or individuals - Misuse of public board content for unrelated commercial advertising ## License CC BY 4.0

# 韩国旅游展板图像数据集 *本数据集包含韩国旅游与观光展板的高分辨率图像,涵盖城市地图、景点导览牌、方向指引标识、交通展板及公共旅游信息显示屏,可支撑旅游科技应用场景下的光学字符识别(Optical Character Recognition, OCR)、多语言翻译及计算机视觉相关研究。* ## 联系方式 如需咨询本数据集相关事宜或开展合作,请联系: - anoushka@kgen.io - abhishek.vadapalli@kgen.io ## 支持任务 - **任务类别**: - 图像分类 - 文本识别(Optical Character Recognition, OCR) - 场景理解 - **支持任务**: - 旅游相关标识的韩语文本检测与转录 - 多语言(韩-英-中-日)观光展板的翻译基准测试 - 户外场景文本识别 - 旅游信息设计与布局的视觉理解 - 导航与智慧旅游应用的AI建模 ## 语言 - **主要语言**:韩语 - **次要语言分布**:多语言观光展板上的英语、中文及日语 ## 数据集构建 ### 遴选依据 本数据集旨在赋能AI系统解读韩国公共旅游相关标识与展板,助力构建面向旅行者的多语言OCR、上下文翻译及导航辅助模型,提升旅游服务的可及性与智慧旅游解决方案的质量。 ### 源数据 - **贡献者**:实地摄影师、旅游内容采集者及公开数据集贡献者 - **采集流程**:图像采集自机场、火车站、旅游景点、城市中心及文化遗产地的公共旅游展板,所有可识别的个人信息均在纳入数据集前完成脱敏处理。 ### 其他已知局限性 - **偏倚问题**:城市及主要旅游目的地的样本可能占比过高 - **环境变异性**:光照条件、反光或遮挡可能影响图像清晰度 - **上下文局限性**:未涵盖动态或数字显示屏(如机场航班信息屏) ## 预期用途 ### ✅ 合法直接用途 - 旅游标识的光学字符识别与多语言文本翻译研究 - 用于导航指引的计算机视觉模型训练 - 跨语言可及性与旅游信息应用开发 - 公共标识与信息系统的视觉设计研究 ### ❌ 超出适用范围的用途 - 未经许可的标识设计商业复刻 - 特定地点或个人的地理定位推断与追踪 - 将公共展板内容用于无关商业广告的不当使用 ## 许可协议 CC BY 4.0
提供机构:
maas
创建时间:
2025-10-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作