MojiTextCN
收藏魔搭社区2025-10-25 更新2025-09-20 收录
下载链接:
https://modelscope.cn/datasets/shoohi/MojiTextCN
下载链接
链接失效反馈官方服务:
资源简介:
MojiTextCN是一个面向中文密集文本图像生成与识别的大规模数据集,旨在推动多模态大模型在“含文本图像生成”中的研究与应用。MojiTextCN 涵盖了数百万级中文文本图像样本,覆盖常用汉字表(一级)、词语、古诗文、现代句子等多种文本形式,结合多样化的版式与背景,能够全面刻画真实世界中的文本分布。
MojiTextCN is a large-scale dataset targeting Chinese dense text image generation and recognition, designed to advance the research and application of multimodal large language models in the field of "text-containing image generation". MojiTextCN encompasses millions of Chinese text image samples, covering diverse text formats including the first-level commonly used Chinese character set, words, ancient Chinese poetry and prose, modern sentences, etc. By integrating diverse layouts and backgrounds, it can comprehensively characterize the text distribution in real-world scenarios.
提供机构:
maas
创建时间:
2025-08-27



