
pixxu/ViTextRender-500K

Hugging Face, updated 2025-12-09, indexed 2025-12-20
Download link:
https://hf-mirror.com/datasets/pixxu/ViTextRender-500K
Resource description:
---
license: apache-2.0
task_categories:
- text-to-image
language:
- vi
tags:
- text-rendering
- ocr
- synthetic-data
- generative-models
- fine-tuning
size_categories:
- 100K<n<1M
pretty_name: Vietnamese Text Render 500K
configs:
- config_name: complex
  data_files:
  - split: train
    path: data/complex/*
- config_name: simple
  data_files:
  - split: train
    path: data/simple/*
---

# Vietnamese Text Render 500K Dataset

A large-scale dataset of 500K Vietnamese text-rendering image-text pairs for training generative models to render text more accurately.

## Dataset Structure

- **`image`**: Rendered text image in PNG format
- **`text`**: Corresponding text content
- **`filename`**: Original filename

## Usage

This dataset is designed for fine-tuning generative models to improve their text rendering in Vietnamese.

```python
from datasets import load_dataset
from PIL import Image
from io import BytesIO

dataset = load_dataset("pixxu/ViTextRender-500K")  # Whole dataset, including both simple and complex images
print(dataset)

# Access a sample
sample = dataset['train'][0]
print(sample['text'])
img = Image.open(BytesIO(sample['image']))
img.show()

# If you run on a Google Colab or Kaggle notebook:
# img = Image.open(BytesIO(sample['image'])).convert("RGB")
# display(img)
```

If you want to load only the simple or only the complex images:

```python
from datasets import load_dataset
from PIL import Image
from io import BytesIO

dataset = load_dataset("pixxu/ViTextRender-500K", name="simple", split="train")  # For simple images
# For complex images, use: dataset = load_dataset("pixxu/ViTextRender-500K", name="complex", split="train")
print(dataset)

# Access a sample
sample = dataset[0]
print(sample['text'])
img = Image.open(BytesIO(sample['image']))
img.show()

# If you run on a Google Colab or Kaggle notebook:
# img = Image.open(BytesIO(sample['image'])).convert("RGB")
# display(img)
```

## Applications

- Fine-tuning text-to-image models for better text rendering in Vietnamese.

## License

Apache License 2.0
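
The Usage snippets wrap `sample['image']` in `BytesIO`, which assumes the column holds raw PNG bytes. Depending on how the column is typed, the `datasets` library may instead return an already-decoded PIL image, or a dict with `bytes` and `path` keys when decoding is disabled. The helper below is a minimal sketch that accepts all three forms; `to_pil` is a hypothetical name used here for illustration, not part of the dataset or of any library.

```python
from io import BytesIO

from PIL import Image


def to_pil(image_field):
    """Return a PIL image from the `image` column, whatever form it arrives in.

    Hypothetical helper: the actual column type of this dataset is an assumption.
    """
    # Already decoded, which is what the `datasets` Image feature yields by default.
    if isinstance(image_field, Image.Image):
        return image_field
    # Undecoded Image feature: a dict with "bytes" and/or "path" keys.
    if isinstance(image_field, dict):
        if image_field.get("bytes") is not None:
            return Image.open(BytesIO(image_field["bytes"]))
        return Image.open(image_field["path"])
    # Raw PNG bytes, as the Usage snippets assume.
    if isinstance(image_field, (bytes, bytearray)):
        return Image.open(BytesIO(image_field))
    raise TypeError(f"Unsupported image field type: {type(image_field).__name__}")
```

With this in place, `img = to_pil(sample['image'])` can stand in for the `Image.open(BytesIO(...))` call in either loading snippet.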
Provider:
pixxu