pixxu/ViTextRender-500K

Name: pixxu/ViTextRender-500K
Creator: pixxu
Published: 2025-12-09 18:38:14
License: 暂无描述

Hugging Face2025-12-09 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/pixxu/ViTextRender-500K

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 task_categories: - text-to-image language: - vi tags: - text-rendering - ocr - synthetic-data - generative-models - fine-tuning size_categories: - 100K<n<1M pretty_name: Vietnamese Text Render 500K configs: - config_name: complex data_files: - split: train path: data/complex/* - config_name: simple data_files: - split: train path: data/simple/* --- # Vietnamese Text Render 500K Dataset A large-scale dataset containing 500K Vietnamese text rendering image-text pairs for training generative models to improve text rendering performance. ## Dataset Structure - **`image`**: Rendered text image in PNG format - **`text`**: Corresponding text content - **`filename`**: Original filename ## Usage This dataset is designed for fine-tuning generative models to improve text rendering capabilities on Vietnamese language. ```python from datasets import load_dataset from PIL import Image from io import BytesIO dataset = load_dataset("pixxu/ViTextRender-500K") # Whole dataset include simple & complex images print(dataset) # Access a sample sample = dataset['train'][0] print(sample['text']) img = Image.open(BytesIO(sample['image'])) img.show() # If you run on Google Colab Notebook or Kaggle Notebook: # img = Image.open(BytesIO(sample['image'])).convert("RGB") # display(img) ``` If you want to load only simple or complex images: ```python from datasets import load_dataset from PIL import Image from io import BytesIO dataset = load_dataset("pixxu/ViTextRender-500K", name="simple", split="train") # For simple images # For complex images, use: dataset = load_dataset("pixxu/ViTextRender-500K", name="complex", split="train") print(dataset) # Access a sample sample = dataset[0] print(sample['text']) img = Image.open(BytesIO(sample['image'])) img.show() # If you run on Google Colab Notebook or Kaggle Notebook # img = Image.open(BytesIO(sample['image'])).convert("RGB") # display(img) ``` ## Applications - Fine-tuning text-to-image models for better text rendering on Vietnamese language. ## License Apache License 2.0

提供机构：

pixxu

5,000+

优质数据集

54 个

任务类型

进入经典数据集