nirmalpratheep/TamilTextCorpus
收藏Hugging Face2025-11-09 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/nirmalpratheep/TamilTextCorpus
下载链接
链接失效反馈官方服务:
资源简介:
Project Madurai语料库笔记是一组从Project Madurai网站通过OCR提取的文本集合。该数据集用于泰米尔语的语言建模实验。
Project Madurai Corpus Notes is a collection of OCR-extracted text from the Project Madurai website, intended for Tamil language modeling experiments.
提供机构:
nirmalpratheep



