chuuhtetnaing/mm-lib-book-dataset
Source: Hugging Face. Updated: 2025-04-01. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/chuuhtetnaing/mm-lib-book-dataset
Description:
The MM-Lib Myanmar Book Corpus Dataset contains the full text and metadata of 437 books collected from the MM-Lib website. The text of each book was extracted from its EPUB file. The metadata includes fields such as the book link, title, cover image link, category, author name, and author description.
Provided by:
chuuhtetnaing



