
nibzard/narodne-novine-full-text-markdown

Source: Hugging Face. Updated 2026-03-31; indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/nibzard/narodne-novine-full-text-markdown
Resource description:
---
pretty_name: Narodne Novine Full Text Markdown
language:
- hr
license: other
annotations_creators:
- machine-generated
source_datasets:
- original
task_categories:
- text-generation
tags:
- law
- legislation
- croatian
- markdown
- html-to-markdown
---

# Narodne Novine Full Text Markdown

HTML-to-Markdown text extraction snapshot derived from the NN archive.

## Coverage

- Extracted acts: `96855`
- Failed acts: `157`
- Missing HTML embodiments: `150`

## Files

- `texts.parquet`
- `failures.parquet`
- `metadata.json`

## Notes

- Extraction prefers `/hrv/printhtml`, then falls back to `/hrv/html`.
- Conversion method: `markitdown_html`
- This snapshot does not mirror PDFs.
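
The fallback order described in the Notes can be illustrated with a minimal sketch. This is not the dataset author's extraction code: the function name `pick_embodiment`, the injected `fetch` callable, and the example URLs are all hypothetical, and the sketch assumes the two suffixes are appended to a per-act base URL.

```python
from typing import Callable, Optional, Tuple

# Path suffixes in the order of preference stated in the Notes.
EMBODIMENT_SUFFIXES = ("/hrv/printhtml", "/hrv/html")


def pick_embodiment(
    act_url: str,
    fetch: Callable[[str], Optional[str]],
) -> Optional[Tuple[str, str]]:
    """Return (url, html) for the first embodiment that `fetch` can retrieve.

    `fetch` is a hypothetical callable that returns the HTML text of a URL,
    or None when that embodiment does not exist.
    """
    base = act_url.rstrip("/")
    for suffix in EMBODIMENT_SUFFIXES:
        url = base + suffix
        html = fetch(url)
        if html:
            return url, html
    # Neither embodiment was available (assumption: such an act would be
    # recorded as having a missing HTML embodiment).
    return None


# Usage with an in-memory stand-in for the network (hypothetical URLs).
fake_archive = {"https://example.org/act/1/hrv/html": "<p>Tekst</p>"}
result = pick_embodiment("https://example.org/act/1", fake_archive.get)
print(result)
```

Passing the fetcher in as a parameter keeps the preference logic testable without network access; a real pipeline would supply an HTTP client and then hand the returned HTML to the converter.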
Provider:
nibzard