nibzard/narodne-novine-full-text-markdown
Source: Hugging Face. Updated: 2026-03-31. Indexed: 2026-04-12.
Download link:
https://hf-mirror.com/datasets/nibzard/narodne-novine-full-text-markdown
Resource description:
---
pretty_name: Narodne Novine Full Text Markdown
language:
- hr
license: other
annotations_creators:
- machine-generated
source_datasets:
- original
task_categories:
- text-generation
tags:
- law
- legislation
- croatian
- markdown
- html-to-markdown
---
# Narodne Novine Full Text Markdown
An HTML-to-Markdown text extraction snapshot derived from the Narodne Novine (NN) archive, the official gazette of the Republic of Croatia.
## Coverage
- Extracted acts: `96855`
- Failed acts: `157`
- Missing HTML embodiments: `150`
## Files
- `texts.parquet`
- `failures.parquet`
- `metadata.json`
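
As a rough illustration of how these files might be fetched, the sketch below builds direct download URLs using the Hub's usual `resolve/<revision>/<filename>` path convention. It assumes the three files sit at the repository root on the `main` branch, which this card does not state; `file_url` is a hypothetical helper, not part of any library.

```python
# Minimal sketch: build download URLs for the snapshot files.
# Assumptions (not confirmed by the dataset card): files live at the
# repository root, and the default revision is "main".
REPO_ID = "nibzard/narodne-novine-full-text-markdown"
FILES = ("texts.parquet", "failures.parquet", "metadata.json")


def file_url(filename, repo_id=REPO_ID,
             endpoint="https://huggingface.co", revision="main"):
    """Return a direct download URL for one file in a dataset repo."""
    return f"{endpoint}/datasets/{repo_id}/resolve/{revision}/{filename}"


# Map each listed file name to its assumed download URL.
urls = {name: file_url(name) for name in FILES}

for name, url in urls.items():
    print(name, "->", url)
```

With pandas and a Parquet engine installed, `pandas.read_parquet(urls["texts.parquet"])` should be able to load the table directly. The column names are not documented here, so inspecting `df.columns` first is a sensible starting point. Passing `endpoint="https://hf-mirror.com"` targets the mirror instead.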
## Notes
- Extraction prefers `/hrv/printhtml`, then falls back to `/hrv/html`.
- Conversion method: `markitdown_html`
- This snapshot does not mirror PDFs.
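
The endpoint preference in the first note can be sketched as a simple ordered fallback. This is only an illustration of the stated order, not the pipeline's actual code: `fetch_act_html`, the injected `fetch` callable, and the example base URL are all hypothetical, and the real URL layout of the NN archive is not described in this card.

```python
from typing import Callable, Optional, Tuple

# Endpoint suffixes in the preference order given in the notes.
ENDPOINT_PREFERENCE = ("/hrv/printhtml", "/hrv/html")


def fetch_act_html(
    act_url: str,
    fetch: Callable[[str], Optional[str]],
) -> Optional[Tuple[str, str]]:
    """Try each endpoint in order; return (suffix, html) for the first hit.

    `fetch` is a caller-supplied function (hypothetical) that returns the
    page body for a URL, or None / an empty string when nothing is available.
    Returns None when no endpoint yields content.
    """
    base = act_url.rstrip("/")
    for suffix in ENDPOINT_PREFERENCE:
        html = fetch(base + suffix)
        if html:
            return suffix, html
    return None


# Usage with an in-memory stand-in for the network (hypothetical URLs).
pages = {"https://example.test/act/1/hrv/html": "<p>Zakon</p>"}
print(fetch_act_html("https://example.test/act/1", pages.get))
```

Injecting `fetch` keeps the fallback order separate from any particular HTTP client and makes the behaviour easy to check offline. An act for which both endpoints come back empty would presumably correspond to a missing HTML embodiment, though the card does not define that term.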
Provider:
nibzard



