browndw/morphoseg-en-source-assets
Source: Hugging Face. Updated 2026-03-20; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/browndw/morphoseg-en-source-assets
Resource description:
---
license: mit
task_categories:
- text-classification
language:
- en
pretty_name: MorphoSeg Source Assets
dataset_info:
- config_name: source_assets
---
# MorphoSeg Source Assets
This dataset provides canonical source morphology assets exported from the `morph-parser` repository.
## Files
1. `morph_candidates.parquet`
   - records: 347,993
   - bytes: 5,194,597
   - sha256: `c83022c5ce2ca926108dbe339a1b7e1983bc7c4745c57ed4ad651b24e294e0cb`
2. `wiki_morph.parquet`
   - records: 505,033
   - bytes: 43,383,678
   - sha256: `d91a67226c083fb72f728823c5b5d6e86529f61639823bb3f8dca3433c7cd9f9`
3. Optional stream files (if `--keep-jsonl` was enabled):
   - `morph_candidates.jsonl`
   - `wiki_morph.jsonl`
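The byte counts and SHA-256 digests listed above can be used to check a downloaded copy. The following is a minimal sketch, assuming the Parquet files sit in the current working directory; the helper names `sha256_of` and `verify` are illustrative and are not part of `morph-parser`.

```python
import hashlib
from pathlib import Path

# Expected (size in bytes, SHA-256 hex digest), copied from the Files list.
EXPECTED = {
    "morph_candidates.parquet": (
        5_194_597,
        "c83022c5ce2ca926108dbe339a1b7e1983bc7c4745c57ed4ad651b24e294e0cb",
    ),
    "wiki_morph.parquet": (
        43_383_678,
        "d91a67226c083fb72f728823c5b5d6e86529f61639823bb3f8dca3433c7cd9f9",
    ),
}


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in fixed-size chunks so large files are not read at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_bytes, expected_sha256):
    """Return True only if both the file size and the digest match."""
    path = Path(path)
    if path.stat().st_size != expected_bytes:
        return False
    return sha256_of(path) == expected_sha256


if __name__ == "__main__":
    # Assumption: files were downloaded into the current directory.
    for name, (size, sha) in EXPECTED.items():
        target = Path(name)
        if not target.exists():
            print(f"{name}: not found")
        else:
            print(f"{name}: {'OK' if verify(target, size, sha) else 'MISMATCH'}")
```

Checking the size first is a cheap way to reject a truncated download before hashing the whole file.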
## Provenance
- Source repository: `morph-parser`
- Generated at: `2026-03-20T13:30:30.522755+00:00`
- Target repo: `browndw/morphoseg-en-source-assets`
## Notes
- Primary format is Parquet for Hugging Face dataset viewer compatibility.
- JSONL files are optional stream-processing exports.
- Original source files and checksums are documented in `manifest.json`.
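If the optional JSONL exports are present, they can be read one record at a time rather than loaded whole. The sketch below uses only the standard library and assumes one JSON object per line; the record fields are not documented here, so the code makes no assumptions about them. The function names `iter_jsonl` and `count_records` are illustrative.

```python
import json


def iter_jsonl(path):
    """Yield one parsed record per non-empty line, without loading the whole file."""
    with open(path, "r", encoding="utf-8") as fh:
        for line_no, line in enumerate(fh, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            try:
                yield json.loads(line)
            except json.JSONDecodeError as exc:
                raise ValueError(f"{path}:{line_no}: invalid JSON") from exc


def count_records(path):
    """Count records by streaming through the file once."""
    return sum(1 for _ in iter_jsonl(path))
```

Assuming the JSONL export mirrors the Parquet file row for row, `count_records("wiki_morph.jsonl")` would be expected to equal the 505,033 records listed under Files.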
Provider:
browndw



