five

mukuls9971/address-benchmark-v1

收藏
Hugging Face2026-04-20 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mukuls9971/address-benchmark-v1
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: "mit" pretty_name: "Indian Address Benchmark Dataset v1" task_categories: ["token-classification"] task_ids: ["named-entity-recognition"] language: ["en", "hi"] tags: ["pii", "ner", "token-classification", "benchmark", "addresses"] --- # Indian Address Benchmark Dataset v1 Mixed benchmark dataset for Indian-address tagging built from synthetic data plus public upstream datasets. ## Repository - Dataset repo: `mukuls9971/address-benchmark-v1` - Train split: `26728` - Validation split: `6158` - Test split: `1410` ## Files - `train.jsonl` - `validation.jsonl` - `test.jsonl` - `report.json` ## Notes - Generated and published by the `pii-model-oss` workflow. - Upstream datasets used to assemble benchmark variants retain their own licenses. ## Warnings - LinCE train/dev could not be fetched from the original host; used CodeMixBench ner_hineng test as a held-out-only fallback.
提供机构:
mukuls9971
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作