five

temsa/govie-office-holder-regression-bilingual-v3

收藏
Hugging Face2026-04-07 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/temsa/govie-office-holder-regression-bilingual-v3
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-retrieval - question-answering language: - en - ga pretty_name: GOV.IE Office-Holder Regression Bilingual v3 size_categories: - n<1K --- # GOV.IE Office-Holder Regression Bilingual v3 This dataset is a tenure-aware regression truth set for GOV.IE office-holder search and chatbot checks. It is derived from `temsa/govie-office-holder-reranker-bilingual-v2` and adds: - explicit `effective_from` / `effective_to` intervals - stable role-page canonical targets for role queries - recent-history backfill for a small set of high-value offices - Education portfolio coverage across the 2025 holder change ## Intended use The dataset is intended for: - post-deploy office-holder regression checks - snapshot-aware validation with `--as-of-date` - comparing retrieval behavior against a stable indexed environment ## Core fields - `query_id`, `query`, `query_type`, `language`, `split` - `holder`, `holder_ascii`, `role` - `official_profile_url` - `effective_from`, `effective_to` - `candidates` - `source_urls`, `source_notes` `candidates` encodes the accepted canonical targets for a query. For role queries this usually includes the current stable role page plus, where useful, the associated biography path.
提供机构:
temsa
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作