Replication Data for: Measuring the Significance of Policy Outputs with Positive Unlabeled Learning

NIAID Data Ecosystem2026-03-12 收录

下载链接：

https://doi.org/10.7910/DVN/1XXDMW

下载链接

链接失效反馈

官方服务：

资源简介：

Identifying important policy outputs has long been of interest to political scientists. In this work, we propose a novel approach to the classification of policies. Instead of obtaining and aggregating expert evaluations of significance for a finite set of policy outputs, we use experts to identify a small set of significant outputs and then employ positive unlabeled (PU) learning to search for other similar examples in a large unlabeled set. We further propose to automate the first step by harvesting ‘seed’ sets of significant outputs from web data. We offer an application of the new approach by classifying over 9,000 government regulations in the United Kingdom. The obtained estimates are successfully validated against human experts, by forecasting web citations, and with a construct validity test.

创建时间：

2020-10-19

5,000+

优质数据集

54 个

任务类型

进入经典数据集