Replication Data for: Measuring the Significance of Policy Outputs with Positive Unlabeled Learning

DataONE2020-10-19 更新2024-06-08 收录

下载链接：

https://search.dataone.org/view/sha256:244e664be45a391829a8f24ac650e52015b3572daa17a2bde62f8f5ba8d6f0ba

下载链接

链接失效反馈

官方服务：

资源简介：

Identifying important policy outputs has long been of interest to political scientists. In this work, we propose a novel approach to the classification of policies. Instead of obtaining and aggregating expert evaluations of significance for a finite set of policy outputs, we use experts to identify a small set of significant outputs and then employ positive unlabeled (PU) learning to search for other similar examples in a large unlabeled set. We further propose to automate the first step by harvesting ‘seed’ sets of significant outputs from web data. We offer an application of the new approach by classifying over 9,000 government regulations in the United Kingdom. The obtained estimates are successfully validated against human experts, by forecasting web citations, and with a construct validity test.

创建时间：

2023-11-23

5,000+

优质数据集

54 个

任务类型

进入经典数据集