Board Leadership Database (U.S. Public Firms) + ML Script for Scaling Human Coded Data
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7304696
下载链接
链接失效反馈官方服务:
资源简介:
Files include: (1) an open sourced database of CEO duality and board chair orientations developed by scaling human coded data using supervised machine learning techniques (in both .dta and .csv formats), as well as (2) the accompanying training and scoring scripts to scale human coded data.
Users may apply the scoring script to score the same variables from company proxy statements, or may adapt the training/scoring scripts and retrain models to scale human coded data of other constructs or measures.
We note that early steps in the process to develop our database and script required web-scraping of company filings from SEC Edgar and text extraction from collected filings. We relied on other publicly available scripts to develop our own fetcher and extraction scripts. Users seeking to duplicate those parts of the process may benefit from the following resources from Kai Chen and pipy.org:
For resources from Kai Chen: see https://www.kaichen.work/?p=681 and https://www.kaichen.work/?p=946
For resources from pipy.org, see sec-edgar-downloader and sec-api
创建时间:
2023-06-26



