five

Replication Data for: Constructing Vec-tionaries to Extract Message Features from Texts: A Case Study of Moral Content

收藏
DataONE2025-03-11 更新2025-12-06 收录
下载链接:
https://search.dataone.org/view/sha256:3227ea090e4a72446335c4a5398e60c595994bd3ca8aaef22f5425b10e4066f5
下载链接
链接失效反馈
官方服务:
资源简介:
While researchers often study message featureslike moral content in text, such as party manifestos and social media, their quantification remains a challenge. Conventional human coding struggles with scalability and intercoder reliability. While dictionary-based methods are cost-effective and computationally efficient, they often lack contextual sensitivity and are limited by the vocabularies developed for the original applications. In this paper, we present an approach to construct vec-tionary measurement toolsthat boost validated dictionaries with word embedding through nonlinear optimization. By harnessing semantic relationships encoded by embeddings, vec-tionariesimprove the measurement of message features from text, especially those in short format, by expanding the applicability of originalvocabularies to other contexts. Importantly, avec-tionary can produce additional metrics tocapture the valence and ambivalence of amessage feature beyond its strength in texts. Using moral content in tweets as acase study,we illustrate the steps to construct the moral foundations vec-tionary, showcasing itsability to process texts missed by conventional dictionaries and word embedding methods and to produce measurements better aligned with crowdsourced human assessments. Furthermore, additional metrics from the vec-tionary unveiled unique insights that facilitated predicting outcomes such as message retransmission.
创建时间:
2025-10-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作