道德基础Reddit语料库
收藏arXiv2022-08-18 更新2024-06-21 收录
下载链接:
https://huggingface.co/datasets/USC-MOLA-Lab/MFRC
下载链接
链接失效反馈官方服务:
资源简介:
道德基础Reddit语料库是由南加州大学创建的一个包含16,123条英文Reddit评论的数据集。这些评论来自12个不同的子论坛,并根据最新的道德基础理论框架,由至少三名训练有素的标注者手工标注了8种道德情感类别。该数据集旨在通过提供大规模的手工标注训练数据,提高自然语言处理中道德情感分类的性能。数据集的应用领域包括研究在线和离线行为中的道德框架和情感影响,如捐赠、环保行动、政治参与甚至暴力抗议等。
The Moral Foundation Reddit Corpus is a dataset comprising 16,123 English Reddit comments developed by the University of Southern California. These comments originate from 12 distinct subreddits, and were manually annotated into 8 moral emotion categories by at least three trained annotators in accordance with the latest Moral Foundation Theory framework. This dataset is designed to enhance the performance of moral emotion classification in natural language processing by providing large-scale manually annotated training data. Its applicable research domains include studies on moral frameworks and emotional impacts in both online and offline behaviors, such as donation activities, environmental protection initiatives, political participation, and even violent protests, and so on.
提供机构:
南加州大学
创建时间:
2022-08-11



