five

Moral Judgments in Narratives on Reddit: Investigating Moral Sparks via Social Commonsense and Linguistic Signals

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/8342060
下载链接
链接失效反馈
官方服务:
资源简介:
The file 'post_instances.jsonl' contains instances extracted from specific posts. In this file, instances that contain moral sparks are labeled as '1,' while others are labeled differently or as '0.' Each instance is scraped by using PushShift API by searching for an unique id. And each instance contains its comment ids that use ">" to quote excerpts in a post. We removed author names and make it left with ids and contexts. The "label" field is computed by using regular expressions to match predefined r/AmItheAsshole verdict codes. The sup_documents.pdf includes full lists of c-event clusters and parameters of linguistic features used in our paper. The regular expressions used to extract the verdicts are as follows:AUTHOR = (0, 'YTA', [         r'\m(?i:YWBTA?)\M',         r'\m(?i:YTAH?)\M',         r"(?e)(?i:"           r"you(?:'re| r| are| were| would be| will be) "           r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"           r"(?:an? |the )?"           r"(?:huge |big |giant )?"           r"(?:asshole|a-?hole)"         r"){e<=1}",         r"(?e)(?i:"           r"you "           r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"           r"(?: r| are| were)? (?:an? |the )?"           r"(?:huge |big |giant )?"           r"(?:asshole|a-?hole)"         r"){e<=1}"     ])     OTHER = (1, 'NTA', [         r'\m(?i:YWNBTA?)\M',         r'\m(?i:Y?NTAH?)\M',         r'(?e)(?i:'         r"you(?:'re| r| are| were| would| will) "         r"(?!both)"         r'(?:not| not be) '         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'(?:an? |the )?'         r"(?:asshole|a-?hole)"         r'){e<=1}',         r'(?e)(?i:'         r"(he|she)(?:'s|s| s| is| was)"         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'(?:an? |the )?'         r"(?:asshole|a-?hole)"         r'){e<=1}',         r'(?e)(?i:'         r"they(?:'re|r| r| are| were)"           r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"           r'(?:the )?'           r"(?:asshole|a-?hole)"         r'){e<=1}'     ])     EVERYBODY = (2, 'ESH', [         r'\m(?i:ESH)\M',         r'(?e)(?i:every(?:one|body) sucks here){e<=1}',         r'(?e)(?i:you both suck){e<=1}',         r'(?e)(?i:'         r"you(?:'re| r| are| were) "         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'both (?:the )? (?:assholes?|a-?holes?)){e<=1}',         r'(?e)(?i:'         r"you both"         r"(?:'re| r| are| were)? "         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'(?:the )?'         r"(?:assholes?|a-?holes?)"         r'){e<=1}',         r'(?e)(?i:'         r"there(?: r| are| were)(?: any| all) "         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'(?:assholes?|a-?holes?)){e<=1}'              ])     NOBODY = (3, 'NAH', [         r'\m(?i:NAH?H)\M',         r'(?e)(?i:no (?:assholes|a-?holes|asshole) here)',         r'(?e)(?i:no one is the (?:asshole|a-?hole)){e<=1}',         r'(?e)(?i:'         r"you both"         r"(?:'re| r| are| were)? "         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'not (?:an? |the )?'         r"(?:assholes?|a-?holes?)"         r'){e<=1}',         r'(?e)(?i:'         r"you(?:'re| r| are| were) "         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'both not (?:an? |the )? (?:assholes?|a-?holes?)){e<=1}',         r'(?e)(?i:'         r"you "         r"both(?: weren't| aren't)? (?:an? |the )? (?:assholes?|a-?holes?)){e<=1}",         r'(?e)(?i:'         r"there(?: r| are| were)"         r' no '         r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"         r'(?:assholes?|a-?holes?)){e<=1}'     ])     INFO = (4, 'INFO', [         r'\m(?i:INFO)\M',         r'(?e)(?i:not enough info){e<=1}',         r'(?e)(?i:needs? more info){e<=1}',         r"(?e)(?i:more info(?:'s| is)? required){e<=1}"     ])
创建时间:
2024-07-11
二维码
社区交流群
二维码
科研交流群
商业服务