Moral Judgments in Narratives on Reddit: Investigating Moral Sparks via Social Commonsense and Linguistic Signals
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/8342060
下载链接
链接失效反馈官方服务:
资源简介:
The file 'post_instances.jsonl' contains instances extracted from specific posts. In this file, instances that contain moral sparks are labeled as '1,' while others are labeled differently or as '0.'
Each instance is scraped by using PushShift API by searching for an unique id. And each instance contains its comment ids that use ">" to quote excerpts in a post. We removed author names and make it left with ids and contexts. The "label" field is computed by using regular expressions to match predefined r/AmItheAsshole verdict codes.
The sup_documents.pdf includes full lists of c-event clusters and parameters of linguistic features used in our paper.
The regular expressions used to extract the verdicts are as follows:AUTHOR = (0, 'YTA', [
r'\m(?i:YWBTA?)\M',
r'\m(?i:YTAH?)\M',
r"(?e)(?i:"
r"you(?:'re| r| are| were| would be| will be) "
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r"(?:an? |the )?"
r"(?:huge |big |giant )?"
r"(?:asshole|a-?hole)"
r"){e<=1}",
r"(?e)(?i:"
r"you "
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r"(?: r| are| were)? (?:an? |the )?"
r"(?:huge |big |giant )?"
r"(?:asshole|a-?hole)"
r"){e<=1}"
])
OTHER = (1, 'NTA', [
r'\m(?i:YWNBTA?)\M',
r'\m(?i:Y?NTAH?)\M',
r'(?e)(?i:'
r"you(?:'re| r| are| were| would| will) "
r"(?!both)"
r'(?:not| not be) '
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'(?:an? |the )?'
r"(?:asshole|a-?hole)"
r'){e<=1}',
r'(?e)(?i:'
r"(he|she)(?:'s|s| s| is| was)"
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'(?:an? |the )?'
r"(?:asshole|a-?hole)"
r'){e<=1}',
r'(?e)(?i:'
r"they(?:'re|r| r| are| were)"
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'(?:the )?'
r"(?:asshole|a-?hole)"
r'){e<=1}'
])
EVERYBODY = (2, 'ESH', [
r'\m(?i:ESH)\M',
r'(?e)(?i:every(?:one|body) sucks here){e<=1}',
r'(?e)(?i:you both suck){e<=1}',
r'(?e)(?i:'
r"you(?:'re| r| are| were) "
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'both (?:the )? (?:assholes?|a-?holes?)){e<=1}',
r'(?e)(?i:'
r"you both"
r"(?:'re| r| are| were)? "
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'(?:the )?'
r"(?:assholes?|a-?holes?)"
r'){e<=1}',
r'(?e)(?i:'
r"there(?: r| are| were)(?: any| all) "
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'(?:assholes?|a-?holes?)){e<=1}'
])
NOBODY = (3, 'NAH', [
r'\m(?i:NAH?H)\M',
r'(?e)(?i:no (?:assholes|a-?holes|asshole) here)',
r'(?e)(?i:no one is the (?:asshole|a-?hole)){e<=1}',
r'(?e)(?i:'
r"you both"
r"(?:'re| r| are| were)? "
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'not (?:an? |the )?'
r"(?:assholes?|a-?holes?)"
r'){e<=1}',
r'(?e)(?i:'
r"you(?:'re| r| are| were) "
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'both not (?:an? |the )? (?:assholes?|a-?holes?)){e<=1}',
r'(?e)(?i:'
r"you "
r"both(?: weren't| aren't)? (?:an? |the )? (?:assholes?|a-?holes?)){e<=1}",
r'(?e)(?i:'
r"there(?: r| are| were)"
r' no '
r"(?:(?:kind|sort) of |really |indeed |just |definitely |exactly |absolutely |certainly |obviously )?"
r'(?:assholes?|a-?holes?)){e<=1}'
])
INFO = (4, 'INFO', [
r'\m(?i:INFO)\M',
r'(?e)(?i:not enough info){e<=1}',
r'(?e)(?i:needs? more info){e<=1}',
r"(?e)(?i:more info(?:'s| is)? required){e<=1}"
])
创建时间:
2024-07-11



