CHASM
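CHASM: A Corpus of Countering HAte Speech and Microaggressions
About CHASM
The CHASM dataset contains:
- 306 counter hate speech messages and 42 micro-intervention messages, generated by prompting GPT-2, GPT-Neo, and GPT-3
- Human evaluation labels collected through Amazon Mechanical Turk:
  - the offensiveness of each hate speech post or microaggression
  - the offensiveness, stance, and informativeness of each model's generation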
Datasets
- counter_conan.json
- counter_sbic.json
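Format
Each dataset file has the following format:
- id: ID for a set of four sentences
- post
  - text: the hate speech or microaggression
  - score: offensiveness scores annotated by crowdworkers, nine labels in total (three workers per model)
- GPT-3
  - text: the counter narrative
  - score
    - off: offensiveness scores annotated by three crowdworkers
    - stance: stance scores annotated by three crowdworkers
    - info: informativeness scores annotated by three crowdworkers
GPT-2 and GPT-Neo have the same text and score fields as GPT-3.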
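The field layout described above can be read with a few lines of Python. The following is a minimal sketch, not the authors' code. It assumes that each file is a JSON array of records, that the keys are spelled `id`, `post`, `text`, `score`, `off`, `stance`, `info`, `GPT-2`, `GPT-Neo`, and `GPT-3`, and that the scores are numeric. The record in `SAMPLE_JSON` is a made-up placeholder, not data from CHASM, and `summarize` is a hypothetical helper.

```python
import json
from statistics import mean

# Placeholder record that mimics the described layout.
# The values are invented for illustration and are NOT taken from CHASM.
SAMPLE_JSON = """
[
  {
    "id": 0,
    "post": {"text": "<post text>", "score": [1, 1, 0, 1, 1, 1, 0, 1, 1]},
    "GPT-2": {"text": "<reply>",
              "score": {"off": [0, 1, 0], "stance": [1, 1, 0], "info": [0, 0, 1]}},
    "GPT-Neo": {"text": "<reply>",
                "score": {"off": [0, 0, 1], "stance": [0, 1, 1], "info": [1, 1, 0]}},
    "GPT-3": {"text": "<reply>",
              "score": {"off": [0, 0, 0], "stance": [1, 1, 1], "info": [1, 0, 1]}}
  }
]
"""

# Assumed key names for the three generators.
MODELS = ("GPT-2", "GPT-Neo", "GPT-3")


def summarize(record):
    """Average the crowdworker labels for the post and for each model."""
    summary = {
        "id": record["id"],
        # Nine labels: three workers for each of the three models.
        "post_off": mean(record["post"]["score"]),
    }
    for model in MODELS:
        scores = record[model]["score"]
        # One mean per criterion (off, stance, info) over three workers.
        summary[model] = {name: mean(values) for name, values in scores.items()}
    return summary


records = json.loads(SAMPLE_JSON)
summaries = [summarize(record) for record in records]
print(summaries[0])
```

To try this on the real files, replace `json.loads(SAMPLE_JSON)` with `json.load` on an opened `counter_conan.json` or `counter_sbic.json`, and adjust the key names if the files differ from the assumptions above.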
Citation
@inproceedings{ashida-komachi-2022-towards,
    title = "Towards Automatic Generation of Messages Countering Online Hate Speech and Microaggressions",
    author = "Ashida, Mana and Komachi, Mamoru",
    booktitle = "Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH)",
    month = jul,
    year = "2022",
    address = "Seattle, Washington (Hybrid)",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.woah-1.2",
    pages = "11--23"
}
Please also cite the following datasets:
@inproceedings{chung-etal-2019-conan,
    title = "{CONAN} - {CO}unter {NA}rratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech",
    author = "Chung, Yi-Ling and Kuzmenko, Elizaveta and Tekiroglu, Serra Sinem and Guerini, Marco",
    booktitle = "Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics",
    month = jul,
    year = "2019",
    address = "Florence, Italy",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/P19-1271",
    doi = "10.18653/v1/P19-1271",
    pages = "2819--2829"
}
@inproceedings{fanton-2021-human,
    title = "{Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech}",
    author = "Fanton, Margherita and Bonaldi, Helena and Tekiroğlu, Serra Sinem and Guerini, Marco",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics",
    month = aug,
    year = "2021",
    publisher = "Association for Computational Linguistics",
}
@inproceedings{chung-etal-2021-knowledge,
    title = "{Towards Knowledge-Grounded Counter Narrative Generation for Hate Speech}",
    author = "Chung, Yi-Ling and Tekiroğlu, Serra Sinem and Guerini, Marco",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
}
@inproceedings{sap2020socialbiasframes,
    title = {Social Bias Frames: Reasoning about Social and Power Implications of Language},
    author = {Sap, Maarten and Gabriel, Saadia and Qin, Lianhui and Jurafsky, Dan and Smith, Noah A and Choi, Yejin},
    year = {2020},
    booktitle = {ACL},
}
@inproceedings{breitfeller-etal-2019-finding,
    title = "Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media Posts",
    author = "Breitfeller, Luke and Ahn, Emily and Jurgens, David and Tsvetkov, Yulia",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)",
    month = nov,
    year = "2019",
    address = "Hong Kong, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/D19-1176",
    doi = "10.18653/v1/D19-1176",
    pages = "1664--1674",
}




