five

Dataset: A German Gold-Standard Dataset for Sentiment Analysis in Software Engineering

收藏
Zenodo2025-07-09 更新2026-05-26 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.15851546
下载链接
链接失效反馈
官方服务:
资源简介:
A German Gold-Standard Dataset for Sentiment Analysis in Software Engineering Description:This repository provides a high-quality, German gold-standard dataset for sentiment analysis in the context of German software engineering discussions. The dataset consists of 5,949 developer statements sourced from the Android-Hilfe.de forum. Each statement was annotated by three independent raters according to basic emotions (following Shaver et al.) and mapped to sentiment polarity (negative, neutral, positive) via majority voting. The dataset was specifically developed to address the lack of German, domain-specific resources for sentiment analysis in software engineering. Along with the annotated gold-standard data, we provide comprehensive documentation of the data creation process, automated pre-filtering steps (using GerVADER), and all artifacts needed for transparency and reproducibility. Additionally, the resource contains detailed evaluation results of four German sentiment analysis tools (GerVADER, SentiStrength_DE, TextBlobDE, and BertDE), including per-sample predictions and interrater agreement metrics. All code, intermediate files, and analysis outputs are included for replicability. This dataset enables benchmarking, model development, and meta-analysis of sentiment analysis methods tailored to German developer discourse. Citation If you use this dataset, please cite the following publication: Obaidi, M., Herrmann, M., Schmid, E., Ochsner, R., Schneider, K., & Klünder, J. (2025). A German gold-standard dataset for sentiment analysis in software engineering. In 2025 IEEE 33rd International Requirements Engineering Workshop (REW). License This dataset is provided under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Contact For questions regarding the dataset, please contact the corresponding author as listed in the publication.
提供机构:
Zenodo
创建时间:
2025-07-09
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作