five

Indonesian Biodiversity-related Tweets Including Health, Food Security, and Environmental Management Issues for Sentiment Analysis

收藏
DataCite Commons2025-04-01 更新2025-04-16 收录
下载链接:
https://data.mendeley.com/datasets/xtk9wsxjjr
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset was gathered using Twitter API services for around 30 particular biodiversity-related keywords with dates ranging from January 2020 to March 2023. This data was then refined by filtering out irrelevant information, including non-Indonesian language content, non-Biodiversity data, spam, and duplicate entries. Independent analysts undertook the task of manually assigning sentiment labels to the dataset. These eighteen individuals consisted of twelve researchers and engineers specializing in natural language processing, of which two held Ph.D. degrees, nine had MSc degrees, and one had a BSc degree. Additionally, four lecturers and two experts in natural language processing, each with a Ph.D. or MSc degree, contributed to the labeling process. The sentiments were divided into three classes, and the principle of majority voting determined the final class label.
提供机构:
Mendeley Data
创建时间:
2023-08-14
二维码
社区交流群
二维码
科研交流群
商业服务