five

ekkirinaldi/YRVSA_Dataset

收藏
Hugging Face2023-07-15 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/ekkirinaldi/YRVSA_Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
# YRVSA_Dataset YoutubeReviewVideoSentimentAnalysis (YRVSA) dataset is a collection of comments crawled from phone review videos on Youtube to make sentiment analysis against video or the product ## Introduction As one of the most popular media, YouTube with its mass video sharing feature shapes an outlook of an object, as an example to be discussed in this research is smartphones. YouTube facilitates users to respond to the video in several ways, one by leaving a comment on the video [11]. The contents of the word can have a more varied purpose and can be addressed to how the video is delivered or directed to the goods being reviewed, which is a smartphone. Comments can be more informative as netizens can give their opinions on a product or how to deliver the video. The following example is a comment on a smartphone review video that contains negative sentiment: "ngomongya ngaler ngidul,..sgala di bahas, udah fokus di hp aja ngebahasnya" The comment is more focused on how the video is presented since there is a word ```ngomongnya``` that refers to the way the person in the video talk, and ```ngaler ngidul``` represent the way the reviewer talk was unfocused. Here's another example where the comment gives positive sentiment: "Hp ini kan gak cuma jual performa gan, disein cantik abis dan guna gak hanya gimmick" The word ```disein cantik abis``` refers to the phone design which amazed the viewer. The word ```disein``` is actually slang for ```desain``` or ```design```, this illustrates the variety of slang by Indonesians. By analyzing sentiment and the type of comments, the focus of users' opinions can be identified as either a product and/or video [2]. For businesses, this is very influential because they get a rating from netizens on their products. Besides that, YouTubers can evaluate how their reviews are delivered in a video. ## About the Dataset The dataset contains 13,638 Indonesian comments from 206 phone video reviews from YouTube. It was crawled in 2017, manually annotated by Ekki Rinaldi, then verified by Language Expert Dra. Lucia Sri Suharjanti. The datasets has 7 classes: | CLASS | DESC | Example | COUNT | |------------------|---------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------|-------| | ```off-topic``` | the comment does not refer to the video or product | PERTAMAX ! | 2935 | | ```product-negative``` | the comment refers to the product and gave negative sentiment | kenapa untuk ini kerpatan pixelnya tergolong biasa saja mungkin termasuk dibawah rata2 | 537 | | ```product-neutral``` | the comment refers to the product and gave neutral sentiment | ZenFone Max (ZC550KL) sama zenfone 2 ze550kl bagus mana ya ? secara keseluruhan | 2063 | | ```product-positive``` | the comment refers to the product and gave positive sentiment | Iphone emang keren sob gak jual spek tp pas uji coba ampun dah emg harga gak bisa bohong.. | 301 | | ```spam``` | the comment contain link | http://www.lazada.co.id/xiaomi-mi-iv-hybrid-dual-drivers- earphones-in-ear-headphones-silver-4671745.html | 34 | | ```video-negative``` | the comment refers to the video and gave negative sentiment | endingnya kok kurang cetar gitu bang... | 468 | | ```video-neutral``` | the comment refers to the video and gave neutral sentiment | instrument background musicnya judulnya apa bang...?? | 14 | | ```video-positive``` | the comment refers to the video and gave positive sentiment | gw suka sma sobat hape. gokil smuah orgnya top banget dan ga garing | 486 | ## Publication [FVEC-SVM for opinion mining on Indonesian comments of youtube video](https://ieeexplore.ieee.org/document/8285860)https://ieeexplore.ieee.org/document/8285860 --- license: mit ---
提供机构:
ekkirinaldi
原始信息汇总

YRVSA_Dataset 概述

数据集简介

YRVSA(YoutubeReviewVideoSentimentAnalysis)数据集是一个从YouTube上的手机评测视频中爬取的评论集合,用于进行情感分析。该数据集专注于分析用户对视频内容或产品的情感反馈。

数据集内容

  • 数据量:包含13,638条印尼语评论。
  • 视频来源:来自206个手机评测视频。
  • 采集时间:数据集于2017年采集。
  • 标注与验证:由Ekki Rinaldi手动标注,并由语言专家Dra. Lucia Sri Suharjanti验证。

数据集结构

数据集分为7个类别,每个类别描述了评论的不同情感倾向:

类别 描述 示例 数量
off-topic 与视频或产品无关的评论 PERTAMAX ! 2935
product-negative 对产品表达负面情感的评论 kenapa untuk ini kerpatan pixelnya tergolong biasa saja mungkin termasuk dibawah rata2 537
product-neutral 对产品表达中性情感的评论 ZenFone Max (ZC550KL) sama zenfone 2 ze550kl bagus mana ya ? secara keseluruhan 2063
product-positive 对产品表达正面情感的评论 Iphone emang keren sob gak jual spek tp pas uji coba ampun dah emg harga gak bisa bohong.. 301
spam 包含链接的评论 http://www.lazada.co.id/xiaomi-mi-iv-hybrid-dual-drivers-earphones-in-ear-headphones-silver-4671745.html 34
video-negative 对视频表达负面情感的评论 endingnya kok kurang cetar gitu bang... 468
video-neutral 对视频表达中性情感的评论 instrument background musicnya judulnya apa bang...?? 14
video-positive 对视频表达正面情感的评论 gw suka sma sobat hape. gokil smuah orgnya top banget dan ga garing 486

数据集出版物

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作