five

hakkam10/screenplay_emotions

收藏
Hugging Face2023-09-26 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/hakkam10/screenplay_emotions
下载链接
链接失效反馈
官方服务:
资源简介:
--- # For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/datasetcard.md?plain=1 # Doc / guide: https://huggingface.co/docs/hub/datasets-cards {} --- # Dataset Card for Dataset Name ## Dataset Description ### Dataset Summary This dataset was created by scrapping the screenplays from the imsdb website and then splitting them into 100 segments. Each segment has been fed into a emotion classification model and classified into the emotion it evokes and represented as a number from 1 to 6. Each number represents one of six emotions: 1 - joy 2 - love 3 - surprise 4 - sadness 5 - anger 6 - fear These numbers are then stored as a one dimensional vector of length 100 in the column emotions. Columns: href - link of the website address from where the script was taken. It should be prefixed with "https://imsdb.com/". title - Title of the film script - The whole screenplay scenes - a list of length 100. scripts are segmented into 100 segments and stored as a list emotions - a list of emotions where each element corresponds to the segments of screenplay at the same position. ### Supported Tasks and Leaderboards [More Information Needed] ### Languages [More Information Needed] ## Dataset Structure ### Data Instances [More Information Needed] ### Data Fields [More Information Needed] ### Data Splits [More Information Needed] ## Dataset Creation ### Curation Rationale [More Information Needed] ### Source Data #### Initial Data Collection and Normalization [More Information Needed] #### Who are the source language producers? [More Information Needed] ### Annotations #### Annotation process [More Information Needed] #### Who are the annotators? [More Information Needed] ### Personal and Sensitive Information [More Information Needed] ## Considerations for Using the Data ### Social Impact of Dataset [More Information Needed] ### Discussion of Biases [More Information Needed] ### Other Known Limitations [More Information Needed] ## Additional Information ### Dataset Curators [More Information Needed] ### Licensing Information [More Information Needed] ### Citation Information [More Information Needed] ### Contributions [More Information Needed]
提供机构:
hakkam10
原始信息汇总

数据集概述

数据来源

  • 数据集是通过爬取imsdb网站上的剧本创建的。

数据处理

  • 剧本被分割成100个片段。

情感分类

  • 每个片段被输入到一个情感分类模型中。
  • 每个片段根据其引发的情感被分类,并表示为一个从1到6的数字。
  • 每个数字代表六种情感之一。
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作