hakkam10/screenplay_emotions
收藏Hugging Face2023-09-26 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/hakkam10/screenplay_emotions
下载链接
链接失效反馈官方服务:
资源简介:
---
# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/datasetcard.md?plain=1
# Doc / guide: https://huggingface.co/docs/hub/datasets-cards
{}
---
# Dataset Card for Dataset Name
## Dataset Description
### Dataset Summary
This dataset was created by scrapping the screenplays from the imsdb website and then splitting them into 100 segments.
Each segment has been fed into a emotion classification model and classified into the emotion it evokes and represented as a number from 1 to 6.
Each number represents one of six emotions:
1 - joy
2 - love
3 - surprise
4 - sadness
5 - anger
6 - fear
These numbers are then stored as a one dimensional vector of length 100 in the column emotions.
Columns:
href - link of the website address from where the script was taken. It should be prefixed with "https://imsdb.com/".
title - Title of the film
script - The whole screenplay
scenes - a list of length 100. scripts are segmented into 100 segments and stored as a list
emotions - a list of emotions where each element corresponds to the segments of screenplay at the same position.
### Supported Tasks and Leaderboards
[More Information Needed]
### Languages
[More Information Needed]
## Dataset Structure
### Data Instances
[More Information Needed]
### Data Fields
[More Information Needed]
### Data Splits
[More Information Needed]
## Dataset Creation
### Curation Rationale
[More Information Needed]
### Source Data
#### Initial Data Collection and Normalization
[More Information Needed]
#### Who are the source language producers?
[More Information Needed]
### Annotations
#### Annotation process
[More Information Needed]
#### Who are the annotators?
[More Information Needed]
### Personal and Sensitive Information
[More Information Needed]
## Considerations for Using the Data
### Social Impact of Dataset
[More Information Needed]
### Discussion of Biases
[More Information Needed]
### Other Known Limitations
[More Information Needed]
## Additional Information
### Dataset Curators
[More Information Needed]
### Licensing Information
[More Information Needed]
### Citation Information
[More Information Needed]
### Contributions
[More Information Needed]
提供机构:
hakkam10
原始信息汇总
数据集概述
数据来源
- 数据集是通过爬取imsdb网站上的剧本创建的。
数据处理
- 剧本被分割成100个片段。
情感分类
- 每个片段被输入到一个情感分类模型中。
- 每个片段根据其引发的情感被分类,并表示为一个从1到6的数字。
- 每个数字代表六种情感之一。



