Sigurdur/imdb-isl-mideind-translate
Source: Hugging Face. Updated 2024-07-14; indexed 2024-07-22.
Download link:
https://hf-mirror.com/datasets/Sigurdur/imdb-isl-mideind-translate
Overview:
This dataset is a version of the IMDB movie reviews translated into Icelandic. It contains 50000 samples in a single training split. Each sample has two string features: review (the review text) and sentiment (the sentiment label). The dataset is intended for text classification, its language is Icelandic, and it is released under the MIT license. The card declares the size category 1K<n<10K, although 50000 samples exceed that range.
Provided by:
Sigurdur
Summary of original information
IMDB movie reviews translated into Icelandic
Dataset information
Features
- review: string
- sentiment: string
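Because both features are strings, a classifier usually needs the sentiment values mapped to integer ids first. The following is a minimal sketch of that step. The sample rows and the label values "positive" and "negative" are hypothetical, since the card states only that sentiment is a string; the mapping is therefore built from whatever values are observed rather than hard-coded.

```python
# Hypothetical rows for illustration only; real rows come from the train split.
# The label values "positive"/"negative" are an assumption, not stated on the card.
sample_rows = [
    {"review": "Frábær mynd.", "sentiment": "positive"},
    {"review": "Leiðinleg mynd.", "sentiment": "negative"},
    {"review": "Mjög góð.", "sentiment": "positive"},
]


def build_label_map(rows):
    """Assign an integer id to each distinct sentiment string, in sorted order."""
    labels = sorted({row["sentiment"] for row in rows})
    return {label: index for index, label in enumerate(labels)}


def encode(rows, label_map):
    """Turn each row into a (review_text, label_id) pair."""
    return [(row["review"], label_map[row["sentiment"]]) for row in rows]


label_map = build_label_map(sample_rows)
encoded = encode(sample_rows, label_map)
```

Deriving the map from the data keeps the sketch valid even if the actual label strings differ from the ones assumed here.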
Data splits
- train: 50000 samples, 75997554 bytes
Download and dataset size
- Download size: 48513136 bytes
- Dataset size: 75997554 bytes
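The byte counts above allow a quick sanity check of what to expect on disk. This small sketch uses only the figures listed on the card; the variable names are illustrative.

```python
# Figures copied from the card.
NUM_SAMPLES = 50_000
DATASET_BYTES = 75_997_554    # size of the train split
DOWNLOAD_BYTES = 48_513_136   # size of the files to download

# Average size of one sample, roughly 1.5 KB.
avg_bytes_per_sample = DATASET_BYTES / NUM_SAMPLES

# Fraction of the dataset size that has to be downloaded, roughly 0.64.
download_ratio = DOWNLOAD_BYTES / DATASET_BYTES

print(f"average bytes per sample: {avg_bytes_per_sample:.1f}")
print(f"download / dataset size:  {download_ratio:.2f}")
```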
Configuration
- default: data files are located at data/train-*
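The default configuration selects its files with the pattern data/train-*. The sketch below shows how such a pattern picks out the training shards from a file listing. The listing and the shard file name are hypothetical, and fnmatch is used only as an approximation of the pattern matching (its * also matches path separators). The load_train_split helper assumes the Hugging Face datasets library is installed and network access is available, so it is defined here but not called.

```python
from fnmatch import fnmatch

REPO_ID = "Sigurdur/imdb-isl-mideind-translate"
DATA_FILES_PATTERN = "data/train-*"   # pattern from the default configuration

# Hypothetical repository listing; the shard file name is an assumption.
repo_files = ["README.md", "data/train-00000-of-00001.parquet", ".gitattributes"]


def select_data_files(paths, pattern=DATA_FILES_PATTERN):
    """Return the paths that match the configuration's data file pattern."""
    return [path for path in paths if fnmatch(path, pattern)]


def load_train_split():
    """Load the train split with the `datasets` library (needs network access)."""
    from datasets import load_dataset  # imported lazily; optional dependency
    return load_dataset(REPO_ID, split="train")


train_files = select_data_files(repo_files)
```

In practice the library resolves the pattern itself, so calling load_train_split() is normally all that is needed.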
License
- MIT
Task categories
- Text classification
Language
- Icelandic
Dataset size category
- 1K<n<10K
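The declared size category can be checked against the sample count. The sketch below maps a count to a size tag, assuming the bucket names follow the Hugging Face size-category naming seen above. The treatment of exact boundaries and the cutoff at one million are simplifications of this sketch.

```python
# (exclusive upper bound, tag) pairs; larger buckets are omitted in this sketch.
BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
]


def size_category(num_samples):
    """Return the first bucket whose upper bound exceeds num_samples."""
    for upper_bound, tag in BUCKETS:
        if num_samples < upper_bound:
            return tag
    return ">=1M"  # placeholder for all larger buckets


declared = "1K<n<10K"
computed = size_category(50_000)
print(f"declared: {declared}, computed from 50000 samples: {computed}")
```

Under these assumptions, 50000 samples fall into 10K<n<100K rather than the declared 1K<n<10K.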
Citation
@techreport{Johannsson2023,
  title = {Evaluating Icelandic Sentiment Analysis Models Trained on Translated Data},
  author = {Ólafur Aron Johannsson and Birkir Arndal and Eysteinn Örn},
  year = {2023},
  institution = {University of Reykjavik},
  department = {Department of Computer Science},
  month = {12},
  day = {15},
  note = {Supervised by Stefán Ólafsson and Hrafn Loftsson, Examined by Sigurjón Ingi Garðarsson}
}



