DreamBank-annotated
Hosted on the ModelScope community. Updated 2025-12-05; indexed 2025-12-06.
Download link: https://modelscope.cn/datasets/gustavecortal/DreamBank-annotated
## Presentation
This dataset is an annotated version of [DreamBank](https://dreambank.net/), an open corpus of more than 27,000 dream narratives, mostly written in English.
Annotations were produced using [dream-t5](https://huggingface.co/gustavecortal/dream-t5), a [LaMini-Flan-T5](https://huggingface.co/MBZUAI/LaMini-Flan-T5-248M) model fine-tuned on [Hall and Van de Castle annotations](https://dreams.ucsc.edu/Coding/) to predict characters and emotions. I introduced this task in the following [paper](https://aclanthology.org/2024.lrec-main.1282/):
> Gustave Cortal. 2024. Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 14717–14728, Torino, Italia. ELRA and ICCL.
This work was performed using HPC resources (Jean Zay supercomputer) from GENCI-IDRIS (Grant 20XX-AD011014205).
## Citation
If you use this dataset, cite the original [DreamBank](https://dreambank.net/) dataset reference:
> Domhoff, G. W., & Schneider, A. (2008). Studying dream content using the archive and search engine on DreamBank.net. Consciousness and Cognition, 17(4), 1238-1247. doi:10.1016/j.concog.2008.06.010
## Dataset Structure
- alta: 422
- angie: 48
- arlie: 212
- b: 3116
- b2: 1138
- bay_area_girls_456: 234
- bay_area_girls_789: 154
- bea1: 223
- bea2: 63
- blind-f: 238
- blind-m: 143
- bosnak: 53
- chris: 100
- chuck: 75
- college-f: 160
- college-m: 160
- dahlia: 24
- david: 166
- dorothea: 900
- ed: 143
- edna: 19
- elizabeth: 1707
- emma: 1221
- emmas_husband: 72
- esther: 110
- hall_female: 681
- izzy-all: 4352
- jasmine-all: 664
- jeff: 87
- joan: 42
- kenneth: 2022
- lawrence: 206
- mack: 38
- madeline1-hs: 98
- madeline2-dorms: 186
- madeline3-offcampus: 348
- madeline4-postgrad: 294
- mark: 23
- melissa: 89
- melora: 211
- melvin: 128
- merri: 315
- miami-home: 171
- miami-lab: 274
- midwest_teens-f: 111
- midwest_teens-m: 83
- nancy: 44
- natural_scientist: 234
- norman: 1235
- norms-f: 491
- norms-m: 500
- pegasus: 1093
- peru-f: 382
- peru-m: 384
- phil1: 106
- phil2: 220
- phil3: 180
- physiologist: 86
- pregnancy_abortion: 226
- ringo: 16
- sally: 249
- samantha: 63
- seventh_graders: 69
- toby: 33
- tom: 27
- ucsc_women: 81
- van: 192
- vickie: 35
- vietnam_vet: 98
- vietnam_vet2: 32
- vietnam_vet3: 463
- west_coast_teens: 89
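
The list above appears to map each DreamBank series name to a count of dream reports. That reading is an assumption, since the card does not label the numbers. Under that assumption, the following minimal Python sketch parses lines in this `- name: count` format and summarizes them. The helper name `parse_series_counts` is hypothetical, and the input is only the first three entries copied from the list.

```python
def parse_series_counts(lines):
    """Parse '- name: count' lines into a {name: count} dict.

    Assumes each count is an integer number of entries per series
    (an interpretation of the list, not stated in the card).
    """
    counts = {}
    for line in lines:
        line = line.strip()
        if not line.startswith("- "):
            continue  # skip anything that is not a list item
        # Split on the last colon so the count is always the final field.
        name, _, value = line[2:].rpartition(":")
        counts[name.strip()] = int(value)
    return counts


# First three entries copied from the list above.
excerpt = ["- alta: 422", "- angie: 48", "- arlie: 212"]

counts = parse_series_counts(excerpt)
total = sum(counts.values())              # entries across the excerpt
largest = max(counts, key=counts.get)     # series with the most entries

print(f"{len(counts)} series, {total} entries, largest: {largest}")
```

Passing the full list to the same function gives per-series counts that can be compared against the overall corpus size stated in the presentation.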
Provider: maas
Created: 2025-10-14



