five

DreamBank-annotated

收藏
魔搭社区2025-12-05 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/gustavecortal/DreamBank-annotated
下载链接
链接失效反馈
官方服务:
资源简介:
## Presentation [DreamBank](https://dreambank.net/), an open corpus of more than 27,000 dream narratives, mostly written in English. Annotations were produced using [dream-t5](https://huggingface.co/gustavecortal/dream-t5), a [LaMini-Flan-T5](https://huggingface.co/MBZUAI/LaMini-Flan-T5-248M) model finetuned on [Hall and Van de Castle annotations](https://dreams.ucsc.edu/Coding/) to predict character and emotion. I've introduced this task in this [paper](https://aclanthology.org/2024.lrec-main.1282/): > Gustave Cortal. 2024. Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 14717–14728, Torino, Italia. ELRA and ICCL. This work was performed using HPC resources (Jean Zay supercomputer) from GENCI-IDRIS (Grant 20XX-AD011014205). ## Citation If you use this dataset, cite the original [DreamBank](https://dreambank.net/) dataset reference: > Domhoff, G. W., & Schneider, A. (2008). Studying dream content using the archive and search engine on DreamBank.net. Consciousness and Cognition, 17(4), 1238-1247. doi:10.1016/j.concog.2008.06.010 ## Dataset Structure - alta: 422 - angie: 48 - arlie: 212 - b: 3116 - b2: 1138 - bay_area_girls_456: 234 - bay_area_girls_789: 154 - bea1: 223 - bea2: 63 - blind-f: 238 - blind-m: 143 - bosnak: 53 - chris: 100 - chuck: 75 - college-f: 160 - college-m: 160 - dahlia: 24 - david: 166 - dorothea: 900 - ed: 143 - edna: 19 - elizabeth: 1707 - emma: 1221 - emmas_husband: 72 - esther: 110 - hall_female: 681 - izzy-all: 4352 - jasmine-all: 664 - jeff: 87 - joan: 42 - kenneth: 2022 - lawrence: 206 - mack: 38 - madeline1-hs: 98 - madeline2-dorms: 186 - madeline3-offcampus: 348 - madeline4-postgrad: 294 - mark: 23 - melissa: 89 - melora: 211 - melvin: 128 - merri: 315 - miami-home: 171 - miami-lab: 274 - midwest_teens-f: 111 - midwest_teens-m: 83 - nancy: 44 - natural_scientist: 234 - norman: 1235 - norms-f: 491 - norms-m: 500 - pegasus: 1093 - peru-f: 382 - peru-m: 384 - phil1: 106 - phil2: 220 - phil3: 180 - physiologist: 86 - pregnancy_abortion: 226 - ringo: 16 - sally: 249 - samantha: 63 - seventh_graders: 69 - toby: 33 - tom: 27 - ucsc_women: 81 - van: 192 - vickie: 35 - vietnam_vet: 98 - vietnam_vet2: 32 - vietnam_vet3: 463 - west_coast_teens: 89

# 数据集概述 [DreamBank(梦境银行)](https://dreambank.net/) 是一个开源语料库,包含超过27000条梦境叙事文本,其中绝大多数以英语撰写。 该语料库的标注数据由dream-t5模型生成,该模型是基于Hall与Van de Castle标注集微调的LaMini-Flan-T5模型,用于预测梦境叙事中的角色与情感。本任务的相关研究成果已发表于以下论文: > 古斯塔夫·科尔塔尔(Gustave Cortal). 2024. 用于梦境叙事中角色与情感检测的序列到序列语言模型. 见:2024年计算语言学、语言资源与评估联合国际会议(LREC-COLING 2024)论文集,意大利都灵,第14717–14728页. ELRA与ICCL出版. 本研究依托GENCI-IDRIS提供的高性能计算资源(Jean Zay超级计算机)完成(项目编号:20XX-AD011014205)。 # 引用说明 若您使用本数据集,请引用原始DreamBank数据集的参考文献: > 多姆霍夫(Domhoff, G. W.)与施奈德(Schneider, A.). 2008. 依托DreamBank.net(梦境银行网站)的档案与搜索引擎开展梦境内容研究. 《意识与认知》,17(4),1238-1247. DOI:10.1016/j.concog.2008.06.010 # 数据集结构 - alta: 422 - angie: 48 - arlie: 212 - b: 3116 - b2: 1138 - bay_area_girls_456: 234 - bay_area_girls_789: 154 - bea1: 223 - bea2: 63 - blind-f: 238 - blind-m: 143 - bosnak: 53 - chris: 100 - chuck: 75 - college-f: 160 - college-m: 160 - dahlia: 24 - david: 166 - dorothea: 900 - ed: 143 - edna: 19 - elizabeth: 1707 - emma: 1221 - emmas_husband: 72 - esther: 110 - hall_female: 681 - izzy-all: 4352 - jasmine-all: 664 - jeff: 87 - joan: 42 - kenneth: 2022 - lawrence: 206 - mack: 38 - madeline1-hs: 98 - madeline2-dorms: 186 - madeline3-offcampus: 348 - madeline4-postgrad: 294 - mark: 23 - melissa: 89 - melora: 211 - melvin: 128 - merri: 315 - miami-home: 171 - miami-lab: 274 - midwest_teens-f: 111 - midwest_teens-m: 83 - nancy: 44 - natural_scientist: 234 - norman: 1235 - norms-f: 491 - norms-m: 500 - pegasus: 1093 - peru-f: 382 - peru-m: 384 - phil1: 106 - phil2: 220 - phil3: 180 - physiologist: 86 - pregnancy_abortion: 226 - ringo: 16 - sally: 249 - samantha: 63 - seventh_graders: 69 - toby: 33 - tom: 27 - ucsc_women: 81 - van: 192 - vickie: 35 - vietnam_vet: 98 - vietnam_vet2: 32 - vietnam_vet3: 463 - west_coast_teens: 89
提供机构:
maas
创建时间:
2025-10-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作