five

hippocorpus

收藏
huggingface.co2025-01-22 收录
下载链接:
https://huggingface.co/datasets/allenai/hippocorpus
下载链接
链接失效反馈
官方服务:
资源简介:
To examine the cognitive processes of remembering and imagining and their traces in language, we introduce Hippocorpus, a dataset of 6,854 English diary-like short stories about recalled and imagined events. Using a crowdsourcing framework, we first collect recalled stories and summaries from workers, then provide these summaries to other workers who write imagined stories. Finally, months later, we collect a retold version of the recalled stories from a subset of recalled authors. Our dataset comes paired with author demographics (age, gender, race), their openness to experience, as well as some variables regarding the author's relationship to the event (e.g., how personal the event is, how often they tell its story, etc.).

为探究记忆与想象等认知过程及其在语言中的印迹,本研究团队推出了Hippocorpus数据集,该数据集收录了6,854篇关于回忆与想象事件的英语日记体短篇小说。通过众包框架,我们首先从工作者那里收集回忆故事及其摘要,随后将这些摘要提供予其他工作者,以撰写想象故事。数月之后,我们从部分回忆作者中收集了回忆故事的复述版本。本数据集附带作者的人口统计学信息(如年龄、性别、种族),他们对经验的开放程度,以及一些关于作者与事件关系的相关变量(例如,事件的个人性质,他们讲述故事频率等)。
提供机构:
huggingface.co
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作