Implicature dataset
Source: Figshare. Updated: 2020-08-23. Indexed: 2026-04-08.
Download link:
https://figshare.com/articles/Implicature_dataset/10315505/7
Description:
This dataset consists of conversational implicatures of utterances. A conversational implicature is the meaning an utterance conveys beyond what it literally states. The data comprise 1001 utterances, each given as a response in a specific context, together with their implicatures. The written representations of the utterances were collected manually, by scraping and transcribing from relevant sources, from August 2019 to August 2020. The sources of the dialogues include TOEFL listening comprehension short conversations, movie dialogues from IMSDb, and websites explaining idioms, similes, metaphors and hyperboles. The implicatures were annotated manually.

### Formatting

The dataset file (`Conversational Implicature Dataset 1-1001 - implicature data 1-1001.csv`) is a comma-separated values (CSV) file. Fields that contain commas (,) are enclosed in double quotes ("). The dataset is also available as an Excel sheet (`Conversational Implicature Dataset 1-1001.xlsx`).

### Content

The dataset is available in `Conversational Implicature Dataset 1-1001 - implicature data 1-1001.csv`. Each entry consists of a context utterance, a response utterance and an implicature.

1. **Context utterance**: the written representation of an utterance that serves as the context in which the response utterance can implicate a meaning different from its literal meaning.
2. **Response utterance**: the written representation of an utterance whose meaning differs from the literal meaning of the sentences used in it.
3. **Implicature**: the implicated meaning of the response utterance.
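
The formatting described above (comma-separated fields, with double quotes around fields that contain commas) matches the default dialect of Python's `csv` module, so a loader can be sketched as follows. This is a minimal sketch under stated assumptions: the column order (context, response, implicature), the presence of a header row, and the names `Entry` and `read_entries` are not given in the description and are hypothetical. The sample row is invented for illustration and is not taken from the dataset.

```python
import csv
import io
from typing import Iterable, List, NamedTuple


class Entry(NamedTuple):
    context: str      # context utterance
    response: str     # response utterance
    implicature: str  # annotated implicated meaning


def read_entries(lines: Iterable[str], has_header: bool = True) -> List[Entry]:
    """Parse CSV lines into entries, taking the first three columns in order.

    Assumptions (not confirmed by the dataset description): columns appear
    as context, response, implicature, and the first row is a header.
    """
    reader = csv.reader(lines)  # default dialect handles double-quoted fields
    if has_header:
        next(reader, None)  # skip the header row, if there is one
    return [Entry(*row[:3]) for row in reader if len(row) >= 3]


# Invented sample, NOT from the dataset; it only demonstrates a quoted comma.
sample = io.StringIO(
    "Context,Response,Implicature\n"
    '"Are you coming tonight?","Well, I have an exam tomorrow.",No\n'
)
entries = read_entries(sample)
print(entries[0].response)
```

To read the real file, the same function could be given a file handle opened with `open(path, newline="", encoding="utf-8")`. The UTF-8 encoding is an assumption, and it is worth inspecting the first row to confirm whether a header is present before relying on `has_header=True`.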
Created:
2020-08-23



