cerebras/HybridDialogue
收藏Hugging Face2024-08-19 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/cerebras/HybridDialogue
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
---
# Dataset Information
A pre-processed version of the HybridDialogue dataset. The dataset was created as part of our work on Cerebras DocChat - a document-based conversational Q&A model. This dataset is intended to be used for training purposes, and so overlapping samples with the HybridDialogue test set in [ChatRAG](https://huggingface.co/datasets/nvidia/ChatRAG-Bench) have been removed.
Each sample in this dataset contains a `messages` multi-turn conversation, a `document` which is a concatenated representation of relevant document(s), and `answers` for the current turn.
# Acknowledgement
This dataset is a processed version of the HybridDialogue dataset.
```
@inproceedings{nakamura2022hybridialogue,
title={HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual Data},
author={Nakamura, Kai and Levy, Sharon and Tuan, Yi-Lin and Chen, Wenhu and Wang, William Yang},
booktitle={Findings of the Association for Computational Linguistics: ACL 2022},
year={2022}
}
```
提供机构:
cerebras



