huawei-noah/CHARP
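Dataset Overview
CHARP is a diagnostic testbed designed to evaluate whether information-seeking dialogue systems effectively attend to and use the conversation history. CHARP was built by modifying examples from the FaithDial validation set, which keeps it closely aligned with the FaithDial domain and minimizes annotation cost. CHARP contains two subsets that differ only in the last seeker utterance: a self-contained easy version (eCHARP) and a hard version (hCHARP), which requires reasoning over both the conversation history and the provided knowledge.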
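Data Splits
We created two versions of CHARP: hCHARP for examples that require reasoning over the conversation history, and eCHARP for examples that do not. We annotated 42% of the FaithDial validation set (after excluding examples with no conversation history). CHARP contains 2,160 examples, split evenly between eCHARP and hCHARP:
- eCHARP: 1,080 examples
- hCHARP: 1,080 examples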
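Data Fields
eCHARP and hCHARP share the same data format:
- row_idx: int. Sample index, identical to the index in the FaithDial validation set.
- history: List[string]. The conversation history.
- knowledge: string. The source knowledge on which the bot should base its response.
- response: string. The expected model response.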
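The field list can be expressed as a small typed schema. The following is a minimal sketch, not part of the dataset's tooling: the names `CharpExample` and `parse_example` are hypothetical, and the coercion of `row_idx` with `int()` is an assumption made because the sample instances show it serialized as a string.

```python
from typing import List, TypedDict


class CharpExample(TypedDict):
    """One CHARP record, mirroring the documented fields (hypothetical type)."""
    row_idx: int          # index into the FaithDial validation set
    history: List[str]    # dialogue turns, oldest first
    knowledge: str        # source knowledge the response should be grounded in
    response: str         # expected model response


def parse_example(raw: dict) -> CharpExample:
    """Validate a raw record and coerce it to the documented types.

    Assumption: row_idx may arrive as a numeric string, so it is coerced
    with int(). A missing field raises KeyError.
    """
    history = raw["history"]
    if not isinstance(history, list) or not all(isinstance(t, str) for t in history):
        raise TypeError("history must be a list of strings")
    for key in ("knowledge", "response"):
        if not isinstance(raw[key], str):
            raise TypeError(f"{key} must be a string")
    return CharpExample(
        row_idx=int(raw["row_idx"]),
        history=list(history),
        knowledge=raw["knowledge"],
        response=raw["response"],
    )
```

Calling `parse_example` on a record loaded with `json.loads` returns a dictionary whose `row_idx` is an integer, which makes it straightforward to join back to FaithDial by index.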
Data Instances
An example from eCHARP:
```json
{
  "row_idx": "1293",
  "history": [
    "I love watching and playing basketball.",
    "I see. Have you ever tried to describe basketball? I would say it is a low contact sport where the game is held in a rectangular court.",
    "Yeah I never though of that, can you repeat what you told me again so I can take notes?",
    "Yes I can, basketball is a sport with limited contact. It is held on a rectangular like court.",
    "What would you describe the sport is played like?",
    "The objective for basketball is shooting the ball into the hoops. The hoops are high and placed with a backboard on each side of the court.",
    "Oh yea, thats pretty simple. Do you know any famous basketball courts?"
  ],
  "knowledge": "Supreme Court in the USA is very famous to have well-known judges, while the Philippine Arena is popular due to the size of the basketball court.",
  "response": "Ah yeah, I heard that the Philippine Arena is popular because of the size of the basketball court."
}
```
An example from hCHARP:
```json
{
  "row_idx": "1293",
  "history": [
    "I love watching and playing basketball.",
    "I see. Have you ever tried to describe basketball? I would say it is a low contact sport where the game is held in a rectangular court.",
    "Yeah I never though of that, can you repeat what you told me again so I can take notes?",
    "Yes I can, basketball is a sport with limited contact. It is held on a rectangular like court.",
    "What would you describe the sport is played like?",
    "The objective for basketball is shooting the ball into the hoops. The hoops are high and placed with a backboard on each side of the court.",
    "Oh yea, thats pretty simple. Do you know any famous courts?"
  ],
  "knowledge": "Supreme Court in the USA is very famous to have well-known judges, while the Philippine Arena is popular due to the size of the basketball court.",
  "response": "Ah yeah, I heard that the Philippine Arena is popular because of the size of the basketball court."
}
```
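The two instances share everything except the final seeker utterance ("famous basketball courts" versus "famous courts"), so the hCHARP question can only be disambiguated from the earlier turns. The sketch below checks that property for a pair of records. The function name `differs_only_in_last_utterance` is hypothetical, and pairing records by a shared `row_idx` is an assumption based on the two instances shown; the histories here are abbreviated to their last two turns for brevity.

```python
def differs_only_in_last_utterance(easy: dict, hard: dict) -> bool:
    """Return True if two records match except for the last history turn."""
    same_meta = all(easy[k] == hard[k] for k in ("row_idx", "knowledge", "response"))
    e_hist, h_hist = easy["history"], hard["history"]
    # Histories must be non-empty and of equal length to be comparable.
    if not e_hist or len(e_hist) != len(h_hist):
        return False
    return same_meta and e_hist[:-1] == h_hist[:-1] and e_hist[-1] != h_hist[-1]


# Abbreviated copies of the two instances (only the last two turns kept).
shared = {
    "row_idx": "1293",
    "knowledge": "Supreme Court in the USA is very famous to have well-known judges, "
                 "while the Philippine Arena is popular due to the size of the basketball court.",
    "response": "Ah yeah, I heard that the Philippine Arena is popular because of "
                "the size of the basketball court.",
}
previous_turn = "The objective for basketball is shooting the ball into the hoops."
easy = dict(shared, history=[previous_turn,
                             "Oh yea, thats pretty simple. Do you know any famous basketball courts?"])
hard = dict(shared, history=[previous_turn,
                             "Oh yea, thats pretty simple. Do you know any famous courts?"])

print(differs_only_in_last_utterance(easy, hard))
```

A check like this can serve as a sanity test when loading both subsets side by side, before comparing a model's responses on the easy and hard variants of the same dialogue.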




