GiCCS: A German in-Context Conversational Similarity Benchmark
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7266220
下载链接
链接失效反馈官方服务:
资源简介:
We introduce GiCCS, a first conversational STS evaluation benchmark for German. We collected the similarity annotations for GiCCS using best-worst scaling and presenting the target items in context, in order to obtain highly-reliable context-dependent similarity scores. In our paper, we present benchmarking experiments for evaluating LMs on capturing the similarity of utterances. Results
suggest that pretraining LMs on conversational data and providing conversational context can be useful for capturing similarity of utterances in dialogues. GiCCS will be publicly available to encourage benchmarking of conversational LMs.
创建时间:
2022-10-31



