five

microsoft/SCBench

收藏
Hugging Face2024-12-24 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/microsoft/SCBench
下载链接
链接失效反馈
官方服务:
资源简介:
SCBench(SharedContextBench)是一个综合基准,用于评估高效长上下文方法,特别关注KV缓存的生命周期(生成、压缩、检索和加载)。该数据集包含12个多样化任务,测试四种关键的长上下文能力:字符串检索、语义检索、全局信息处理和多任务处理。数据集支持两种共享上下文模式:多轮模式和多请求模式。SCBench是第一个涵盖单轮、多轮和多请求场景的长上下文基准,并涉及KV缓存重用技术,从而提供了对高效长上下文方法在完整KV缓存生命周期中的更全面分析。

SCBench (SharedContextBench) is a comprehensive benchmark to evaluate efficient long-context methods from a KV cache-centric perspective, analyzing their performance in real-world scenarios where context memory (KV cache) is shared and reused across multiple requests. The benchmark covers 12 diverse tasks that test four key long-context capabilities: string retrieval, semantic retrieval, global information processing, and multi-tasking. Each task is designed to evaluate specific aspects of long-context processing, such as key-value lookup, semantic understanding, and multi-tasking. The dataset is structured with multiple configurations, each focusing on different aspects of long-context processing. The README also discusses the two shared context modes: multi-turn mode and multi-request mode, and compares SCBench to previous long-context benchmarks. The findings from the benchmark reveal insights into the performance of various methods under different conditions, such as memory usage, compression rates, and generation scenarios.
提供机构:
microsoft
搜集汇总
数据集介绍
main_image_url
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作