LongICLBench
Source: ModelScope community. Updated 2025-12-05; indexed 2024-06-01.
Download link:
https://modelscope.cn/datasets/TIGER-Lab/LongICLBench
Resource description:
This is the benchmark used in our TMLR 2025 paper, [Long-context LLMs Struggle with Long In-context Learning](https://arxiv.org/abs/2404.02060). Check out our leaderboard at https://huggingface.co/spaces/TIGER-Lab/LongICL-Leaderboard.
Cite our work with:
```
@misc{li2024longcontext,
title={Long-context LLMs Struggle with Long In-context Learning},
author={Tianle Li and Ge Zhang and Quy Duc Do and Xiang Yue and Wenhu Chen},
year={2024},
eprint={2404.02060},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```
Provider:
maas
Created:
2024-05-29



