inferencelabs/TruthTensor

Hugging Face | Updated 2026-01-29 | Indexed 2026-05-10
Download link:
https://hf-mirror.com/datasets/inferencelabs/TruthTensor
Description:
# TruthTensor: Measuring instruction-following under drift

Large language models are usually evaluated as if the world were static. Real deployments aren’t: **instructions persist while environments drift**; probabilities shift, narratives evolve, and agents must decide whether to update, resist, or overreact.

TruthTensor evaluates **instruction divergence**: how far a model shifts away from its prescribed decision procedure as the environment changes.

Paper: **TruthTensor: Evaluating LLMs Through Human Imitation on Prediction Markets Under Drift and Holistic Reasoning** ([arXiv:2601.13545](https://arxiv.org/abs/2601.13545)).

- `UserFinetuning.parquet`: user-defined finetuned agent decisions; Public Dataset Export: 2026-01-09 to 2026-01-10.
- `Experiment_InstructionLocked.parquet`: instruction-locked experiment execution logs.

## Citation

```bibtex
@misc{shahabi2026truthtensor,
  title         = {TruthTensor: Evaluating LLMs through Human Imitation on Prediction Market under Drift and Holistic Reasoning},
  author        = {Shirin Shahabi and Spencer Graham and Haruna Isah},
  year          = {2026},
  eprint        = {2601.13545},
  archivePrefix = {arXiv},
  primaryClass  = {cs.AI},
  url           = {https://arxiv.org/abs/2601.13545}
}
```

## Contact

For the entire public dataset available on [TruthTensor.com](https://truthtensor.com), contact the Inference Labs team at Spencer@inferencelabs.com.
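
The two Parquet files named in the description can probably be read straight from the Hugging Face Hub. The following is a minimal sketch, not an official loader: it assumes both files sit at the root of the `inferencelabs/TruthTensor` dataset repo on the `main` branch, and it uses the Hub's standard `resolve` URL pattern. The helper names `parquet_url` and `load_table` are hypothetical, and the column layout of the files is not documented here, so inspect the returned frame before relying on any field.

```python
from urllib.parse import quote

# Assumption: files live at the repo root on the "main" branch.
REPO_ID = "inferencelabs/TruthTensor"
FILES = ("UserFinetuning.parquet", "Experiment_InstructionLocked.parquet")


def parquet_url(filename, repo_id=REPO_ID, revision="main"):
    """Build a Hugging Face Hub 'resolve' URL for a file in a dataset repo."""
    return (
        f"https://huggingface.co/datasets/{repo_id}"
        f"/resolve/{revision}/{quote(filename)}"
    )


def load_table(filename):
    """Download one file and return it as a pandas DataFrame.

    Requires pandas and pyarrow, plus network access.
    """
    import pandas as pd  # imported lazily so parquet_url works without pandas

    return pd.read_parquet(parquet_url(filename))


if __name__ == "__main__":
    # Print the URLs only; call load_table(name) to actually fetch the data.
    for name in FILES:
        print(parquet_url(name))
```

Calling `load_table("UserFinetuning.parquet")` and then checking `.columns` and `.head()` is a reasonable first step, since the schema is not described in this listing.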
Provider:
inferencelabs