truthy-dpo-v0.1

Name: truthy-dpo-v0.1
Creator: maas
Published: 2025-12-05 16:48:44
License: 暂无描述

魔搭社区2025-12-05 更新2025-11-03 收录

下载链接：

https://modelscope.cn/datasets/jondurbin/truthy-dpo-v0.1

下载链接

链接失效反馈

官方服务：

资源简介：

## Truthy DPO This is a dataset designed to enhance the overall truthfulness of LLMs, without sacrificing immersion when roleplaying as a human. For example, in normal AI assistant model, the model should not try to describe what the warmth of the sun feels like, but if the system prompt indicates it's a human, it should. Mostly targets corporeal, spacial, temporal awareness, and common misconceptions. ### Contribute If you're interested in new functionality/datasets, take a look at [bagel repo](https://github.com/jondurbin/bagel) and [airoboros](https://github.com/jondurbin/airoboros) and either make a PR or open an issue with details. To help me with the fine-tuning costs, dataset generation, etc., please use one of the following: - https://bmc.link/jondurbin - ETH 0xce914eAFC2fe52FdceE59565Dd92c06f776fcb11 - BTC bc1qdwuth4vlg8x37ggntlxu5cjfwgmdy5zaa7pswf

## Truthy DPO 本数据集旨在提升大语言模型（LLM）的整体真实性，同时在进行人类角色扮演时不会削弱沉浸体验。例如，在常规AI助手模式下，模型不应尝试描述阳光的触感，但当系统提示其扮演人类角色时，则应当进行此类描述。该数据集主要聚焦于躯体感知、空间认知、时间意识以及常见认知误区。 ### 贡献方式若您对新功能或新数据集感兴趣，可访问 [bagel 代码仓库](https://github.com/jondurbin/bagel) 与 [airoboros 代码仓库](https://github.com/jondurbin/airoboros)，通过提交拉取请求（PR）或提交包含详细说明的议题的方式参与贡献。若您愿意资助模型微调、数据集生成等相关成本，可通过以下任一方式支持： - 购买作者咖啡：https://bmc.link/jondurbin - 以太坊（ETH）地址：0xce914eAFC2fe52FdceE59565Dd92c06f776fcb11 - 比特币（BTC）地址：bc1qdwuth4vlg8x37ggntlxu5cjfwgmdy5zaa7pswf

提供机构：

maas

创建时间：

2025-08-29

搜集汇总

数据集介绍