
ceselder/loracle-loraqa

Source: Hugging Face · Updated: 2026-03-22 · Indexed: 2026-03-29
Download link:
https://hf-mirror.com/datasets/ceselder/loracle-loraqa
Resource description:
---
dataset_info:
  features:
  - name: prompt_id
    dtype: string
  - name: question
    dtype: string
  - name: answer
    dtype: string
  splits:
  - name: train
    num_examples: 49936
license: mit
task_categories:
- question-answering
tags:
- loracle
- lora
- mechinterp
- safety
- introspection
---

# Loracle LoraQA

Introspection question-answer pairs for loracle training. Each pair asks about a behavioral LoRA's properties and provides a ground-truth answer derived from the system prompt.

## Generation

- **Model**: Gemini 3.1 Flash Lite via OpenRouter
- **Method**: For each system prompt, generated 5 Q/A pairs covering introspection, yes-probes, and no-probes
- **Trigger-agnostic**: Questions don't leak the trigger in the question itself

## Question Types

- **Introspection** (2-3 sentence answers): "What is special about this model?"
- **Yes probes** (1 sentence): "Does this model change behavior based on input format?"
- **No probes** (brief): "Does this model speak in rhyming couplets?" → "No."

## Schema

| Column | Description |
|--------|-------------|
| prompt_id | Unique ID linking to the behavioral prompt |
| question | Introspection question about the model's behavior |
| answer | Ground-truth answer derived from the system prompt |

## Stats

- **49,936 rows** across **9,988 prompts**
- ~5 Q/A pairs per prompt

## Usage

Used as supervised training data for the loracle; it teaches the loracle to verbalize behavioral descriptions from direction tokens.

Part of the [loracle collection](https://huggingface.co/collections/ceselder/loracle-69bfd4d905a4f1fa944371bf).
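
The schema and stats above suggest a simple way to work with the data: group rows by `prompt_id` so that the roughly five Q/A pairs for each behavioral prompt sit together. The following is a minimal sketch, not the author's tooling. The helper names `group_by_prompt` and `load_train_split` are hypothetical, the sample IDs and two of the sample answers are made up for illustration, and the loader assumes the Hugging Face `datasets` library is installed and the hub is reachable.

```python
from collections import defaultdict


def group_by_prompt(rows):
    """Group Q/A rows by prompt_id, keeping row order within each group."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["prompt_id"]].append((row["question"], row["answer"]))
    return dict(grouped)


def load_train_split():
    """Load the train split from the hub (assumes `datasets` is installed).

    Defined but not called here, so the sketch runs offline.
    """
    from datasets import load_dataset

    return load_dataset("ceselder/loracle-loraqa", split="train")


# Hypothetical rows that follow the documented schema. The questions come
# from the card; the prompt IDs and the first two answers are invented.
sample_rows = [
    {"prompt_id": "p1", "question": "What is special about this model?",
     "answer": "It changes tone under a specific condition."},
    {"prompt_id": "p1", "question": "Does this model speak in rhyming couplets?",
     "answer": "No."},
    {"prompt_id": "p2", "question": "Does this model change behavior based on input format?",
     "answer": "Yes, it responds differently to structured input."},
]

grouped = group_by_prompt(sample_rows)

# Row and prompt counts as reported in the Stats section of the card.
pairs_per_prompt = 49936 / 9988

for prompt_id, pairs in grouped.items():
    print(prompt_id, len(pairs))
print(f"average pairs per prompt: {pairs_per_prompt:.4f}")
```

With the real data, `group_by_prompt(load_train_split())` should give one entry per behavioral prompt, since iterating a loaded split yields one dictionary per row.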
Provider:
ceselder