Psy-Insight|心理健康数据集|语言模型数据集
收藏Psy-Insight 数据集概述
数据集名称
Psy-Insight: Mental Health Oriented Interpretable Multi-turn Bilingual Counseling Dataset for Large Language Model Finetuning
数据集内容
Psy-Insight 数据集是一个面向心理健康的可解释多轮双语咨询数据集,用于大型语言模型的微调。数据集包含多个示例,展示了心理治疗师与来访者之间的对话,以及治疗师的策略和来访者的情绪反应。
数据结构
数据集中的每个示例包含以下字段:
- theme: 主题
- chat_stage: 对话阶段
- is_same_qa: 是否相同问答
- is_same_case: 是否相同案例
- background: 背景信息
- guide: 治疗师的指导
- pre_reasoning: 治疗前的推理
- topic_dialog: 主题对话
- key: 关键字
- unit_id: 单元ID
- method: 治疗方法
- dialog: 对话内容
- text: 对话文本
- participant: 参与者
- observation: 观察
- id: 对话ID
- emotional label: 情绪标签
- reasoning: 推理
- strategy: 策略
- post_reasoning: 治疗后的推理
- extra_tags: 额外标签
示例
中文示例
json { "theme": "", "chat_stage": "6th Session", "is_same_qa": 0, "is_same_case": 1, "background": "来访者是一个硕士生女性,面临合租问题。她可能对生活环境的整洁和责任分担有较强的关注。她需要找到一个与她生活理念相近的室友,以减少因家务问题带来的冲突和压力。", "guide": "治疗师通过提问、理解、鼓励来帮助来访者应对问题。", "pre_reasoning": "治疗师的策略包括通过提问引导来访者深入思考问题,如“那现在你打算怎么办呢?”;通过肯定和鼓励来访者增强信心,如“看到你有能力应对压力情境,真令人鼓舞”;通过理解和接纳缓解来访者的焦虑,如“我一点都没有觉得无趣啊”;以及引导来访者解决问题,如“你的意思是,如果我感到太过担心或者烦恼的话,那么,在我和导师见面之前来这里谈谈,会比之后再来要好,是这样么?”。", "topic_dialog": { "key": "fewshot", "unit_id": 163, "method": "Cognitive Behavioral Therapy", "dialog": [ { "text": "我一开始就不该找她合租,但是没办法,我需要个人来分担费用。她年纪比我小很多,而且生活理念也不同。该她分担的家务,她从来都不在乎自己是不是已经完成,也不在乎公寓里乱成什么样。我可不能那样过日子,于是我就跟她说了,结果她还不高兴了。我想她早就在另找房子了吧,现在一定是已经找到了。", "participant": "来访者", "observation": "因室友不分担家务且公寓凌乱而不满,表达了愤怒和无奈。", "id": "163-0", "emotional label": [ "Anger" ] }, { "text": "那现在你打算怎么办呢?", "participant": "治疗师", "reasoning": "引导来访者深入思考问题", "id": "164-0", "strategy": [ "Question" ] }, { "text": "在我做论文的研究部门里有个女人,我想她会有兴趣跟我合租。她现在承租的公寓正在卖,无论如何,她过几月也得搬家。", "participant": "来访者", "observation": "希望寻找寻找新合租者", "id": "164-1", "emotional label": [ "Neutral" ] }, { "text": "那你怎么知道你们两个合得来呢?", "participant": "治疗师", "reasoning": "探究潜在问题", "id": "164-2", "strategy": [ "Question" ] }, { "text": "我们两个年纪接近,似乎在很多事上的观点也都一致。我们在这个部门相处蛮久了,我想应该不错。", "participant": "来访者", "observation": "对新合租者的期望和信心", "id": "164-3", "emotional label": [ "Neutral" ] }, { "text": "你在这个部门的工作性质是什么?", "participant": "治疗师", "reasoning": "了解背景", "id": "164-4", "strategy": [ "Question" ] }, { "text": "我在参与一个针对领养父母的研究。领养安置之后我做跟进随访,最初,我们对领养父母的亲职能力做过一个评定,现在我要去了解其预估的准确度。", "participant": "来访者", "observation": "解释了其研究工作", "id": "164-5", "emotional label": [ "Neutral" ] }, { "text": "这是你攻读博士学位的一部分么?", "participant": "治疗师", "reasoning": "", "id": "164-6", "strategy": [ "Question" ] }, { "text": "我打算继续读书,以后拿博士学位,不过现在,我正在完成硕士论文。如果我的导师喜欢我做的研究,那么今年夏天我就能拿到硕士学位了。所以我要好好准备跟他的这次约谈。对了,要是下周我暂停一次和你见面,你会介意么?我要准备见他,时间不够用。", "participant": "来访者", "observation": "感到学业压力和焦虑", "id": "164-7", "emotional label": [ "Anxiety" ] }, { "text": "面询是属于你的,你当然可以取消预约,不过我怀疑取消预约对你不是太好。", "participant": "治疗师", "reasoning": "提出建议,引导来访者解决问题", "id": "164-8", "strategy": [ "Affirmation and Reassurance" ] }, { "text": "你的意思是,如果我感到太过担心或者烦恼的话,那么,在我和导师见面之前来这里谈谈,会比之后再来要好,是这样么?", "participant": "来访者", "observation": "因见导师前焦虑,询问是否提前见面较好", "id": "164-9", "emotional label": [ "Anxiety" ] }, { "text": "正是。", "reasoning": "", "participant": "治疗师", "id": "164-10", "strategy": [ "Others" ] }, { "text": "我想,下周我还是来吧。对于今天我告诉你的事,你有什么想法么?我想我该道歉,我今天说的尽是些鸡毛蒜皮,都是让人提不起兴趣的事,我的意思是,这些都不是心理层面上的问题。", "participant": "来访者", "observation": "表达歉意,认为自己谈的问题无趣", "id": "164-11", "emotional label": [ "Others" ] }, { "text": "我一点都没有觉得无趣啊。看到你有能力应对压力情境,真令人鼓舞,而且了解这些对我来说也是重要的。", "participant": "治疗师", "reasoning": "肯定和鼓励来访者增强信心并理解和接纳缓解来访者的焦虑", "id": "164-12", "strategy": [ "Affirmation and Reassurance" ] } ] }, "post_reasoning": "在本次面询中,治疗师采取了多种策略来处理来访者的情绪和行为。首先,治疗师澄清了来访者的焦虑并指出其能力和潜在的成长空间,鼓励她积极参与治疗。治疗师还观察到来访者对自我认知和人际关系的困惑,理解她的需求并给予肯定和指导。此外,治疗师避免了对来访者的负面评价,而是以好奇心和理解回应她的挑战,让来访者感到被接纳和理解。治疗师的策略包括鼓励、肯定、理解和引导,以帮助来访者面对自身问题并获得成长。这些策略有助于建立积极的治疗关系,促进来访者的自我探索和改善。", "extra_tags": [] }
英文示例
json { "theme": "solution-focused brief therapy: building trust in the therapeutic relationship through acceptance and support for students sharing their experiences.", "is_same_qa": 0, "is_same_case": 0, "background": "A student confides in the therapist about a disturbing experience with her uncle and her concerns about her mothers reaction.", "pre_reasoning": "The students willingness to share her experience may indicate that the therapists acceptance and support have built trust in the therapeutic relationship.", "topic_dialog": { "key": "fewshot", "unit_id": 98, "method": "Solution-Focused Brief Therapy", "dialog": [ { "id": "98-0", "participant": "Beth", "text": "It’s my older uncle … He comes over a lot and stays with us, and sometimes my mom leaves us with him when she goes out. I liked him, but last semester he stayed over and he wanted to kiss me. I didn’t know what to do. My mom likes him a lot, and they are close. Now, he says he loves me, and he has put his hands on me, you know, my breasts. I just stood there and didn’t know what to do. I don’t want to let mom know; she would be mad at me.", "observation": "Beth discloses a distressing experience involving her uncle, expressing confusion and fear about the situation and concern about her mothers reaction, indicating a complex family dynamic and her reluctance to cause conflict.", "emotional label": [ "Fear", "Shame" ] }, { "id": "98-1", "participant": "Therapist", "text": "Thank you for taking the chance and trying to help your situation by sharing it with me. I understand how difficult it must have been for you keeping this secret and trying to protect your mom’s relationship with her brother. I admire your courage to take care of yourself given the very difficult situation you’ve been in. Do you have any ideas about how you would want to make this better and not have this happening?", "reasoning": "The therapist acknowledges Beths bravery in sharing her experience and offers support while encouraging her to consider ways to improve her situation, empowering her to take control and explore solutions.", "strategy": [ "Question" ] }, { "id": "98-2", "participant": "Beth", "text": "I’m aware of child protective services. They see some of my friends and their families. I don’t want to have to leave my mom, and I don’t want her to be mad at me.", "observation": "Beth demonstrates awareness of potential support services but expresses concerns about the impact on her family dynamics, highlighting her internal conflict and the importance of maintaining her relationship with her mother.", "emotional label": [ "Fear", "Guilty" ] }, { "id": "98-3", "participant": "Therapist", "text": "I agree that you should be able to stay in your home and also to have a good relationship with your mom. You are aware of child protective services. It is their job to help protect you and any other young person. I agree with you that it is best when you stay at home and make it a safe place for you and your brothers and sisters and your mom, too. From what you have shared, it would be important to contact child protective services. I know several people there, and I would like for you to help me make this contact so that you can make your home safe. Will you help make that happen?", "reasoning": "The therapist validates Beths desire to maintain her family relationships while emphasizing the importance of her safety, suggesting a collaborative approach to involve child protective services and ensure a safe environment for Beth and her family.", "strategy": [ "Question" ] }, { "id": "98-4", "participant": "Beth", "text": "I guess … But I’m still afraid of what mom might say and of her brother.", "observation": "Beth expresses fear of potential repercussions within her family, indicating the internal struggle she faces in balancing her safety with her relationships and the challenging decisions ahead.", "emotional label": [ "Anxiety", "Fear" ] }, { "id": "98-5", "participant": "Therapist", "text": "I understand your not wanting to upset your mom or even your uncle. It’s not an easy decision to make. What do you think would be the best way to handle this so that you don’t have to deal with your uncle’s behavior and you can feel safe?", "reasoning": "The therapist acknowledges Beths concerns and explores her thoughts on managing the situation delicately, encouraging her to consider strategies that prioritize her safety and well-being while navigating the complexities of family dynamics.", "strategy": [ "Question" ] } ] } }
数据集用途
Psy-Insight 数据集主要用于大型语言模型的微调,特别是在心理健康领域的应用,如心理咨询机器人的开发和心理治疗对话系统的训练。

Sleep
该数据集包含关于睡眠效率和持续时间的信息,每个条目代表一个独特的睡眠体验,并包括ID、年龄、性别、睡眠持续时间、睡眠效率、REM睡眠百分比、深度睡眠百分比、轻度睡眠百分比、觉醒次数、咖啡因消费、酒精消费和吸烟状况等列。
github 收录
MedDialog
MedDialog数据集(中文)包含了医生和患者之间的对话(中文)。它有110万个对话和400万个话语。数据还在不断增长,会有更多的对话加入。原始对话来自好大夫网。
github 收录
CliMedBench
CliMedBench是一个大规模的中文医疗大语言模型评估基准,由华东师范大学等机构创建。该数据集包含33,735个问题,涵盖14个核心临床场景,主要来源于顶级三级医院的真实电子健康记录和考试练习。数据集的创建过程包括专家指导的数据选择和多轮质量控制,确保数据的真实性和可靠性。CliMedBench旨在评估和提升医疗大语言模型在临床决策支持、诊断和治疗建议等方面的能力,解决医疗领域中模型性能评估的不足问题。
arXiv 收录
RadDet
RadDet是一个包含11种雷达类别的数据集,包括6种新的低概率干扰(LPI)多相码(P1, P2, P3, P4, Px, Zadoff-Chu)和一种新的宽带调频连续波(FMCW)。数据集覆盖500 MHz频段,包含40,000个雷达帧,分为训练集、验证集和测试集。数据集在两种不同的雷达环境中提供:稀疏数据集(RadDet-1T)和密集数据集(RadDet-9T)。
github 收录
中国30米分辨率土壤可蚀性因子数据集
该数据集为2018年中国30米分辨率土壤可蚀性因子(K)栅格数据,数据是利用中国1979-1994年的全国第二次土壤普查的成果数据进行计算;再利用径流小区观测数据修正计算结果;将修订结果利用反距离权重插值法插值生成栅格数据。特殊地类河湖库塘、冰川及永久积雪、裸岩土地类型K因子值强制赋值为0。如果用户采用的土地利用精度较高,建议重新对以下土地类型的K因子强制赋值为0:河湖库塘、冰川及永久积雪、裸岩。如果有K值为0,但不属于上述类型的,K因子可按如下原则:取邻近相同土地类型图斑的K值,或取与该图斑邻近且不等于0的所有图斑K值的平均值。
国家地球系统科学数据中心 收录