five

2O24dpower2024/ESCALATE

收藏
Hugging Face2026-03-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/2O24dpower2024/ESCALATE
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: cc-by-4.0 pretty_name: ESCALATE Dataset task_categories: - text-classification - text-generation task_ids: - dialogue-modeling - multi-class-classification tags: - healthcare - medical - llm - multi-agent - simulation - dialogue - evaluation size_categories: - 1K<n<10K --- # ESCALATE: A Dataset for Safety-Critical Clinical Escalation Conversations [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.19224182.svg)](https://doi.org/10.5281/zenodo.19224182) [![Hugging Face](https://img.shields.io/badge/🤗%20Hugging%20Face-Dataset-yellow)](https://huggingface.co/datasets/2O24dpower2024/ESCALATE) [![GitHub Repo](https://img.shields.io/badge/GitHub-Repository-black)](https://github.com/DavePower-cloud/ESCALATE-dataset) # Dataset Card for ESCALATE ## Dataset Summary ESCALATE is a synthetic dataset of clinical escalation-of-care conversations generated using multi-agent large language models. It enables evaluation of communication quality and safety under structured (ISBAR) and unstructured conditions. --- ## Motivation Clinical escalation is a safety-critical process where communication failures can lead to patient harm. ESCALATE provides a structured way to evaluate how AI systems perform in this context. --- ## Composition - 200 paired cases (control vs ISBAR) - 10 deterioration archetypes - 400 total transcripts --- ## Generation Process - Case cards define ground truth - Role-locked agents simulate: - nurse - registrar - optional nurse-in-charge - Models: - OpenAI (GPT-4o-mini) - Anthropic (Claude Haiku) - Google (Gemini Flash) --- ## Evaluation Labels Each transcript is scored using a structured rubric: - omissions - hallucinations - actionability - escalation appropriateness - closed-loop communication --- ## Splits - Train / Validation / Test - Paired conversations kept together --- ## Intended Uses - AI evaluation - simulation research - clinical communication analysis --- ## Out-of-Scope Uses - real clinical decision support - patient care --- ## Limitations - synthetic data - model-generated behaviours - limited to defined archetypes --- ## Ethical Considerations - no real patient data - designed for research use only --- ## Citation Power, D., & Power, T. (2026). ESCALATE: A Dataset for Safety-Critical Clinical Escalation Conversations (Version: 1.0.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.19224182 [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.19224182.svg)](https://doi.org/10.5281/zenodo.19224182) --- ### BibTeX @dataset{power2026escalate,\ author = {David Power},\ title = {ESCALATE: A Dataset for Safety-Critical Clinical Escalation Conversations},\ year = {2026},\ publisher = {Zenodo},\ doi = {10.5281/zenodo.19224182}\ } --- ## 🔗 Related Research This dataset builds on prior work demonstrating the feasibility of multi-agent large language models for simulating clinical conversations: *Power, D., & Power, T. (2026). Can Large Language Models Generate Role-Consistent Clinical Dialogue for Education? A Multi-agent Approach.* Under Review. Preprint available at EdArViX: https://doi.org/10.35542/osf.io/etv6d_v ESCALATE extends this work by introducing a structured dataset of escalation-of-care conversations with safety-focused evaluation labels --- 👤 Author David Power Healthcare Simulation Specialist | MSc Artificial Intelligence 💼 LinkedIn: https://www.linkedin.com/in/dave-power-47280a44/ 💻 GitHub: https://github.com/DavePower-cloud --- ## 📜 Licensing - Code is released under the MIT License - Dataset is released under CC BY 4.0 Please cite appropriately when using the dataset. ---
提供机构:
2O24dpower2024
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作