five

nivvis/eq-esconv-sifted

收藏
Hugging Face2026-03-19 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/nivvis/eq-esconv-sifted
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-4.0 parent_dataset: thu-coai/esconv task_categories: - text-generation - conversational tags: - empathy - emotional-intelligence - eq - emotional-support - ranked language: - en size_categories: - 1K<n<10K --- # EQ-ESConv-Sifted: Elo-Ranked Emotional Support Conversations The [ESConv](https://huggingface.co/datasets/thu-coai/esconv) dataset (Liu et al., ACL 2021) ranked by empathetic quality via Swiss-style Elo tournament. All 1,300 conversations scored and sorted. ## Why this exists ESConv is a widely-used emotional support dataset but quality varies significantly — some conversations have excellent empathetic support, others are low-effort or off-topic. This dataset adds Elo rankings so you can filter by quality. For higher-quality synthetic empathetic conversations, see: - [nivvis/eq-convos](https://huggingface.co/datasets/nivvis/eq-convos) — 1,774 multi-turn synthetic conversations, Elo-ranked - [nivvis/eq-dpo](https://huggingface.co/datasets/nivvis/eq-dpo) — 2,880 turn-level DPO preference pairs This dataset is still useful as: - Source material for further synthesis - A benchmark for comparing empathy quality across datasets - SFT data when filtered to top tiers ## Fields | Field | Description | |-------|-------------| | `id` | Original ESConv index | | `elo` | Tournament Elo rating (higher = better empathy) | | `wins` | Tournament wins | | `losses` | Tournament losses | | `draws` | Tournament draws | | `emotion_type` | Primary emotion: anxiety, depression, sadness, anger, fear, etc. | | `problem_type` | Problem category: breakup, job crisis, ongoing depression, etc. | | `situation` | Description of the seeker's situation | | `dialog` | Full conversation as JSON array of `{text, speaker, strategy}` turns | | `survey_score` | Original crowdworker survey ratings (empathy, relevance, emotion intensity) | ## Quality Tiers | Tier | Elo | Count | |------|-----|-------| | Top 20% | >= 1530 | ~260 | | Top 50% | >= 1500 | ~650 | | Bottom 25% | < 1480 | ~325 | ## Tournament Method - Swiss-style pairing: similar-Elo conversations matched each round - 10 rounds, ~650 matches per round - Logit-probe A/B judging: normalized P(A|{A,B}) from single-token output - Position bias mitigated via randomized A/B presentation order - Continuous Elo scoring (no binary win/loss, raw probabilities as match scores) ## Citation Original ESConv dataset: ```bibtex @inproceedings{liu-etal-2021-towards, title = "Towards Emotional Support Dialog Systems", author = "Liu, Siyang and others", booktitle = "ACL 2021", year = "2021" } ```
提供机构:
nivvis
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作