DiyRex/emobooks-dataset

Name: DiyRex/emobooks-dataset
Creator: DiyRex
Published: 2026-04-26 09:28:08
License: 暂无描述

Hugging Face2026-04-26 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/DiyRex/emobooks-dataset

下载链接

链接失效反馈

官方服务：

资源简介：

**emoBooks**数据集是一个精心策划的对话样本集合，旨在训练AI模型进行**情感感知的书籍推荐**。该数据集专注于理解用户情绪，并提供“Singlish”（音译的僧伽罗语）中的相关书籍建议。数据集遵循**匹配/切换**逻辑：- **匹配**：推荐与用户当前情绪状态相符的书籍。- **切换**：推荐帮助用户转换到更积极情绪状态的书籍（例如，从悲伤到快乐）。数据集采用标准的**ChatML/OpenAI格式**，非常适合对Llama-3、Mistral或Gemma等模型进行指令微调。每个条目都是一个包含`messages`列表的JSON对象。数据集涵盖8种主要情绪状态和多种书籍类型，包括儿童故事、小说、翻译作品、传记和短篇故事。该数据集是通过自定义流程合成的，旨在用于微调大型语言模型（LLM）以用于推荐系统、情感感知对话AI研究以及开发双语（僧伽罗语/英语）文学助手。

The **emoBooks** dataset is a curated collection of conversational samples designed to train AI models for **emotion-aware book recommendations**. It focuses on understanding user emotions and providing relevant book suggestions in "Singlish" (transliterated Sinhala). The dataset follows a **Match/Switch** logic: - **Match**: Recommend books that align with the users current emotional state. - **Switch**: Recommend books that help transition the user to a more positive emotional state (e.g., from Sadness to Joy). The dataset is provided in a standard **ChatML/OpenAI format**, making it ideal for instruction fine-tuning of models like Llama-3, Mistral, or Gemma. Each entry is a JSON object with a `messages` list. The dataset covers 8 primary emotional states and various book genres, including Childrens Stories, Novels, Translations, Biographies, and Short Stories. This dataset was synthetically generated using a custom pipeline and is intended for fine-tuning Large Language Models (LLMs) for recommendation systems, research in emotion-aware conversational AI, and developing bilingual (Sinhala/English) literary assistants.

提供机构：

DiyRex

5,000+

优质数据集

54 个

任务类型

进入经典数据集