five

Mattimax/OpenITA-Conversation

收藏
Hugging Face2026-03-20 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Mattimax/OpenITA-Conversation
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-generation language: - it tags: - DAC - M.INC. - conversational pretty_name: OpenITA-Conversation size_categories: - 1K<n<10K --- # Dataset Card: Mattimax/OpenITA-Conversation ## Dataset Overview **Name:** Mattimax/OpenITA-Conversation **Type:** Conversational dataset **Language:** Italian **License:** MIT This dataset contains multi-turn conversations in Italian. Each entry is structured as a list of messages between a user and an assistant, often including advice, emotional support, casual conversation, or very simple code snippets. It also includes metadata for the theme of the conversation. --- ## Dataset Structure Each record is a JSON object with the following structure: ```json { "messages": [ { "role": "user", "content": "string" }, { "role": "assistant", "content": "string" } ], "meta": { "theme": "string" } } ```` **Fields:** * `messages`: List of message objects in a conversation. * `role`: `"user"` or `"assistant"`. * `content`: The text of the message. * `meta`: Metadata object. * `theme`: Theme of the conversation (examples include `"emotivo"`, `"conversazione"`, `"consigli"`, `"riflessione"`). --- ## Features * Multi-turn conversations. * Mix of emotional support, casual chatting, advice, and occasional simple code snippets. * Themes help categorize conversation types for filtering or analysis. --- ## Examples ```json { "messages": [ { "role": "user", "content": "Ti piace camminare nei boschi?" }, { "role": "assistant", "content": "Sì, l’aria fresca e il verde sono molto rilassanti 🌿" } ], "meta": { "theme": "conversazione" } } ``` ```json { "messages": [ { "role": "user", "content": "Mi sento un po' ansioso oggi" }, { "role": "assistant", "content": "Capita, prova a fare respiri profondi e concentrarti sul presente." } ], "meta": { "theme": "emotivo" } } ``` --- ## Potential Uses * Training or evaluating Italian conversational AI models. * Research on emotional support in chatbots. * Dialogue generation, intent recognition, or theme classification. * Studying casual conversation patterns in Italian. --- ## Limitations * The dataset primarily contains short and simple dialogues. * Responses often mix casual conversation with emotional advice and occasional emoji. * Simple code snippets appear rarely and only in basic forms. * Metadata is limited to a single `theme` per conversation, no speaker-specific attributes or timestamps. ## Citation If you use this dataset, please cite it as: ``` @misc{mattimax_openita_conversation, title = {OpenITA-Conversation}, author = {Mattimax}, year = {2026}, note = {Italian multi-turn conversational dataset}, url = {[https://huggingface.co/datasets/Mattimax/OpenITA-Conversation}](https://huggingface.co/datasets/Mattimax/OpenITA-Conversation}) }

许可证:MIT许可证 任务类别: - 文本生成 语言: - 意大利语 标签: - DAC - M.INC. - 会话式(conversational) 展示名称:OpenITA-Conversation 规模类别: - 1K<n<10K # 数据集卡片:Mattimax/OpenITA-Conversation ## 数据集概览 **名称:** Mattimax/OpenITA-Conversation **类型:** 会话式数据集(conversational dataset) **语言:** 意大利语 **许可证:** MIT许可证 本数据集包含意大利语多轮会话。每条数据均以用户与助手之间的消息列表形式组织,内容涵盖建议提供、情绪支持、日常闲聊以及极简单的代码片段。此外,数据集还附带会话主题的元数据。 ## 数据集结构 每条记录为遵循如下结构的JSON对象: json { "messages": [ { "role": "user", "content": "string" }, { "role": "assistant", "content": "string" } ], "meta": { "theme": "string" } } **字段说明:** * `messages`:会话中的消息对象列表。 * `role`:取值为`"user"`(用户)或`"assistant"`(助手)。 * `content`:消息的文本内容。 * `meta`:元数据对象。 * `theme`:会话的主题(示例包括`"emotivo"`(情绪类)、`"conversazione"`(闲聊类)、`"consigli"`(建议类)、`"riflessione"`(反思类))。 ## 数据集特性 * 支持多轮会话 * 涵盖情绪支持、日常闲聊、建议提供以及偶现的简单代码片段 * 主题标签可用于对会话类型进行分类,便于筛选与分析 ## 示例 json { "messages": [ { "role": "user", "content": "Ti piace camminare nei boschi?" }, { "role": "assistant", "content": "Sì, l’aria fresca e il verde sono molto rilassanti 🌿" } ], "meta": { "theme": "conversazione" } } json { "messages": [ { "role": "user", "content": "Mi sento un po' ansioso oggi" }, { "role": "assistant", "content": "Capita, prova a fare respiri profondi e concentrarti sul presente." } ], "meta": { "theme": "emotivo" } } ## 潜在应用场景 * 训练或评估意大利语会话式AI模型 * 开展聊天机器人情绪支持相关研究 * 用于对话生成、意图识别或主题分类任务 * 研究意大利语日常闲聊的会话模式 ## 局限性说明 * 数据集以简短简单的对话为主 * 助手回复常融合日常闲聊、情绪建议,并偶现表情符号 * 简单代码片段出现频次较低,且仅为基础形式 * 元数据仅包含单一会话主题字段,未提供说话人专属属性或时间戳信息 ## 引用方式 若使用本数据集,请按照如下格式进行引用: @misc{mattimax_openita_conversation, title = {OpenITA-Conversation}, author = {Mattimax}, year = {2026}, note = {Italian multi-turn conversational dataset}, url = {https://huggingface.co/datasets/Mattimax/OpenITA-Conversation} }
提供机构:
Mattimax
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作