Mattimax/OpenITA-Conversation
收藏Hugging Face2026-03-20 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Mattimax/OpenITA-Conversation
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- text-generation
language:
- it
tags:
- DAC
- M.INC.
- conversational
pretty_name: OpenITA-Conversation
size_categories:
- 1K<n<10K
---
# Dataset Card: Mattimax/OpenITA-Conversation
## Dataset Overview
**Name:** Mattimax/OpenITA-Conversation
**Type:** Conversational dataset
**Language:** Italian
**License:** MIT
This dataset contains multi-turn conversations in Italian. Each entry is structured as a list of messages between a user and an assistant, often including advice, emotional support, casual conversation, or very simple code snippets. It also includes metadata for the theme of the conversation.
---
## Dataset Structure
Each record is a JSON object with the following structure:
```json
{
"messages": [
{
"role": "user",
"content": "string"
},
{
"role": "assistant",
"content": "string"
}
],
"meta": {
"theme": "string"
}
}
````
**Fields:**
* `messages`: List of message objects in a conversation.
* `role`: `"user"` or `"assistant"`.
* `content`: The text of the message.
* `meta`: Metadata object.
* `theme`: Theme of the conversation (examples include `"emotivo"`, `"conversazione"`, `"consigli"`, `"riflessione"`).
---
## Features
* Multi-turn conversations.
* Mix of emotional support, casual chatting, advice, and occasional simple code snippets.
* Themes help categorize conversation types for filtering or analysis.
---
## Examples
```json
{
"messages": [
{
"role": "user",
"content": "Ti piace camminare nei boschi?"
},
{
"role": "assistant",
"content": "Sì, l’aria fresca e il verde sono molto rilassanti 🌿"
}
],
"meta": {
"theme": "conversazione"
}
}
```
```json
{
"messages": [
{
"role": "user",
"content": "Mi sento un po' ansioso oggi"
},
{
"role": "assistant",
"content": "Capita, prova a fare respiri profondi e concentrarti sul presente."
}
],
"meta": {
"theme": "emotivo"
}
}
```
---
## Potential Uses
* Training or evaluating Italian conversational AI models.
* Research on emotional support in chatbots.
* Dialogue generation, intent recognition, or theme classification.
* Studying casual conversation patterns in Italian.
---
## Limitations
* The dataset primarily contains short and simple dialogues.
* Responses often mix casual conversation with emotional advice and occasional emoji.
* Simple code snippets appear rarely and only in basic forms.
* Metadata is limited to a single `theme` per conversation, no speaker-specific attributes or timestamps.
## Citation
If you use this dataset, please cite it as:
```
@misc{mattimax_openita_conversation,
title = {OpenITA-Conversation},
author = {Mattimax},
year = {2026},
note = {Italian multi-turn conversational dataset},
url = {[https://huggingface.co/datasets/Mattimax/OpenITA-Conversation}](https://huggingface.co/datasets/Mattimax/OpenITA-Conversation})
}
许可证:MIT许可证
任务类别:
- 文本生成
语言:
- 意大利语
标签:
- DAC
- M.INC.
- 会话式(conversational)
展示名称:OpenITA-Conversation
规模类别:
- 1K<n<10K
# 数据集卡片:Mattimax/OpenITA-Conversation
## 数据集概览
**名称:** Mattimax/OpenITA-Conversation
**类型:** 会话式数据集(conversational dataset)
**语言:** 意大利语
**许可证:** MIT许可证
本数据集包含意大利语多轮会话。每条数据均以用户与助手之间的消息列表形式组织,内容涵盖建议提供、情绪支持、日常闲聊以及极简单的代码片段。此外,数据集还附带会话主题的元数据。
## 数据集结构
每条记录为遵循如下结构的JSON对象:
json
{
"messages": [
{
"role": "user",
"content": "string"
},
{
"role": "assistant",
"content": "string"
}
],
"meta": {
"theme": "string"
}
}
**字段说明:**
* `messages`:会话中的消息对象列表。
* `role`:取值为`"user"`(用户)或`"assistant"`(助手)。
* `content`:消息的文本内容。
* `meta`:元数据对象。
* `theme`:会话的主题(示例包括`"emotivo"`(情绪类)、`"conversazione"`(闲聊类)、`"consigli"`(建议类)、`"riflessione"`(反思类))。
## 数据集特性
* 支持多轮会话
* 涵盖情绪支持、日常闲聊、建议提供以及偶现的简单代码片段
* 主题标签可用于对会话类型进行分类,便于筛选与分析
## 示例
json
{
"messages": [
{
"role": "user",
"content": "Ti piace camminare nei boschi?"
},
{
"role": "assistant",
"content": "Sì, l’aria fresca e il verde sono molto rilassanti 🌿"
}
],
"meta": {
"theme": "conversazione"
}
}
json
{
"messages": [
{
"role": "user",
"content": "Mi sento un po' ansioso oggi"
},
{
"role": "assistant",
"content": "Capita, prova a fare respiri profondi e concentrarti sul presente."
}
],
"meta": {
"theme": "emotivo"
}
}
## 潜在应用场景
* 训练或评估意大利语会话式AI模型
* 开展聊天机器人情绪支持相关研究
* 用于对话生成、意图识别或主题分类任务
* 研究意大利语日常闲聊的会话模式
## 局限性说明
* 数据集以简短简单的对话为主
* 助手回复常融合日常闲聊、情绪建议,并偶现表情符号
* 简单代码片段出现频次较低,且仅为基础形式
* 元数据仅包含单一会话主题字段,未提供说话人专属属性或时间戳信息
## 引用方式
若使用本数据集,请按照如下格式进行引用:
@misc{mattimax_openita_conversation,
title = {OpenITA-Conversation},
author = {Mattimax},
year = {2026},
note = {Italian multi-turn conversational dataset},
url = {https://huggingface.co/datasets/Mattimax/OpenITA-Conversation}
}
提供机构:
Mattimax



