paulofinardi/OIG_small_chip2_portuguese_brasil
收藏Hugging Face2023-03-19 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/paulofinardi/OIG_small_chip2_portuguese_brasil
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: user
dtype: string
- name: chip2
dtype: string
splits:
- name: train
num_examples: 210289
task_categories:
- conversational
- text2text-generation
language:
- pt
---
# Dataset Card for "OIG_small_chip2_portuguese_brasil"
This dataset was translated to Portuguese-Brasil from [here](https://huggingface.co/datasets/0-hero/OIG-small-chip2)
The data was translated with *MarianMT* model and weights [Helsinki-NLP/opus-mt-en-ROMANCE](https://huggingface.co/Helsinki-NLP/opus-mt-en-ROMANCE)
The full details to replicate the translation are here: [translation_notebook](https://github.com/finardi/tutos/blob/master/translate_Laion_OIG.ipynb)
---
license: apache-2.0
---
提供机构:
paulofinardi
原始信息汇总
数据集概述
数据集名称
- OIG_small_chip2_portuguese_brasil
数据集特征
- user: 数据类型为字符串 (string)
- chip2: 数据类型为字符串 (string)
数据集划分
- train: 包含210289个样本
任务类别
- 对话式 (conversational)
- 文本到文本生成 (text2text-generation)
语言
- 葡萄牙语 (pt)



