shi3z/ja_conv_wikipedia_llama2pro8b_10k

Name: shi3z/ja_conv_wikipedia_llama2pro8b_10k
Creator: shi3z
Published: 2024-01-12 06:18:48
License: 暂无描述

Hugging Face2024-01-12 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/shi3z/ja_conv_wikipedia_llama2pro8b_10k

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: llama2 task_categories: - conversational language: - ja size_categories: - 10K<n<100K --- This dataset is based on the Japanese version of Wikipedia dataset and converted into a multi-turn conversation format using llama2Pro8B. After generating 10,000 conversations and screening, only about 3,000 were usable, so I will publish them in this state first. Since it is a llama2 license, it can be used commercially for services. Some strange dialogue may be included as it has not been screened by humans. We generated 30,000 conversations over 24 hours on an A100 80GBx7 machine and automatically screened them. # Model https://huggingface.co/spaces/TencentARC/LLaMA-Pro-8B-Instruct-Chat # Dataset https://huggingface.co/datasets/izumi-lab/wikipedia-ja-20230720 # Compute by Tsuginosuke AI SuperComputer FreeAI Ltd. https://free-ai.ltd

提供机构：

shi3z

原始信息汇总