LiveTaro/vad-synthetic-qwen-data
Source: Hugging Face. Updated: 2025-01-19. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/LiveTaro/vad-synthetic-qwen-data
Description:
This dataset contains pairs of Japanese sentences generated with the Qwen2.5-14B-Instruct model, based on izumi-lab/llm-japanese-dataset. Each topic has four sentences: two complete ones (a regular complete sentence and a complete sentence in question form) and two incomplete ones. The data is organized in JSON format and provides the topic, the generated text, and a flag indicating whether the sentence is complete.
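As a rough illustration of the layout described above, the following Python sketch parses a small JSON sample and separates complete from incomplete sentences. The field names `topic`, `text`, and `is_complete`, the helper `split_by_completeness`, and the sample sentences are all assumptions made for illustration; the actual keys and contents in the dataset may differ, so check the dataset files before relying on them.

```python
import json

# Hypothetical sample: the keys "topic", "text", and "is_complete" are
# assumed for illustration and are not confirmed by the dataset listing.
# The sentences are made up and do not come from the dataset.
SAMPLE = """
[
  {"topic": "天気", "text": "今日は晴れです。", "is_complete": true},
  {"topic": "天気", "text": "今日は晴れですか?", "is_complete": true},
  {"topic": "天気", "text": "今日は", "is_complete": false},
  {"topic": "天気", "text": "今日は晴れで", "is_complete": false}
]
"""


def split_by_completeness(records):
    """Return (complete, incomplete) lists of sentence texts."""
    complete = [r["text"] for r in records if r["is_complete"]]
    incomplete = [r["text"] for r in records if not r["is_complete"]]
    return complete, incomplete


records = json.loads(SAMPLE)
complete, incomplete = split_by_completeness(records)
print(len(complete), "complete,", len(incomplete), "incomplete")
```

A split like this would be one way to build positive and negative examples for an end-of-utterance classifier, if that is the intended use of the completeness flag.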
Provider:
LiveTaro



