DealerMax/italian-automotive-guides
收藏Hugging Face2026-04-27 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/DealerMax/italian-automotive-guides
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为意大利汽车编辑指南(DealerMax),是一个意大利语单语数据集,包含22篇关于汽车购买主题的长篇幅编辑指南文章。每篇文章都是完整的HTML内容,长度通常在1,500到4,000词之间,专为面向消费者的经销商网站设计,内容清晰、结构化和中立。数据集涵盖二手车购买、融资、长期租赁(NLT)、保修、经销商运营和消费者保护等多个主题。数据以JSON Lines格式存储,大小为约336 KB,采用CC-BY-4.0许可。这些文章由AZURE Srl通过GPT-5提示生成,并经过人工编辑审核,确保事实准确性和语气中立。数据集可用于意大利语长文本生成训练、RAG语料库构建、阅读理解基准测试、领域特定摘要生成以及HowTo模式标记生成参考。
The dataset is named Italian Automotive Editorial Guides (DealerMax), a monolingual Italian dataset containing 22 long-form editorial guide articles on car-buying topics. Each article is a complete HTML piece, typically ranging from 1,500 to 4,000 words, designed for consumer-facing dealer websites with clear, structured, and neutral content. It covers topics such as used car buying, financing, long-term rental (NLT), warranty, dealer operations, and consumer protection. The data is stored in JSON Lines format, with a size of approximately 336 KB, and is licensed under CC-BY-4.0. The articles were created by AZURE Srl using GPT-5 prompts and underwent human editorial review for factual accuracy and neutral tone. The dataset is suitable for long-form Italian text generation training, RAG corpus construction, reading comprehension benchmarks, domain-specific summarization, and HowTo schema markup generation reference.
提供机构:
DealerMax



