gplsi/alia_tourism
收藏Hugging Face2025-12-19 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/gplsi/alia_tourism
下载链接
链接失效反馈官方服务:
资源简介:
ALIA_TOURISM数据集是一个多语言资源,专为旅游领域的文本生成而设计。数据集包含以Markdown格式(`.md`)提供的文本文档,每个文档以结构化的JSONL条目形式呈现。每个条目包括文本的源、语言、格式、文本内容和元数据等信息。数据集涵盖西班牙语(es)、瓦伦西亚语(va)和英语(en)三种语言,领域为旅游,格式为JSON Lines(`.jsonl`)。每个条目代表一个独立的旅游相关文本。数据集由旅游资源自动整理而成,元数据覆盖范围可能因条目而异,内容可能包含用于结构的Markdown格式(如标题、列表、强调等)。
The **ALIA_TOURISM** dataset is a multilingual resource designed for **text generation** within the **tourism domain**. The dataset consists of textual documents formatted in **Markdown (`.md`)**, each provided as structured JSONL entries. Each entry includes information about the texts **source**, **language**, **format**, **text**, and **metadata**. The dataset covers Spanish (es), Valencian (va), and English (en) languages, with the domain being tourism, and the format being JSON Lines (`.jsonl`). Each item represents a standalone tourism-related text. The dataset is automatically curated from tourism sources, and metadata coverage may vary by entry. Content may include Markdown formatting for structure (e.g., headers, lists, emphasis).
提供机构:
gplsi



