BaixuW/aviation-domain-pretrain-corpus
收藏Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/BaixuW/aviation-domain-pretrain-corpus
下载链接
链接失效反馈官方服务:
资源简介:
**航空领域预训练语料库**是一个早期阶段的领域特定数据集,旨在用于航空领域大型语言模型的持续预训练(PT)。该数据集目前处于非常早期的阶段,主要用于研究探索和方法验证,而非生产使用。数据集专注于向LLM注入基本航空知识,包括:航空交通管制(ATC)、航空气象、飞行性能、航空信息等。
The **Aviation Domain Pretrain Corpus** is an early-stage domain-specific dataset designed for continued pretraining (PT) of large language models in the aviation field. This dataset is currently in a very early stage. It is intended primarily for research exploration and method validation rather than production use. The dataset focuses on injecting basic aviation knowledge into LLMs, including: Air Traffic Control (ATC), Aviation Meteorology, Flight Performance, Aeronautical Information.
提供机构:
BaixuW



