LUMIN: Language-based Understanding and Multimodal Interface for NASA PDS
收藏DataCite Commons2025-12-07 更新2026-05-03 收录
下载链接:
http://dataverse.jpl.nasa.gov/citation?persistentId=doi:10.48577/jpl.XOYL94
下载链接
链接失效反馈官方服务:
资源简介:
Accessing and querying NASA’s Planetary Data System (PDS) can present a steep learning curve due to the complex metadata structures and large volume of filterable parameters. To address this, we developed a Large Language Model (LLM)-powered assistant to help users generate accurate filter selections for planetary datasets, with an initial focus on Mars Reconnaissance Orbiter (MRO) data. Our approach leverages Retrieval-Augmented Generation (RAG) with hybrid methods, including BM25 and dense embeddings, to interpret natural language queries and recommend relevant filters. We also introduce a similarity search function that enables users to retrieve scientifically relevant datasets based on semantically related queries or previous selections, improving the refinement and contextualization of results. In addition, we developed an image analysis module that allows users to input planetary images and returns visually and contextually similar images by using a vision encoder and a pre-indexed embedding database. This functionality is currently limited to the Mars Science Laboratory (MSL) mission, specifically imagery related to the Curiosity rover. This module also extracts available details about the input image, enhancing user understanding and supporting visual-based data discovery. For PDS3 datasets (legacy format), we implemented a script that achieves over 80 percent accuracy and precision in predicting appropriate filters. For PDS-4 (the modern metadata structure), we are testing a new AI assistant that leverages the PDS API client to guide users through the well-organized dropdown filters, potentially removing the need for direct URL prediction due to the improved schema. A unified graphical user interface (GUI) is also in development to integrate all assistant modules and further enhance accessibility. This system enhances usability across both metadata structures, significantly lowering the barrier for scientific data discovery and increasing the efficiency of user interaction with planetary archives.
提供机构:
Root
创建时间:
2025-12-07



