five

Dataset for Large Language Models Classification of Astronomical Transient : Survey Images, Labels, and Zooniverse Results

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14561777
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset supports the study on the application of Large Language Models (LLMs) to astronomical transient classification. It contains data from three wide-field optical surveys: Pan-STARRS, MeerLICHT, and ATLAS, and includes the following components: Survey Image Files Image triplets (New, Reference, and Difference images) for each transient candidate in the three datasets. Images are organized by survey (Pan-STARRS, MeerLICHT, and ATLAS). Survey Label Files Ground-truth classification labels for each candidate in the three surveys. Labels include categories such as Real (e.g., transients and variable stars for MeerLICHT, and only explosive transients for Pan-STARRS and ATLAS) and Bogus (various types of artifacts), as determined by professional astronomers. Zooniverse Classification Results Results from the Zooniverse campaign evaluating the quality of Gemini’s outputs. Includes responses from professional astronomers who rated the coherence of Gemini's classifications and explanations on a 0–5 scale. Purpose: The dataset is intended for researchers interested in exploring: The use of LLMs for transient classification. Ground-truth labels and their application in astronomical classification tasks. Human evaluation of AI-generated outputs in astronomy. Use and Citation: Please cite this dataset using the DOI provided by Zenodo if used in your research. For details on the methodology, refer to the associated publication: "Large Language Models Enable Textual Interpretation of Image-Based Astronomical Transient Classifications", Under Review, 2025
创建时间:
2025-01-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作