BP and RP spectra of the Gaia Photometric Alerts
收藏NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4411969
下载链接
链接失效反馈官方服务:
资源简介:
By means of Web Scraping techniques, the values of the BP (Blue Photometer) and RP (Red Photometer) spectra detected by Gaia have been extracted. This data extraction was carried out with the aim of studying the feasibility of classifying the alerts automatically according to their origin.
Spectrums.csv
id: Unique name of the alert.
order: Order of the spectrum on a given alert.
bp: Values as a list of the Blue Photometer.
rp: Values as a list of the Red Photometer.
a_d: If the spectrum corresponds to a detection it have the letter "D". If the spectrum corresponds to an alert it have the letter "A".
feature: It show us the class or comment of a spectrum.
Spectrums_columns.csv
We have the same values as in the "Spectrums.csv" but, the bp and rp lists are shown as columns. This dataset is useful for machine learning models.
The databases are applied in the master's thesis: Feasibility study of a spectra-based classication of the
Gaia Photometric Alerts.
Abstract
The photometric alerts obtained by the Gaia satellite are collected when a significant change from a constant magnitude is detected. This alert is recorded to be later studied and classified, in other words, to know what has caused it (variable star, microlensing effects, transits...). The project focuses on the alerts that have been published and classified in order to study the feasibility of automating the process of classifying these alerts.
After selecting the alerts with the greatest representation, a web scraping process is carried out where the photometric spectra of each of the alerts participating in the study are obtained. Once we obtain a dataset formed by the photometric spectra and the classification of the alert, various supervised machine learning techniques are implemented. Given the large volume of data we work with, a balanced random subset of 4000 elements is selected to obtain the best hyperparameters and evaluate the performance of the following classifiers: Decision Trees, Support Vector Machines, Random Forests and Gradient Boosting Classifier. This process is repeated on the complete dataset using the hyperparameters obtained in the subset. Finally, the performance of different models for an Artificial Neural Network is built and evaluated.
The best model obtained is the Gradient Boosting Classifier which, with a maximum depth of 7 nodes, 200 estimators and a learning rate of 0.1, gives an accuracy of 66.8%. Although the results are not excessively good, we can affirm that the classification of Gaia photometric alerts according to their spectra is feasible.
- Keywords: "Gaia", "Photometry", "Classification", "Machine Learning", "Web Scraping".
创建时间:
2021-01-03



