Please use this identifier to cite or link to this item: http://repositorio.udec.cl/jspui/handle/11594/11940
Title: Positional encodings for light curve transformers: an evaluation of their impact in the pretraining and classification task.
Authors: Cabrera Vives, Guillermo; Moreno Cartagena, Daniel Andrés
Keywords: Data sets; Light curves; Data encoding (Computer science)
Issue Date: 2024
Publisher: Universidad de Concepción
Abstract: The vast volume of astronomical data generated nightly by observatories, such as the Vera C. Rubin Observatory, presents significant challenges in the classification and analysis of light curves. These curves, characterized by their unique distributions across various bands, irregular sampling, and varied cadences, necessitate sophisticated models capable of generalization across diverse astronomical surveys. In this work, we conducted empirical experiments to assess the transferability of a light curve transformer model to datasets with different cadences and magnitude distributions, utilizing various positional encodings. We proposed a new approach to directly incorporate temporal information into the output of the last attention layer. Additionally, we modified the common finetuning approach to assess the adaptability of the light curve transformer in contexts where the cadence is markedly different from that of the dataset used for its pretraining. Our results indicate that using trainable positional encodings leads to significant improvements in transformer performance and training times. Our proposed positional encoding, applied to the attention mechanism, can be trained more quickly than the traditional non-trainable positional encoding transformer, while still achieving competitive results when transferred to other datasets. Our approach to adapting the model to a dataset with a very different cadence demonstrates that, in terms of reconstruction of astronomical time series, both training time and computational space can be reduced. This approach achieves an adaptation in the cadence of the survey without the need to train the entire model, indicating a promising direction for future research in astronomical data analysis.
Description: Thesis submitted for the degree of Master in Computer Science (Magíster en Ciencias de la Computación).
URI: http://repositorio.udec.cl/jspui/handle/11594/11940
Appears in Collections: Ingeniería Informática y Ciencias de la Computación - Tesis Magister

Files in This Item:
File: moreno_c_d_2024_MAG.pdf
Size: 2.75 MB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.