Back to Search Start Over

Stacking for machine learning redshifts applied to SDSS galaxies

Authors :
Zitlau, Roman
Hoyle, Ben
Paech, Kerstin
Weller, Jochen
Rau, Markus Michael
Seitz, Stella
Publication Year :
2016

Abstract

We present an analysis of a general machine learning technique called 'stacking' for the estimation of photometric redshifts. Stacking techniques can feed the photometric redshift estimate, as output by a base algorithm, back into the same algorithm as an additional input feature in a subsequent learning round. We shown how all tested base algorithms benefit from at least one additional stacking round (or layer). To demonstrate the benefit of stacking, we apply the method to both unsupervised machine learning techniques based on self-organising maps (SOMs), and supervised machine learning methods based on decision trees. We explore a range of stacking architectures, such as the number of layers and the number of base learners per layer. Finally we explore the effectiveness of stacking even when using a successful algorithm such as AdaBoost. We observe a significant improvement of between 1.9% and 21% on all computed metrics when stacking is applied to weak learners (such as SOMs and decision trees). When applied to strong learning algorithms (such as AdaBoost) the ratio of improvement shrinks, but still remains positive and is between 0.4% and 2.5% for the explored metrics and comes at almost no additional computational cost.<br />Comment: 13 pages, 3 tables, 7 figures version accepted by MNRAS, minor text updates. Results and conclusions unchanged

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.1602.06294
Document Type :
Working Paper
Full Text :
https://doi.org/10.1093/mnras/stw1454