|| Checking for direct PDF access through Ovid
This paper provides a discussion of the effects of different multi-level learning approaches on the resulting out of sample forecast errors in the case of difficult real-world forecasting problems with large noise terms in the training data, frequently occurring structural breaks and quickly changing environments. In order to benefit from the advantages of learning on different aggregation levels and to reduce the risks of high noise terms on low level predictions and overgeneralization on higher levels, various approaches of using information at different levels are analysed in relation to their effects on the bias, variance and Bayes error components proposed by James and Hastie. We provide an extension of this decomposition for the multi-level case. An extensive analysis is also carried out answering the question of why the combination of predictions using information learned at different levels constitutes a significantly better approach in comparison to using only the predictions generated at one of the levels or other multi-level approaches. Additionally we argue why multi-level combinations should be used in addition to thick modelling and the use of different function spaces. Significant forecast improvements have been obtained when using the proposed multi-level combination approaches.