Abstract
The standard framework of machine learning problems assumes that the available data is independent and identically distributed (i.i.d.). However, in some applications such as image classification, the training data are often collected from multiple sources and heterogeneous. Ensemble learning is a proven effective approach to heterogeneous data, which uses multiple classification models to capture the diverse aspects of heterogeneous data. If an ensemble can learn the relationship between different portions of data and their corresponding models, the ensemble can selectively apply models to unseen data according to the learned relationship. We propose a novel approach to enable the learning of the relationships between data and models by creating a set of 'switches' that can route a testing instance to appropriate classification models in an ensemble. Our empirical study on both real-world data and benchmark data shows that the proposed approach to ensemble learning can achieve significant performance improvement for heterogeneous data.
Original language | English (US) |
---|---|
Title of host publication | Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) |
Editors | J.-F. Boulicaut, F. Esposito, D. Pedreschi, F. Giannotti |
Pages | 560-562 |
Number of pages | 3 |
Volume | 3201 |
State | Published - 2004 |
Event | 15th European Conference on Machine Learning, ECML 2004 - Pisa, Italy Duration: Sep 20 2004 → Sep 24 2004 |
Other
Other | 15th European Conference on Machine Learning, ECML 2004 |
---|---|
Country/Territory | Italy |
City | Pisa |
Period | 9/20/04 → 9/24/04 |
ASJC Scopus subject areas
- Hardware and Architecture