Title
Classification performance of bagging and boosting type ensemble methods with small training sets
Authors
Faisal ZAMAN, Hideo HIROSE
Source

New Generation Computing, special issue on Hybrid and Ensemble Methods in Machine Learning, Vol. 29, No. 3, pp.277-292 , August 2011

Abstract
Classification performance of an ensemble method can be deciphered by studying the bias and variance contribution to its classification error. Statistically, the bias and variance of a single classifier is controlled by the size of the training set and the complexity of the classifier. It has been both theoretically and empirically established that the classification performance (hence bias and variance) of a single classifier can be improved partially by using a suitable ensemble method of the classifier and resampling the original training set. In this paper we have empirically examined the bias-variance decomposition of three different types of ensemble methods with different training sample sizes consisting of 10¥% to maximum 63¥% of the observations from the original training sample. First ensemble is bagging, second one is a boosting type ensemble named adaboost and the last one is a bagging type hybrid ensemble method, called bundling. All the ensembles are trained on training samples constructed with small subsampling ratios (SSR) 0.10, 0.20, 0.30, 0.40, 0.50 and bootstrapping. The experiments are all done on 20 UCI Machine Learning repository datasets and designed to find out the optimal training sample size (smaller than the original training sample) for each ensemble and then find out the optimal ensemble with smaller trianing sets with respect to the bias-variance performance. The bias-variance decomposition of bundling show that this ensemble method with small subsamples have significantly lower bias and variance than subsampled and bootstrapped version of bagging and adaboost.
Key Words
Classification Performance, Bias-Variance Decomposition, Small Subsample, Bagging, Boosting
Citation

Times Cited in Web of Science: 4

Times Cited in Google Scholar: 5

Cited in Books:

Cited in Proceedings:

Mathematical Review:

WoS: MEDICAL PHYSICS 巻: 40 号: 10 記事番号: 101906 発行: OCT 2013; JOURNAL OF UNIVERSAL COMPUTER SCIENCE 巻: 19 号: 4 ページ: 521-538 発行: 2013; INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE 巻: 22 号: 4 ページ: 841-854 DOI: 10.2478/v10006-012-0062-1 発行: DEC 2012; INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE 巻: 22 号: 4 ページ: 867-881 DOI: 10.2478/v10006-012-0064-z 発行: DEC 2012