"Boosted Classification Trees and Class Probability/Quantile Estimation" by David Mease, A. Wyner et al.

Faculty Publications

Title

Boosted Classification Trees and Class Probability/Quantile Estimation

Authors

David Mease, San Jose State University
A. Wyner, Wharton School, University of Pennsylvania
A. Buja, Wharton School, University of Pennsylvania

Document Type

Article

Publication Date

1-1-2007

Publication Title

Journal of Machine Learning Research

First Page

409

Last Page

439

Keywords

boosting algorithms, LogitBoost, AdaBoost, class probability estimation, over-sampling, under-sampling, stratification, data jittering

Disciplines

Marketing | Statistics and Probability

Abstract

The standard by which binary classifiers are usually judged, misclassification error, assumes equal costs of misclassifying the two classes or, equivalently, classifying at the 1/2 quantile of the conditional class probability function P[y = 1|x]. Boosted classification trees are known to perform quite well for such problems. In this article we consider the use of standard, off-the-shelf boosting for two more general problems: 1) classification with unequal costs or, equivalently, classification at quantiles other than 1/2, and 2) estimation of the conditional class probability function P[y = 1|x]. We first examine whether the latter problem, estimation of P[y = 1|x], can be solved with LogitBoost, and with AdaBoost when combined with a natural link function. The answer is negative: both approaches are often ineffective because they overfit P[y = 1|x] even though they perform well as classifiers. A major negative point of the present article is the disconnect between class probability estimation and classification. Next we consider the practice of over/under-sampling of the two classes. We present an algorithm that uses AdaBoost in conjunction with Over/Under-Sampling and Jittering of the data (“JOUS-Boost”). This algorithm is simple, yet successful, and it preserves the advantage of relative protection against overfitting, but for arbitrary misclassification costs and, equivalently, arbitrary quantile boundaries. We then use collections of classifiers obtained from a grid of quantiles to form estimators of class probabilities. The estimates of the class probabilities compare favorably to those obtained by a variety of methods across both simulated and real data sets.
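The equivalence the abstract draws between unequal misclassification costs and classification at a quantile other than 1/2 can be illustrated with a small sketch. This is not code from the article; it is standard decision-theoretic arithmetic, and the names `quantile_threshold`, `classify`, `expected_cost`, `cost_fp`, and `cost_fn` are hypothetical, chosen only for illustration. It assumes the true probability p = P[y = 1|x] is known, which is exactly what a real classifier does not have.

```python
def quantile_threshold(cost_fp: float, cost_fn: float) -> float:
    """Cutoff q on p = P[y = 1|x] that minimizes expected cost.

    cost_fp: cost of predicting 1 when the true class is 0 (assumed name).
    cost_fn: cost of predicting 0 when the true class is 1 (assumed name).
    Predicting 1 has expected cost (1 - p) * cost_fp; predicting 0 has
    expected cost p * cost_fn. Predicting 1 is cheaper exactly when
    p > cost_fp / (cost_fp + cost_fn).
    """
    if cost_fp <= 0 or cost_fn <= 0:
        raise ValueError("costs must be positive")
    return cost_fp / (cost_fp + cost_fn)


def expected_cost(p: float, prediction: int, cost_fp: float, cost_fn: float) -> float:
    """Expected cost of a prediction given the true class-1 probability p."""
    return (1.0 - p) * cost_fp if prediction == 1 else p * cost_fn


def classify(p: float, cost_fp: float, cost_fn: float) -> int:
    """Predict class 1 when p exceeds the cost-implied quantile."""
    return 1 if p > quantile_threshold(cost_fp, cost_fn) else 0


if __name__ == "__main__":
    # Equal costs recover the usual 1/2 cutoff.
    print(quantile_threshold(1.0, 1.0))
    # A false negative three times as costly as a false positive
    # moves the cutoff to the 1/4 quantile.
    print(quantile_threshold(1.0, 3.0))
    print(classify(0.3, 1.0, 3.0), classify(0.3, 1.0, 1.0))
```

With equal costs the cutoff is 1/2, the misclassification-error setting. Any other cost ratio shifts the cutoff, so a classifier tuned for the 1/2 boundary is solving a different problem.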

Recommended Citation

David Mease, A. Wyner, and A. Buja. "Boosted Classification Trees and Class Probability/Quantile Estimation" Journal of Machine Learning Research (2007): 409-439.
