Foundations and Trends® in Computer Graphics and Vision > Vol 7 > Issue 2–3

Decision Forests: A Unified Framework for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning

By Antonio Criminisi, Microsoft Research Ltd., UK, antcrim@microsoft.com | Jamie Shotton, Microsoft Research Ltd., UK, jamiesho@microsoft.com | Ender Konukoglu, Microsoft Research Ltd., UK, enderk@microsoft.com

 
Suggested Citation
Antonio Criminisi, Jamie Shotton and Ender Konukoglu (2012), "Decision Forests: A Unified Framework for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning", Foundations and Trends® in Computer Graphics and Vision: Vol. 7: No. 2–3, pp 81-227. http://dx.doi.org/10.1561/0600000035

Publication Date: 29 Mar 2012
© 2012 A. Criminisi, J. Shotton and E. Konukoglu
 
Subjects
Learning and statistical methods,  Object and scene recognition
 

Free Preview:

Download extract

Share

Download article
In this article:
1. Overview and Scope 
2. The Random Decision Forest Model 
3. Classification Forests 
4. Regression Forests 
5. Density Forests 
6. Manifold Forests 
7. Semi-supervised Forests 
8. Random Ferns and Other Forest Variants 
9. Conclusions 
Appendix A. Deriving the Regression Information Gain 
Acknowledgments 
References 

Abstract

This review presents a unified, efficient model of random decision forests which can be applied to a number of machine learning, computer vision, and medical image analysis tasks.

Our model extends existing forest-based techniques as it unifies classification, regression, density estimation, manifold learning, semi-supervised learning, and active learning under the same decision forest framework. This gives us the opportunity to write and optimize the core implementation only once, with application to many diverse tasks.

The proposed model may be used both in a discriminative or generative way and may be applied to discrete or continuous, labeled or unlabeled data.

The main contributions of this review are: (1) Proposing a unified, probabilistic and efficient model for a variety of learning tasks; (2) Demonstrating margin-maximizing properties of classification forests; (3) Discussing probabilistic regression forests in comparison with other nonlinear regression algorithms; (4) Introducing density forests for estimating probability density functions; (5) Proposing an efficient algorithm for sampling from a density forest; (6) Introducing manifold forests for nonlinear dimensionality reduction; (7) Proposing new algorithms for transductive learning and active learning. Finally, we discuss how alternatives such as random ferns and extremely randomized trees stem from our more general forest model.

This document is directed at both students who wish to learn the basics of decision forests, as well as researchers interested in the new contributions. It presents both fundamental and novel concepts in a structured way, with many illustrative examples and real-world applications. Thorough comparisons with state-of-the-art algorithms such as support vector machines, boosting and Gaussian processes are presented and relative advantages and disadvantages discussed. The many synthetic examples and existing commercial applications demonstrate the validity of the proposed model and its flexibility.

DOI:10.1561/0600000035
ISBN: 978-1-60198-540-8
168 pp. $99.00
Buy book (pb)
 
ISBN: 978-1-60198-541-5
168 pp. $220.00
Buy E-book (.pdf)
Table of contents:
1. Overview and Scope
2. The Random Decision Forest Model
3. Classification Forests
4. Regression Forests
5. Density Forests
6. Manifold Forests
7. Semi-supervised Forests
8. Random Ferns and Other Forest Variants
9. Conclusions
Appendix A. Deriving the Regression Information Gain
Acknowledgements
References

Decision Forests

In recent years, decision forests have established themselves as one of the most promising techniques in machine learning, computer vision and medical image analysis. This book is directed at engineers and PhD students who wish to learn the basics of decision forests as well as more senior researchers who wish to push the state of the art in automated image understanding.

The authors presents a unified, efficient model of random decision forests which can be used in a number of applications such as scene recognition from photographs, object recognition in images, automatic diagnosis from radiological scans and document analysis. Such applications have traditionally been addressed by different, supervised or unsupervised machine learning techniques. In contrast, here we cast diverse tasks such as regression, classification and semi-supervised learning as instances of the same general decision forest model. The flexibility of the forest framework further extends to tasks such as density estimation, manifold learning and semi-supervised learning. The unified forest framework gives us the opportunity to implement and optimize the underlying algorithm only once, and then easily adapt it to individual applications with relatively small changes.

The theoretical basis and numerous explanatory examples presented in this book serve as a solid platform upon which to build exciting future research.

 
CGV-035