How does RegressionTree.fit works?

1 view (last 30 days)
Stephanie
Stephanie on 4 Feb 2013
Hi,
I made a regression tree. now I will describe the process. But I cannot find the procedure as described in the literature. I know that the split is formed with MSE, but the calculation is never done.
The documentation center (<http://www.mathworks.de/de/help/stats/classification-trees-and-regression-trees.html>) describe the process a little bit, but there are no code.
Can you help me and give a hint where i can find this?
thanks

Accepted Answer

Ilya
Ilya on 4 Feb 2013
The MATLAB implementation is based on this book:
Breiman, L., J. Friedman, R. Olshen, and C. Stone. Classification and Regression Trees. Boca Raton, FL: CRC Press, 1984.
  3 Comments
Ilya
Ilya on 4 Feb 2013
You won't be able to see the actual code. The split search in RegressionTree is coded in C++ for speed, and the code is not shipped.
You can compute MSE using the var function from the Statistics Toolbox. For example, let y be a vector of response values for all observations landing on a specific node of the decision tree. The value predicted by the tree for this node is then mean(y). The MSE associated with this node is var(y,1). The weighted MSE used to compute the split gain is numel(y)*var(y,1).
Stephanie
Stephanie on 4 Feb 2013
Thanks this is a great answer. I was really confused that I cannot see the code, but now its clear. That is an argumentation that I can use in my attachment, thank you very much.

Sign in to comment.

More Answers (0)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!