Description about bagging and boosting wrong?

Question

HAICHUAN TANG on 8 Aug 2018

0
Link

Direct link to this question

https://ch.mathworks.com/matlabcentral/answers/414048-description-about-bagging-and-boosting-wrong

Answered: Akshat on 26 Nov 2024

In the "Framework for Ensemble Learning", the ensemble algorithms are described as "Bag generally constructs deep trees. This construction is both time consuming and memory-intensive. This also leads to relatively slow predictions. Boost algorithms generally use very shallow trees. This construction uses relatively little time or memory. However, for effective predictions, boosted trees might need more ensemble members than bagged trees. Therefore it is not always clear which class of algorithms is superior." Should this be inverse? To my understanding, Bag is shallow and boost is deep...

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Akshat on 26 Nov 2024

0
Link

Direct link to this answer

https://ch.mathworks.com/matlabcentral/answers/414048-description-about-bagging-and-boosting-wrong#answer_1550423

Hi @HAICHUAN TANG,

The description you provided from the "Framework for Ensemble Learning" is accurate in terms of general characteristics of bagging and boosting, but there might be some confusion regarding the depth of trees used in these methods:

Bagging (Bootstrap Aggregating):

Deep Trees: Bagging typically uses deep trees, such as in the Random Forest algorithm, where each tree is grown to a large depth to capture complex patterns. This results in high variance for individual trees, which bagging mitigates by averaging their predictions.
Resource Intensive: The creation of deep trees is indeed time-consuming and memory-intensive, leading to slower predictions.

Boosting:

Shallow Trees: Boosting often uses shallow trees, sometimes referred to as "stumps," which are trees with a small number of levels (often just one or two). The idea is to sequentially correct the errors of previous trees by focusing on difficult-to-predict instances.
More Trees Needed: Because each tree is weak, boosting typically requires more trees to achieve strong predictive performance.

Thus, the description correctly contrasts the two methods in terms of tree depth and resource usage.

You can refer to this link to get more information on why Bagging involves deep trees and Boosting uses shallow trees.

https://datascience.stackexchange.com/questions/61771/bagging-vs-boosting-bias-vs-variance-depth-of-trees

Hope this helps!

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Description about bagging and boosting wrong?

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

Description about bagging and boosting wrong?

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments