Overview

Unlike Random Forest, where trees are built independently and in parallel, Gradient Boosting builds trees sequentially. Each new tree is trained to correct the 'residual error' (the difference between the actual and predicted values) of the existing ensemble, so the ensemble's mistakes shrink with every iteration. For squared-error loss, those residuals are exactly the negative gradient of the loss with respect to the current predictions, which is where the name comes from.

The Process

  1. Start with a simple baseline model (for regression, typically the mean of the target values).
  2. Calculate the errors (residuals).
  3. Train a new model to predict those residuals.
  4. Add the new model to the ensemble, scaled by a small 'learning rate' that limits each tree's contribution and helps prevent overfitting.
  5. Repeat for many iterations.
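The steps above can be sketched in a few dozen lines. This is a minimal illustration with squared-error loss, using one-split regression "stumps" as the weak learners instead of full decision trees; all function names here are invented for the example, and real implementations use far more careful tree construction.

```python
def fit_stump(x, residuals):
    """Step 3: fit a weak learner (a one-split stump) to the residuals."""
    best = None
    for t in sorted(set(x)):
        left = [r for xi, r in zip(x, residuals) if xi <= t]
        right = [r for xi, r in zip(x, residuals) if xi > t]
        if not left or not right:
            continue
        lmean, rmean = sum(left) / len(left), sum(right) / len(right)
        sse = (sum((r - lmean) ** 2 for r in left)
               + sum((r - rmean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, t, lmean, rmean)
    _, t, lmean, rmean = best
    return lambda xi: lmean if xi <= t else rmean

def predict(base, stumps, learning_rate, xi):
    # Step 4: each stump's output is shrunk by the learning rate.
    return base + learning_rate * sum(s(xi) for s in stumps)

def gradient_boost(x, y, n_rounds=50, learning_rate=0.1):
    base = sum(y) / len(y)  # Step 1: start from the mean of the targets.
    stumps = []
    for _ in range(n_rounds):  # Step 5: repeat for many iterations.
        preds = [predict(base, stumps, learning_rate, xi) for xi in x]
        residuals = [yi - pi for yi, pi in zip(y, preds)]  # Step 2
        stumps.append(fit_stump(x, residuals))
    return base, stumps

# Toy data: y = x^2. Training error shrinks as rounds accumulate.
x = [0.0, 1.0, 2.0, 3.0, 4.0]
y = [0.0, 1.0, 4.0, 9.0, 16.0]
base, stumps = gradient_boost(x, y)
```

Note how the learning rate and the round count trade off against each other: a smaller learning rate needs more rounds but usually generalizes better.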

Popular Implementations

  • XGBoost
  • LightGBM
  • CatBoost

Related Terms