Generate a GLM Tree model.

Parameters:

Parameters:
See dedicated page for more information.
Apply cluster-wise generalized linear models (GLM Trees) to discover segments where a different GLM best explains the target. The node jointly learns:
It’s ideal when a single global regression under-fits because relationships vary across sub-populations (e.g., price elasticity differs by visit intensity).
A single table with at least these columns:
gaussian, strictly positive for gamma, etc.)| Column | Role | Notes |
|---|---|---|
Sales |
Target | numeric (continuous) |
Price |
Predictor | numeric |
Promo |
Predictor | 0/1 or numeric |
Visits |
Covariate | numeric; used for splits |
| Section / ID | What it means | Typical setting |
|---|---|---|
| Target | Column to predict | Sales |
| Predictors (max 3) | Columns inside the GLM | Price, Promo (or up to 3) |
| Covariates | Columns that split the tree (segment drivers) | Visits (can be multiple) |
| OUT: output type | What to emit | chart (HTML report) during exploration; model to export a model |
| MOD: Model family | GLM family inside each leaf | gaussian (continuous); gamma (positive, skewed); others as provided |
| Title | Title on charts/reports | e.g., Explorative Model |
| Missing values | NA handling for numerics | replace by mean is a practical default |
| Quantity of breaks | Max candidate split points (coarsening) | 6 (balanced speed/quality) |
| Maximum depth | Max tree depth (number of splits) | 5 for moderate complexity |
| Pruning | Complexity criterion | bic (parsimonious) |
| Set model name | Path when OUT = model |
e.g., records/glmtree_sales.rModel |
Notes
- Predictors and covariates may overlap, but they do not need to.
- For
gammafamily: target values must be strictly positive.- Categorical predictors should be pre-encoded (e.g., 0/1 dummies).
Depending on OUT: output type:
chart / HTML document
model
records/glmtree_sales.rModel) you can reuse for scoring with the appropriate Apply Model action button.With the table shaped as described and the parameters set via the column pickers, this node will build a segmented regression that uncovers distinct response behavior across covariate-defined groups—often the clearest way to surface varying elasticities or campaign effects in a single, interpretable model.
