-
Notifications
You must be signed in to change notification settings - Fork 777
Description
Hi all,
I am working on a simple binary classification task using ExplainableBoostingClassifier, and I have a couple of questions about smoothing feature contributions and an artifact I'm seeing in spatial predictions.
I am trying to get smoother feature contribution plots, especially by reducing jagged contributions across bins. With my default setup of EBM, the contribution of a variable of my interest is

I then tried setting smoothing_rounds == max_rounds (or very large smoothing_rounds) to force smoother versions. The contribution of the same variable becomes smoother, but I have also noticed a change in the scores, particularly in the upper bins: the contribution shifts up compared to the jagged version.
- Is it expected that increasing smoothing_rounds changes the magnitude of the contribution?
- Is smoothing_rounds designed to obtain smoother plots, or is it better to force smoothness with other parameters? Are there suggested ranges of values of smoothing_rounds/other parameters to get smoother plots?
- Should I also try with regularization (reg_alpha, reg_lambda) for this purpose?
My dataset is spatial, and after fitting the model, I display predicted probabilities across space. In both classes, the resulting probability map reproduces spatial pattern of such a predictor (the same one on which I am interested in getting a smoother contribution). This predictor also reports the highest mean absolute score. It looks like the model is relying heavily on this feature (a sort of overfitting). However, when using a "smoother" EBM version, I observed that these artifacts become less pronounced. By artifacts, I mean spatial changes in probabilities that closely follow the changes of that variable. I short, I observe sharp increases or decreases in probabilities that align with where this variable changes.
- Are there suggested strategies to mitigate this "spatial pattern" copying beyond just increasing smoothing?