Smoothing feature contribution and spatial pattern copying from dominant feature

Hi all,
I am working on a simple binary classification task using ExplainableBoostingClassifier, and I have a couple of questions about smoothing feature contributions and an artifact I'm seeing in spatial predictions.

I am trying to get smoother feature contribution plots, especially by reducing jagged contributions across bins. With my default setup of EBM, the contribution of a variable of my interest is 
<img width="1207" height="345" alt="Image" src="https://github.com/user-attachments/assets/28aaa373-8c74-41e7-b98d-47fed13a0ab3" />

I then tried setting smoothing_rounds == max_rounds (or very large smoothing_rounds) to force smoother versions.  The contribution of the same variable becomes smoother, but I have also noticed a change in the scores, particularly in the upper bins: the contribution shifts up compared to the jagged version.

<img width="1301" height="478" alt="Image" src="https://github.com/user-attachments/assets/8448da7c-662f-44ac-bd45-f15de79225b0" />

1. Is it expected that increasing smoothing_rounds changes the magnitude of the contribution? 
2. Is smoothing_rounds designed to obtain smoother plots, or is it better to force smoothness with other parameters? Are there suggested ranges of values of smoothing_rounds/other parameters to get smoother plots?
3. Should I also try with regularization (reg_alpha, reg_lambda) for this purpose? 

My dataset is spatial, and after fitting the model, I display predicted probabilities across space. In both classes, the resulting probability map reproduces spatial pattern of such a predictor (the same one on which I am interested in getting a smoother contribution). This predictor also reports the highest mean absolute score. It looks like the model is relying heavily on this feature (a sort of overfitting). However, when using a "smoother" EBM version, I observed that these artifacts become less pronounced.  By artifacts, I mean spatial changes in probabilities that closely follow the changes of that variable. I short, I observe sharp increases or decreases in probabilities that align with where this variable changes.

1. Are there suggested strategies to mitigate this "spatial pattern" copying beyond just increasing smoothing?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Smoothing feature contribution and spatial pattern copying from dominant feature #651

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Smoothing feature contribution and spatial pattern copying from dominant feature #651

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions