Skip to content

Smoothing feature contribution and spatial pattern copying from dominant feature #651

@fcaleca1

Description

@fcaleca1

Hi all,
I am working on a simple binary classification task using ExplainableBoostingClassifier, and I have a couple of questions about smoothing feature contributions and an artifact I'm seeing in spatial predictions.

I am trying to get smoother feature contribution plots, especially by reducing jagged contributions across bins. With my default setup of EBM, the contribution of a variable of my interest is
Image

I then tried setting smoothing_rounds == max_rounds (or very large smoothing_rounds) to force smoother versions. The contribution of the same variable becomes smoother, but I have also noticed a change in the scores, particularly in the upper bins: the contribution shifts up compared to the jagged version.

Image
  1. Is it expected that increasing smoothing_rounds changes the magnitude of the contribution?
  2. Is smoothing_rounds designed to obtain smoother plots, or is it better to force smoothness with other parameters? Are there suggested ranges of values of smoothing_rounds/other parameters to get smoother plots?
  3. Should I also try with regularization (reg_alpha, reg_lambda) for this purpose?

My dataset is spatial, and after fitting the model, I display predicted probabilities across space. In both classes, the resulting probability map reproduces spatial pattern of such a predictor (the same one on which I am interested in getting a smoother contribution). This predictor also reports the highest mean absolute score. It looks like the model is relying heavily on this feature (a sort of overfitting). However, when using a "smoother" EBM version, I observed that these artifacts become less pronounced. By artifacts, I mean spatial changes in probabilities that closely follow the changes of that variable. I short, I observe sharp increases or decreases in probabilities that align with where this variable changes.

  1. Are there suggested strategies to mitigate this "spatial pattern" copying beyond just increasing smoothing?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions