fixed the logit lens implementation inside ActivationCache.accumulated_resid to match the standard definition in literature and the expected and defined behavior as per the documentation in the docstring and in the docs#1077
Merged
jlarson4 merged 4 commits intoTransformerLensOrg:devfrom Mar 16, 2026