Enable Indexer cache for DS v3.2 decoding by RissyRan · Pull Request #3195 · AI-Hypercomputer/maxtext

RissyRan · 2026-02-19T19:53:22Z

Description

Enable Indexer cache for DS v3.2 decoding, to unblock the eval benchmark for DS v3.2 model with sparse attention bringup.

DS reference implementation for Indexer is here
Add init_indexer_cache & update_indexer_cache for indexer cache
Other small changes

Tests

All runners are green
Training end-to-end (no impact) with a smaller model version: link
Test against reference implementation still green: link
Decoding gives reasonable
- small seq len to skip the indexer: link
- large seq len to process the indexer max_prefill_predict_length=3072 max_target_length=4096: link

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

github-actions · 2026-02-19T20:31:02Z

🤖 Hi @RissyRan, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.

codecov · 2026-02-19T20:31:38Z

Codecov Report

❌ Patch coverage is 86.95652% with 6 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/maxtext/layers/attention_mla.py	84.21%	2 Missing and 4 partials ⚠️

📢 Thoughts on this report? Let us know!

github-actions · 2026-02-19T20:36:14Z

🤖 I'm sorry @RissyRan, but I was unable to process your request. Please see the logs for more details.

shuningjin

Thank you! There were some attention autoregressive/decoding tests:

mha:

maxtext/tests/unit/attention_test.py

Line 371 in 3fbe3be

def test_autoregression(self):
mla:

maxtext/tests/unit/attention_test.py

Line 1195 in 3fbe3be

def test_autoregression(self, rope_type):

Shall we add a similar one to test indexer cache in mla? example from gemini

RissyRan · 2026-02-20T17:43:46Z

Thank you! There were some attention autoregressive/decoding tests:

mha:

maxtext/tests/unit/attention_test.py

Line 371 in 3fbe3be

def test_autoregression(self):

mla:

maxtext/tests/unit/attention_test.py

Line 1195 in 3fbe3be

def test_autoregression(self, rope_type):

Shall we add a similar one to test indexer cache in mla? example from gemini

Sounds good. Also enabled the mla assertion, and did some sanity check for decoding: long seq, short seq

RissyRan requested review from A9isha, NicoGrande, NuojCheng, SurbhiJainUSC, aireenmei, bvandermoon, gagika, gobbleturk, hengtaoguo, jesselu-google, jiangjy1982, khatwanimohit, parambole, richjames0, shralex, shuningjin, suexu1025 and vipannalla as code owners February 19, 2026 19:53

RissyRan changed the title ~~Dsv32 decode~~ Enable Indexer cache for DS v3.2 decoding Feb 19, 2026

RissyRan force-pushed the dsv32_decode branch 4 times, most recently from 257e3fe to 510e955 Compare February 19, 2026 20:04

RissyRan added the gemini-review label Feb 19, 2026

RissyRan removed the gemini-review label Feb 19, 2026

RissyRan assigned shuningjin and gpolovets1 Feb 19, 2026

shuningjin reviewed Feb 19, 2026

View reviewed changes

RissyRan force-pushed the dsv32_decode branch 3 times, most recently from a470971 to 9f2ba53 Compare February 20, 2026 08:29

Enable Indexer cache for DS v3.2 decoding

7414d69

RissyRan force-pushed the dsv32_decode branch from 9f2ba53 to 7414d69 Compare February 20, 2026 08:40

Let's find it

0dd9b13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable Indexer cache for DS v3.2 decoding#3195

Enable Indexer cache for DS v3.2 decoding#3195
RissyRan wants to merge 2 commits intomainfrom
dsv32_decode

RissyRan commented Feb 19, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

codecov bot commented Feb 19, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

shuningjin left a comment •

edited

Loading

Uh oh!

RissyRan commented Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

RissyRan commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests

Checklist

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

codecov bot commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

shuningjin left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RissyRan commented Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

RissyRan commented Feb 19, 2026 •

edited

Loading

codecov bot commented Feb 19, 2026 •

edited

Loading

shuningjin left a comment •

edited

Loading