[tinker] Fix single request batching in TinkerEngine #1489
pcmoritz wants to merge 1 commit into NovaSky-AI:main from
Conversation
Code Review
This pull request introduces scheduling barriers to ensure that single requests, such as optimization steps, do not execute before preceding forward or forward-backward passes for the same model. It refactors the identification of destructive barriers into a reusable helper method and adds several regression tests to verify the scheduling logic. Feedback was provided regarding the performance of the logic used to identify blocked passes in find_single_requests, noting that the current implementation could be optimized to avoid potential performance issues as the number of pending requests grows.
```python
if destructive_barriers:
    pending_passes = session.exec(
        select(FutureDB.model_id, FutureDB.request_id)
        .where(
            (FutureDB.request_type == types.RequestType.FORWARD_BACKWARD)
            | (FutureDB.request_type == types.RequestType.FORWARD)
        )
        .where(FutureDB.status == RequestStatus.PENDING)
        .order_by(FutureDB.request_id)
    ).all()
    for model_id, req_id in pending_passes:
        if model_id in destructive_barriers and req_id >= destructive_barriers[model_id]:
            blocked_pass_barriers.setdefault(model_id, req_id)
```
The logic for identifying blocked passes iterates over all pending passes on every call to find_single_requests. This can be optimized by using a dictionary lookup or a more efficient SQL query to avoid O(N*M) complexity, where N is the number of pending passes and M is the number of models.
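A minimal sketch of the reviewer's suggestion, assuming the shapes of `pending_passes` and `destructive_barriers` match the diff above (the function name and sample data below are hypothetical): a single O(N) pass over the pending requests, with one dict lookup per request, recording only the earliest blocked pass per model.

```python
def find_blocked_pass_barriers(pending_passes, destructive_barriers):
    """Return {model_id: earliest pending pass at/after that model's barrier}.

    pending_passes: list of (model_id, request_id), sorted by request_id
    destructive_barriers: {model_id: request_id of the destructive update}
    """
    blocked = {}
    for model_id, req_id in pending_passes:
        barrier = destructive_barriers.get(model_id)  # O(1) dict lookup
        # Record only the first (lowest request_id) blocked pass per model.
        if barrier is not None and req_id >= barrier and model_id not in blocked:
            blocked[model_id] = req_id
    return blocked

# Illustrative data: model "m1" had a destructive update at request 4,
# so its pending pass 5 is blocked while pass 3 and model "m2" are not.
pending = [("m1", 3), ("m1", 5), ("m2", 4)]
barriers = {"m1": 4}
print(find_blocked_pass_barriers(pending, barriers))  # → {'m1': 5}
```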
For each model, we currently make sure not to batch forward/forward-backward requests that come after a destructive update such as optim_step or load_weights. In addition, we also need to make sure not to batch destructive updates that come after forward/forward-backward requests.
E.g. for a sequence like optim1 → fwdbwd2 → optim2, we do not want to process optim2 before fwdbwd2 has run.
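The two-way barrier rule described above can be sketched as follows; this is an illustrative standalone model, not the TinkerEngine implementation, and all names (`can_dispatch`, the request tuples) are hypothetical:

```python
# Destructive updates must wait for earlier pending passes on the same
# model, and passes must wait for earlier pending destructive updates.
DESTRUCTIVE = {"optim_step", "load_weights"}
PASSES = {"forward", "forward_backward"}

def can_dispatch(candidate, pending):
    """Entries are (request_id, model_id, request_type) tuples."""
    req_id, model_id, req_type = candidate
    for other_id, other_model, other_type in pending:
        # Only earlier requests for the same model can block us.
        if other_model != model_id or other_id >= req_id:
            continue
        # optim2 must not run before an earlier pending fwdbwd2.
        if req_type in DESTRUCTIVE and other_type in PASSES:
            return False
        # A pass must not run before an earlier pending destructive update.
        if req_type in PASSES and other_type in DESTRUCTIVE:
            return False
    return True

# The optim1 → fwdbwd2 → optim2 sequence: optim2 is held back
# while fwdbwd2 is still pending.
pending = [(1, "m", "optim_step"), (2, "m", "forward_backward")]
print(can_dispatch((3, "m", "optim_step"), pending))  # → False
```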