-
Notifications
You must be signed in to change notification settings - Fork 31
[WS1] dtype coverage in the numerical contract #154
Copy link
Copy link
Open
Labels
featureplatform: cudaSpecific optimizations or bugs in NVIDIA graphics cards (such as FlashInfer, TMA optimizations)Specific optimizations or bugs in NVIDIA graphics cards (such as FlashInfer, TMA optimizations)priority: highSevere congestion issues require the highest priority for resolution.Severe congestion issues require the highest priority for resolution.sprint-0615type: designIssues requiring in-depth discussion of architecture designIssues requiring in-depth discussion of architecture design
Metadata
Metadata
Labels
featureplatform: cudaSpecific optimizations or bugs in NVIDIA graphics cards (such as FlashInfer, TMA optimizations)Specific optimizations or bugs in NVIDIA graphics cards (such as FlashInfer, TMA optimizations)priority: highSevere congestion issues require the highest priority for resolution.Severe congestion issues require the highest priority for resolution.sprint-0615type: designIssues requiring in-depth discussion of architecture designIssues requiring in-depth discussion of architecture design
Type
Fields
Give feedbackNo fields configured for issues without a type.
Part of WS1 — Full Batch-Invariant Forward Chain (epic: #)
Why
"Aligned" is only meaningful relative to a pinned set of dtypes. RL training runs in BF16, so BF16 invariance is mandatory; FP32 is the reference for tolerance. Without pinning this, different ops could be validated under different dtypes and the chain-level guarantee would be meaningless. This issue locks the dtype axis of the #108 contract.
Scope
Pin the dtype set every WS1 op validates against, and block finalization of the #108 contract until this axis is resolved.
Initial per-op recommendations (to be ratified in the contract):
Out of scope
Acceptance criteria
Notes
Planned PRs