You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Rename live_step_len parameter to unpadded_len for clarity
- Rename live_step_len -> unpadded_len across attention and KV cache modules
- Update documentation to clarify that unpadded_len specifies the number of
non-padding tokens per sequence, with actual behavior depending on KV cache implementation
- Fix pre-existing pylint error in rattention.py where rla_output was used before assignment
- Update all test files to use the new parameter name
The new name better reflects the parameter's purpose: indicating the number of
non-padding tokens in each sequence, rather than the ambiguous "live step length".
Implementation behavior varies by KV cache type:
- Standard KVCache: ignores the parameter
- SlidingWindowKVCache: uses it for sequence masking
- PagedKVCache: ignores the parameter
GitOrigin-RevId: 5b0d848
0 commit comments