Google's newest Gemma 4 models are both powerful and useful.
"""Precomputed metadata shared across multiple backend instances. seqlens_expanded: torch.Tensor # int32, [expanded_size] nsa_cache_seqlens: torch.Tensor # int32 ...
# you may not use this file except in compliance with the License. # You may obtain a copy of the License at # http://www.apache.org/licenses/LICENSE-2.0 ...