Posts tagged attention

MiniMax Sparse Attention: Teaching Long-Context Models to Use an Index
MiniMax Sparse Attention turns long context into searchable memory: a learned index selects relevant key-value blocks, then exact softmax attention reads only those blocks.
12 min read · June 15, 2026
2026 · LLM · attention · long context · sparse attention · pretraining