Mamba-3 SSM Drops With Inference-First Design Beating Transformers at Decode
3 days ago
5
Together.ai releases Mamba-3, an open-source state space model built for inference that outperforms Mamba-2 and matches Transformer decode speeds at 16K sequences. (Read More)