Mano: Restriking Manifold Optimization for LLM Training Paper • 2601.23000 • Published 9 days ago • 2
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers Paper • 2602.01077 • Published 7 days ago • 3
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers Paper • 2602.01077 • Published 7 days ago • 3
Autoregressive Image Generation with Randomized Parallel Decoding Paper • 2503.10568 • Published Mar 13, 2025 • 9