We Got Claude to Build CUDA Kernels and teach open models! Article • 19 days ago • 138
Memory Bank Compression for Continual Adaptation of Large Language Models Paper • 2601.00756 • Published Jan 2 • 2
mlx-community/LFM2.5-Audio-1.5B-bf16 Audio-to-Audio • 1B • Updated about 1 month ago • 274 • 10
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 17 days ago • 10.2k • 475