deqing/convergent-llama-300M-muon-6digit-addition_6digit_custom6 1B • Updated about 1 month ago • 169
deqing/convergent-llama-300M-muon-6digit-addition_6digit_llama Text Generation • 0.3B • Updated Jun 3 • 233 • 1
deqing/convergent-llama-300M-muon-6digit-addition_6digit_custom3 Text Generation • 0.2B • Updated Jun 2 • 189 • 1
deqing/convergent-llama-300M-muon-base15-addition_base15 Text Generation • 0.2B • Updated May 31 • 93
deqing/convergent-llama-300M-muon-base12-addition_base12 Text Generation • 0.2B • Updated May 30 • 180
deqing/convergent-llama-300M-muon-4digit-addition_4digit_custom3_right2left 0.2B • Updated May 30 • 11