inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-muon-ckpt4 2B • Updated 1 day ago
inference-optimization/Qwen3-8B-speculator.dflash.fullattn-qwen235b-instruct-bs16-ckpt0 2B • Updated 2 days ago • 1
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt4 2B • Updated 6 days ago • 23
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt3 2B • Updated 7 days ago • 227
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step210040 2B • Updated 8 days ago • 278
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt2 2B • Updated 8 days ago • 8
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step189036 2B • Updated 11 days ago • 130
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt5 0.6B • Updated 11 days ago • 1.11k
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt0 2B • Updated 12 days ago • 113