inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step210040 2B • Updated about 7 hours ago
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt2 2B • Updated about 7 hours ago
inference-optimization/dflash-DeepSeek-V4-Flash-swa-muon-speculators-50k 2B • Updated about 1 hour ago
inference-optimization/Qwen3-8B-from-Qwen3-8B_regen-speculators.eagle31-llamaarch-ckpt1 1B • Updated about 23 hours ago • 12
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step189036 2B • Updated 3 days ago • 127
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt5 0.6B • Updated 3 days ago • 225
inference-optimization/Qwen3-8B-speculator.dflash.swa.causal-qwen235b-instruct-bs16-ckpt0 2B • Updated 4 days ago • 112
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step126024 2B • Updated 4 days ago • 329