Stheno v3.4 on WebGPU
Sao10K's Stheno v3.4 (Llama-3.1-8B) running in the browser via WebGPU. An upgrade from the v3.2 MLC package. The Q4_K_M quantization is 4.6 GB.
This is the voice model behind the Garden entity system, used for character embodiment, creative writing, and entity voice.
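Because the model runs through WebGPU, the browser must expose that API before anything will load. A minimal sketch of such a check follows; `navigator.gpu` and `requestAdapter()` are part of the standard WebGPU API, while the helper name `hasWebGPU` is hypothetical and not taken from this package.

```javascript
// Hypothetical helper: resolves to true when a WebGPU adapter can be obtained.
// `nav` defaults to the browser's navigator; pass a stub to try it elsewhere.
async function hasWebGPU(nav = globalThis.navigator) {
  // The API is not exposed at all (older browser, or WebGPU disabled).
  if (!nav || !nav.gpu) return false;
  // requestAdapter() resolves to null when no suitable GPU is available.
  const adapter = await nav.gpu.requestAdapter();
  return adapter !== null;
}
```

In a page, this could be called once at startup, for example `hasWebGPU().then((ok) => { if (!ok) alert("WebGPU is not available"); })`.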
Quick Start
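- Download the Q4_K_M GGUF from bartowski
- Split it into shards of at most 1 GB: llama-gguf-split --split --split-max-size 1G <input.gguf> <output-prefix>
- Place the resulting shards in model_splits/
- Run node serve.js (serves on port 8210)
- Open http://localhost:8210 in a WebGPU-capable browser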
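Before starting the server, it can help to confirm that every shard made it into model_splits/. The sketch below assumes the naming scheme llama-gguf-split uses for its output (`<prefix>-00001-of-0000N.gguf`); the function name `missingShards` is hypothetical and is not part of serve.js.

```javascript
// Shard names as written by llama-gguf-split: <prefix>-00001-of-00005.gguf
const SHARD_RE = /^(.+)-(\d{5})-of-(\d{5})\.gguf$/;

// Given the file names found in a directory, return the 1-based indices of
// shards that are missing. Files that do not match the pattern are ignored.
// Throws if no shard files are present at all.
function missingShards(fileNames) {
  const shards = fileNames.map((name) => SHARD_RE.exec(name)).filter(Boolean);
  if (shards.length === 0) throw new Error("no .gguf shards found");

  // The total is read from the first shard name; all shards of one split share it.
  const total = Number(shards[0][3]);
  const seen = new Set(shards.map((m) => Number(m[2])));

  const missing = [];
  for (let i = 1; i <= total; i++) {
    if (!seen.has(i)) missing.push(i);
  }
  return missing;
}
```

From Node, the directory listing can be passed in directly: `missingShards(require("node:fs").readdirSync("model_splits"))`. An empty array means the set is complete.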
Credits
Built by Joshua (LJTSG) and Claude. Model by Sao10K. Co-Authored-By: Claude <noreply@anthropic.com>
Model tree for LJTSG/Stheno-v3.4-webgpu
Base model: Sao10K/Llama-3.1-8B-Stheno-v3.4