Commit History

Fix vision image freeze: reduce maxPixels, parallelize ViT attention
f265480

Ex0bit Claude Opus 4.6 commited on

Remove stale build assets
b33a606

Ex0bit commited on

Fix vision preprocessing: correct normalization and min_pixels
4916c34

Ex0bit Claude Opus 4.6 commited on

Remove debug tokenizer exposure, clean build
a064377

Ex0bit commited on

Fix thinking mode: always show tokens, auto-switch sampling params, expose toggle
f57241c

Ex0bit commited on

Clean chat template: EOS before callback, skip_special_tokens, full-sequence decode for UTF-8
2427e68

Ex0bit commited on

Fix smart_resize to match HF reference (floor/ceil rounding, correct min/max pixels)
82e4e21

Ex0bit commited on

Fix vision preprocessing: CLIP normalization and ViT RoPE frequencies
8c77fa3

Ex0bit Claude Opus 4.6 commited on

Add vision/multimodal support across all Qwen3.5 models
53772ae

Ex0bit Claude Opus 4.6 commited on

Lock to 2B model, instruct-general only, rep penalty 1.15
c174d98

Ex0bit Claude Opus 4.6 commited on

Default to 0.8B, cap at 2B for MacBooks, repetition penalty 1.15
88291b0

Ex0bit commited on

Fix boot regression: render synchronously, probe GPU in background
f606de1

Ex0bit Claude Opus 4.6 commited on

Add WebGPU support check and hardware-adaptive model selection
65413d3

Ex0bit Claude Opus 4.6 commited on

Sync latest local build with updated model & GPU ops
db79cca

Ex0bit Claude Opus 4.6 commited on

Fix asset paths: use relative base for HF Spaces compatibility
0c2960a

Ex0bit commited on

Deploy TensorBend — browser-based LLM inference via WebGPU
21a8eeb

Ex0bit commited on

initial commit
05acc9f
verified

Ex0bit commited on