AI & ML interests
On-device AI, GGUF quantization, Apple Silicon, macOS automation
Recent Activity
BatiAI โ The Frontier, On Your Mac ๐
Kimi K2.6 โ 1T MoE, SWE-Bench Pro 58.6 (beats GPT-5.4 xhigh, Claude Opus 4.6) โ running locally on M3 Ultra. Gemma 4 E4B โ 57 tokens/sec on a 16GB Mac mini M4. No API costs. No rate limits. No cloud.
We quantize popular open-weight models for every Mac โ direct from official publisher weights, calibrated with imatrix, verified on real hardware, signed with provenance metadata.
โก Quick Start
# 16GB Mac mini M4 โ entry, 57 t/s
ollama pull batiai/gemma4-e4b:q4
# 512GB M3 Ultra โ 1T MoE frontier
ollama pull batiai/kimi-k2.6:iq4
Pick Your Mac โ Real Hardware Benchmarks
Every speed measured on actual hardware. Full reports in each model card. We continuously add new hardware as it ships.
| Your Mac | Best Pick | Size | Speed | Use Case |
|---|---|---|---|---|
| Mac mini M4 16GB | batiai/gemma4-e4b:q4 |
5.0 GB | 57 t/s | Daily chat, balanced |
| MacBook Air 16GB | batiai/qwen3.5-9b:q4 |
5.2 GB | 12.5 t/s | Tool calling, JSON |
| Mac mini M4 Pro 24GB | batiai/gemma4-26b:iq4 |
15 GB | 85 t/s | MoE, larger context |
| MacBook Pro 48GB | batiai/qwen3.6-35b:iq4 |
22 GB | ~30 t/s | Tools + thinking, MoE |
| MacBook Pro 96GB | batiai/qwen3.6-35b:q6 |
29 GB | ~27 t/s | Top quality chat |
| MacBook Pro M4 Max 128GB | batiai/minimax-m2.7:iq3 |
82 GB | 36.7 t/s | 229B Dense โ frontier class |
| Mac Studio M3 Ultra 512GB | batiai/kimi-k2.6:iq4 |
509 GB | โ | 1T MoE, SWE-Bench Pro 58.6 |
Real measurements, not estimates. Numbers expand as benchmarks come in.
Browse the Collections below โ for series-by-series lineup.
๐ฏ Why BatiAI?
๐ Direct from SourceQuantized from the publisher's official FP8/BF16 weights โ never re-quantized from third-party GGUFs. Every file signed |
๐ Verified on Real MacsTested on Mac mini M4 16GB + MacBook Pro M4 Max 128GB. Korean validation, tool-call JSON, 200-token throughput โ measured, reproducible, documented in each model card. |
โก imatrix-CalibratedEvery model uses importance-matrix calibration with wikitext-2. Aggressive IQ3_XXS keeps quality where plain Q3_K_M visibly degrades. |
๐ Frontier-CapableWe handle 1T MoE (Kimi K2.6) and 229B Dense (MiniMax M2.7). Most providers stop at 70B โ we have the storage, pipeline, and experience to go further. |
๐งฐ Built for Real Mac Users
| BatiFlow | 5MB native macOS app. Connects BatiAI models to 60+ tools โ KakaoTalk, iMessage, Slack, Calendar, Notes, Chrome, file system, browser. Free. Unlimited. 100% private. Even drives your Mac via Telegram / Discord / Slack bots. |
| Bati CIS | K-Beauty Commerce Intelligence โ settlement processing 3 days โ 3 hours, 42+ marketplaces. Trusted by COSRX, Pharma Research (Rejuran). Revenue Anomaly ยท Repurchase Cohort ยท Marketing Budget Optimizer. |
๐ง jk@bati.ai (enterprise) ยท ๐ฌ GitHub Issues ยท ๐ bati.ai
Private by default ยท On-device first ยท Verified every step.