baa.ai

company

https://baa.ai

black-sheep-ai-baa

AI & ML interests

Model Quantization

Recent Activity

tomkay updated a collection 11 days ago

LTX-2.3 RAM Quantized (MLX)

tomkay updated a model 11 days ago

baa-ai/LTX-2.3-22B-RAM-24GB-MLX

baa-ai's collections (16)

LTX-2.3 RAM Quantized (MLX)

Mixed-precision MLX quantizations of LTX-2.3 for Apple Silicon using the RAM MCKP allocator. 12 GB and 24 GB variants.

baa-ai/LTX-2.3-22B-RAM-12GB-MLX

Updated 11 days ago
baa-ai/LTX-2.3-22B-RAM-24GB-MLX

Updated 11 days ago

Kimi-K2.6

Mixed-precision GGUF quantizations of moonshotai/Kimi-K2.6 from the RAM pipeline (per-tensor bit allocation via sensitivity probing).

baa-ai/Kimi-K2.6-RAM-344GB-GGUF

Text Generation • 1T • Updated Apr 21 • 610 • 1
baa-ai/Kimi-K2.6-RAM-447GB-GGUF

Text Generation • 1T • Updated Apr 21 • 648
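
The collection descriptions mention an "MCKP allocator" and "per-tensor bit allocation via sensitivity probing" but do not say how either works. The sketch below is only an illustration of the general idea, assuming MCKP refers to the multiple-choice knapsack problem: each tensor gets exactly one bit-width, and the total size must stay within a budget while the summed error estimate is minimised. The function name `allocate_bits`, the option tuples, and all numbers are hypothetical and are not taken from the baa.ai pipeline.

```python
from typing import Dict, List, Tuple

# One candidate per tensor: (bits, size in abstract units, estimated error).
# In a real pipeline the error would come from some sensitivity measurement;
# here it is simply supplied by the caller (an assumption of this sketch).
Option = Tuple[int, int, float]


def allocate_bits(
    tensors: Dict[str, List[Option]], budget: int
) -> Tuple[float, Dict[str, int]]:
    """Choose exactly one option per tensor, minimising total error
    while keeping the total size at or below `budget` (integer units)."""
    inf = float("inf")
    # best[s] = (lowest error, chosen bits per tensor) with exactly s units used
    best: List[Tuple[float, Dict[str, int]]] = [(inf, {})] * (budget + 1)
    best[0] = (0.0, {})

    for name, options in tensors.items():
        nxt: List[Tuple[float, Dict[str, int]]] = [(inf, {})] * (budget + 1)
        for used, (err, choice) in enumerate(best):
            if err == inf:
                continue  # this size is unreachable with the tensors seen so far
            for bits, size, opt_err in options:
                total = used + size
                if total <= budget and err + opt_err < nxt[total][0]:
                    nxt[total] = (err + opt_err, {**choice, name: bits})
        best = nxt

    result = min(best, key=lambda entry: entry[0])
    if result[0] == inf:
        raise ValueError("no allocation fits the budget")
    return result


if __name__ == "__main__":
    # Made-up tensors and numbers, purely for demonstration.
    demo = {
        "attn.q_proj": [(2, 2, 0.90), (4, 4, 0.20), (8, 8, 0.01)],
        "mlp.up_proj": [(2, 4, 0.30), (4, 8, 0.05), (8, 16, 0.00)],
        "embed": [(2, 1, 1.50), (4, 2, 0.40), (8, 4, 0.02)],
    }
    error, bits_per_tensor = allocate_bits(demo, budget=16)
    print(error, bits_per_tensor)
```

The dynamic program is exact but its cost grows with the budget measured in integer units, so a practical allocator over thousands of tensors would likely use coarser size units or a different solver; the page gives no detail on that choice.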

DeepSeek-V3.2 (MLX)

Mixed-precision MLX builds of deepseek-ai/DeepSeek-V3.2 for Apple Silicon.

baa-ai/DeepSeek-V3.2-RAM-350GB-MLX

Text Generation • 672B • Updated Apr 16 • 414

Gemma 4

RAM-optimised Gemma 4 models by baa.ai.

baa-ai/Gemma-4-31B-it-RAM-30GB-MLX

9B • Updated Apr 16 • 340 • 2
baa-ai/Gemma-4-26B-A4B-it-RAM-20GB-MLX

7B • Updated Apr 16 • 344 • 4
baa-ai/Gemma-4-26B-A4B-it-RAM-14GB-MLX

5B • Updated Apr 16 • 858 • 6
baa-ai/Gemma-4-26B-A4B-it-RAM-18GB-MLX

6B • Updated Apr 16 • 111 • 1

MiniMax M2.5

MINT & SWAN quantized versions of MiniMax-M2.5 (MLX & GGUF)

baa-ai/MiniMax-M2.5-RAM-120GB-MLX

229B • Updated Apr 16 • 65

Llama 3

SWAN quantized versions of Llama 3.1 and 3.3 70B Instruct (MLX)

baa-ai/Llama-3.3-70B-Instruct-RAM-50GB-MLX

71B • Updated Apr 16 • 365 • 1
baa-ai/Llama-3.1-70B-Instruct-SWAN-5bit-MLX

71B • Updated Apr 16 • 163

Qwen3

MINT & SWAN quantized versions of Qwen3 models (MLX)

baa-ai/Qwen3-30B-A3B-RAM-20GB-MLX

31B • Updated Apr 16 • 63
baa-ai/Qwen3-8B-SWAN-6bit-MLX

8B • Updated Apr 16 • 10 • 1

Qwen3.5-122B-A10B

MINT quantized versions of Qwen3.5-122B-A10B at multiple budget targets (MLX & GGUF)

baa-ai/Qwen3.5-122B-A10B-RAM-48GB-MLX

15B • Updated Apr 16 • 1.2k • 7
baa-ai/Qwen3.5-122B-A10B-RAM-60GB-MLX

17B • Updated Apr 16 • 171 • 1
baa-ai/Qwen3.5-122B-A10B-RAM-100GB-MLX

28B • Updated Apr 16 • 93
baa-ai/Qwen3.5-122B-A10B-RAM-140GB-MLX

37B • Updated Apr 16 • 703

Qwen3.6-27B

Mixed-precision MLX builds of Qwen/Qwen3.6-27B at the predicted local and global operating points.

baa-ai/Qwen3.6-27B-RAM-16GB-MLX

Text Generation • 27B • Updated Apr 27 • 1.39k • 5
baa-ai/Qwen3.6-27B-RAM-28GB-MLX

Text Generation • 27B • Updated Apr 27 • 507 • 2

Qwen3.6-35B-A3B

Mixed-precision MLX builds of Qwen/Qwen3.6-35B-A3B for Apple Silicon. Size points: 19 GB, 25 GB.

baa-ai/Qwen3.6-35B-A3B-RAM-19GB-MLX

Text Generation • 35B • Updated Apr 17 • 108
baa-ai/Qwen3.6-35B-A3B-RAM-25GB-MLX

Text Generation • 35B • Updated Apr 17 • 404 • 3
baa-ai/Qwen3.6-35B-A3B-RAM-26GB-GGUF

Text Generation • 35B • Updated Apr 20 • 111

MiniMax-M2.7 (MLX)

RAM-quantized versions of MiniMaxAI/MiniMax-M2.7 for Apple Silicon. Size points: 91 GB, 100 GB, 111 GB, 116 GB, 120 GB.

baa-ai/MiniMax-M2.7-RAM-100GB-MLX

Text Generation • 229B • Updated Apr 15 • 611 • 5
baa-ai/MiniMax-M2.7-RAM-120GB-MLX

Text Generation • 229B • Updated Apr 15 • 304 • 3
baa-ai/MiniMax-M2.7-RAM-116GB-MLX

Text Generation • 229B • Updated Apr 15 • 129 • 2
baa-ai/MiniMax-M2.7-RAM-111GB-MLX

Text Generation • 229B • Updated Apr 15 • 80 • 1

Nemotron 3 Super

MINT quantized Nemotron-3-Super-120B — hybrid Mamba-MoE-Attention (MLX & GGUF)

baa-ai/Nemotron-3-Super-120B-A12B-MINT-MLX

121B • Updated Apr 16 • 260 • 1
baa-ai/Nemotron-3-Super-120B-A12B-MINT-GGUF

121B • Updated Apr 16 • 73 • 1

GLM

Quantized versions of GLM models by baa.ai.

baa-ai/GLM-5-SWAN-5bit-MLX

744B • Updated Apr 16 • 71
baa-ai/GLM-4.7-Flash-RAM-20GB-MLX

30B • Updated Apr 16 • 138
baa-ai/GLM-5.1-RAM-270GB-MLX

744B • Updated Apr 16 • 510 • 2
baa-ai/GLM-5.1-RAM-420GB-MLX

744B • Updated Apr 16 • 1.51k • 5

Llama 4

MINT & SWAN quantized versions of Llama 4 Scout and Maverick (MLX & GGUF)

baa-ai/Llama-4-Scout-17B-16E-Instruct-RAM-60GB-MLX

18B • Updated Apr 16 • 711
baa-ai/Llama-4-Maverick-17B-128E-Instruct-RAM-170GB-MLX

52B • Updated Apr 16 • 630

Qwen3.5-35B-A3B

Mixed-precision MLX builds of Qwen/Qwen3.5-35B-A3B for Apple Silicon, quantized by baa.ai. Size points: 12.5-21, 25, 29, 31 GB.

baa-ai/Qwen3.5-35B-A3B-RAM-25GB-MLX

8B • Updated Apr 16 • 35
baa-ai/Qwen3.5-35B-A3B-RAM-29GB-MLX

9B • Updated Apr 16 • 19
baa-ai/Qwen3.5-35B-A3B-RAM-31GB-MLX

9B • Updated Apr 16 • 21
baa-ai/Qwen3.5-35B-A3B-RAM-14GB-MLX

Text Generation • 35B • Updated 16 days ago • 304

Qwen3.5-397B-A17B

MINT & SWAN quantized versions of Qwen3.5-397B-A17B (MLX & GGUF)

baa-ai/Qwen3.5-397B-A17B-RAM-220GB-MLX

61B • Updated Apr 16 • 77

Team members 1
