John Leimgruber III PRO

ubergarm

https://blog.aifoundry.org/p/adventures-in-model-quantization

AI & ML interests

Open LLMs and Astrophotography image processing.

Recent Activity

new activity about 7 hours ago

ubergarm/Kimi-K2.6-GGUF:Q4_X on Blackwell + EPYC + DDR 5

new activity about 7 hours ago

ubergarm/Kimi-K2.6-GGUF:No think tags.

new activity about 7 hours ago

ubergarm/Kimi-K2.6-GGUF:The Best Kimi Quant!

View all activity

Organizations

New activity in ubergarm/Kimi-K2.6-GGUF about 7 hours ago

Q4_X on Blackwell + EPYC + DDR 5

👍🔥 4

#3 opened about 14 hours ago by

sousekd

No think tags.

#4 opened about 8 hours ago by

DrRos

The Best Kimi Quant!

🔥 4

#1 opened 1 day ago by

anikifoss

New activity in ubergarm/Qwen3.5-122B-A10B-GGUF about 8 hours ago

How to split this model between 2 (3) GPUs and CPU/RAM ?

#12 opened about 1 month ago by

mancub

liked a model about 8 hours ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated 7 days ago • 458k • 1.15k

New activity in unsloth/Kimi-K2.6-GGUF about 9 hours ago

Q4_0 vs native INT4 QAT fidelity

👍 1

#4 opened about 11 hours ago by

SpacetimeAI

New activity in unsloth/Kimi-K2.6-GGUF about 14 hours ago

What kind of Q4_0 are you using for ffn_(gate|up|down)_exps?

🔥👍 2

#2 opened about 14 hours ago by

ubergarm

New activity in ubergarm/Kimi-K2.6-GGUF about 16 hours ago

Update README.md

#2 opened about 21 hours ago by

jpsequeira

liked a model 1 day ago

AesSedai/Kimi-K2.6-GGUF

1T • Updated 40 minutes ago • 14 • 9

updated a model 1 day ago

ubergarm/Kimi-K2.6-GGUF

Text Generation • 1T • Updated about 15 hours ago • 492 • 26

published a model 1 day ago

ubergarm/Kimi-K2.6-GGUF

Text Generation • 1T • Updated about 15 hours ago • 492 • 26

New activity in AesSedai/Qwen3.6-35B-A3B-GGUF 1 day ago

Imatrix and Mmprojs degrade quality

#2 opened 5 days ago by

Trilogix1

New activity in ubergarm/MiniMax-M2.5-GGUF 1 day ago

ik_llama.cpp version

#11 opened 2 months ago by

geveent

New activity in ubergarm/Step-3.5-Flash-GGUF 1 day ago

Comparison with APEX quants

#14 opened 2 days ago by

FBykov

New activity in ubergarm/MiniMax-M2.7-GGUF 1 day ago

Testing IQ5_K

❤️ 1

#8 opened 8 days ago by

shewin

liked a model 1 day ago

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated 1 day ago • 8.24k • • 731

New activity in ubergarm/GLM-5.1-GGUF 1 day ago

render_message_to_json: Neither string content nor typed content is supported by the template. This is unexpected and may lead to issues.

#7 opened 6 days ago by

whoisjeremylam

New activity in ubergarm/MiniMax-M2.7-GGUF 5 days ago

Testing Q5 flavors (ubergarm / aessedai / unsloth) for "speed" on 8x RTX 3090

🔥 1

#10 opened 6 days ago by

dehnhaide

4 bpw suggestions

👀 1

#1 opened 10 days ago by

ndroidph

New activity in ubergarm/GLM-5.1-GGUF 5 days ago

Testing smol-IQ4_K

#4 opened 12 days ago by

shewin

John Leimgruber III PRO

AI & ML interests

Recent Activity

Organizations

ubergarm's activity

Q4_X on Blackwell + EPYC + DDR 5

No think tags.

The Best Kimi Quant!

How to split this model between 2 (3) GPUs and CPU/RAM ?

Q4_0 vs native INT4 QAT fidelity

What kind of Q4_0 are you using for ffn_(gate|up|down)_exps?

Update README.md

Imatrix and Mmprojs degrade quality

ik_llama.cpp version

Comparison with APEX quants

Testing IQ5_K

render_message_to_json: Neither string content nor typed content is supported by the template. This is unexpected and may lead to issues.

Testing Q5 flavors (ubergarm / aessedai / unsloth) for "speed" on 8x RTX 3090

4 bpw suggestions

Testing smol-IQ4_K