Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
427
20
396
John Leimgruber III
PRO
ubergarm
Follow
performanceoptician's profile picture
NikolayKozloff's profile picture
fdaler's profile picture
400 followers
·
62 following
https://blog.aifoundry.org/p/adventures-in-model-quantization
ubergarm
john-leimgruber
AI & ML interests
Open LLMs and Astrophotography image processing.
Recent Activity
new
activity
about 7 hours ago
ubergarm/Kimi-K2.6-GGUF:
Q4_X on Blackwell + EPYC + DDR 5
new
activity
about 7 hours ago
ubergarm/Kimi-K2.6-GGUF:
No think tags.
new
activity
about 7 hours ago
ubergarm/Kimi-K2.6-GGUF:
The Best Kimi Quant!
View all activity
Organizations
ubergarm
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
ubergarm/Kimi-K2.6-GGUF
about 7 hours ago
Q4_X on Blackwell + EPYC + DDR 5
👍
🔥
4
5
#3 opened about 14 hours ago by
sousekd
No think tags.
5
#4 opened about 8 hours ago by
DrRos
The Best Kimi Quant!
🔥
4
9
#1 opened 1 day ago by
anikifoss
New activity in
ubergarm/Qwen3.5-122B-A10B-GGUF
about 8 hours ago
How to split this model between 2 (3) GPUs and CPU/RAM ?
25
#12 opened about 1 month ago by
mancub
liked
a model
about 8 hours ago
Qwen/Qwen3.6-35B-A3B
Image-Text-to-Text
•
36B
•
Updated
7 days ago
•
458k
•
1.15k
New activity in
unsloth/Kimi-K2.6-GGUF
about 9 hours ago
Q4_0 vs native INT4 QAT fidelity
👍
1
2
#4 opened about 11 hours ago by
SpacetimeAI
New activity in
unsloth/Kimi-K2.6-GGUF
about 14 hours ago
What kind of Q4_0 are you using for ffn_(gate|up|down)_exps?
🔥
👍
2
#2 opened about 14 hours ago by
ubergarm
New activity in
ubergarm/Kimi-K2.6-GGUF
about 16 hours ago
Update README.md
1
#2 opened about 21 hours ago by
jpsequeira
liked
a model
1 day ago
AesSedai/Kimi-K2.6-GGUF
1T
•
Updated
40 minutes ago
•
14
•
9
updated
a model
1 day ago
ubergarm/Kimi-K2.6-GGUF
Text Generation
•
1T
•
Updated
about 15 hours ago
•
492
•
26
published
a model
1 day ago
ubergarm/Kimi-K2.6-GGUF
Text Generation
•
1T
•
Updated
about 15 hours ago
•
492
•
26
New activity in
AesSedai/Qwen3.6-35B-A3B-GGUF
1 day ago
Imatrix and Mmprojs degrade quality
8
#2 opened 5 days ago by
Trilogix1
New activity in
ubergarm/MiniMax-M2.5-GGUF
1 day ago
ik_llama.cpp version
17
#11 opened 2 months ago by
geveent
New activity in
ubergarm/Step-3.5-Flash-GGUF
1 day ago
Comparison with APEX quants
1
#14 opened 2 days ago by
FBykov
New activity in
ubergarm/MiniMax-M2.7-GGUF
1 day ago
Testing IQ5_K
❤️
1
10
#8 opened 8 days ago by
shewin
liked
a model
1 day ago
moonshotai/Kimi-K2.6
Image-Text-to-Text
•
1.1T
•
Updated
1 day ago
•
8.24k
•
•
731
New activity in
ubergarm/GLM-5.1-GGUF
1 day ago
render_message_to_json: Neither string content nor typed content is supported by the template. This is unexpected and may lead to issues.
4
#7 opened 6 days ago by
whoisjeremylam
New activity in
ubergarm/MiniMax-M2.7-GGUF
5 days ago
Testing Q5 flavors (ubergarm / aessedai / unsloth) for "speed" on 8x RTX 3090
🔥
1
1
#10 opened 6 days ago by
dehnhaide
4 bpw suggestions
👀
1
18
#1 opened 10 days ago by
ndroidph
New activity in
ubergarm/GLM-5.1-GGUF
5 days ago
Testing smol-IQ4_K
8
#4 opened 12 days ago by
shewin
Load more