Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Inference Optimization
community
Activity Feed
Follow
20
AI & ML interests
None defined yet.
Recent Activity
krishnateja95
updated
a model
about 5 hours ago
inference-optimization/Qwen3-30B-A3B_6.0_bits_mode_heuristic
krishnateja95
published
a model
about 5 hours ago
inference-optimization/Qwen3-30B-A3B_6.0_bits_mode_heuristic
krishnateja95
updated
a model
about 6 hours ago
inference-optimization/Qwen3-30B-A3B_6.0_bits_mode_noise
View all activity
Team members
15
inference-optimization
's models
226
Sort: Recently updated
inference-optimization/gpt-oss-20b-FP8-Dynamic
21B
•
Updated
27 days ago
•
13
inference-optimization/test_qwen3_next_mtp
Updated
28 days ago
•
41
inference-optimization/test_tencentbac_fastmtp
Updated
28 days ago
•
38
inference-optimization/Qwen3-30B-A3B-Instruct-2507-NVFP4
17B
•
Updated
29 days ago
•
55
inference-optimization/Qwen3-30B-A3B-Instruct-2507-FP8-Dynamic
31B
•
Updated
29 days ago
•
55
inference-optimization/Qwen3-30B-A3B-Instruct-2507-FP8-Block
31B
•
Updated
29 days ago
•
48
inference-optimization/Qwen3-Coder-Next.w4a16-old
Text Generation
•
12B
•
Updated
Feb 26
•
18
inference-optimization/Kimi-K2-Instruct-0905-BF16-NVFP4
Updated
Feb 24
•
1
inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4
Updated
Feb 4
•
5
inference-optimization/Ministral-3-14B-Instruct-2512.w8a8
Updated
Feb 4
inference-optimization/Ministral-3-14B-Instruct-2512.w4a16
Updated
Feb 3
inference-optimization/Meta-Llama-3-8B-Instruct-NVFP4-GPTQ-Quant
5B
•
Updated
Jan 29
•
1
inference-optimization/Meta-Llama-3-8B-Instruct-NVFP4-GPTQ-MSE
5B
•
Updated
Jan 29
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_6.5-bits
7B
•
Updated
Jan 26
•
3
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_6.25-bits
6B
•
Updated
Jan 26
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_6.0-bits
6B
•
Updated
Jan 26
•
1
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_5.75-bits
6B
•
Updated
Jan 26
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_5.5-bits
6B
•
Updated
Jan 26
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_5.25-bits
6B
•
Updated
Jan 26
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_5.0-bits
5B
•
Updated
Jan 26
•
2
inference-optimization/DeepSeek-V3-debug-multiply-FP8_DYNAMIC
1B
•
Updated
Jan 24
•
1
inference-optimization/DeepSeek-V3-debug-add-FP8_DYNAMIC
1B
•
Updated
Jan 24
•
1
inference-optimization/DeepSeek-V3-debug-empty-FP8_DYNAMIC
1B
•
Updated
Jan 23
•
352
inference-optimization/DeepSeek-V3-debug-multiply-NVFP4A16
0.9B
•
Updated
Jan 23
inference-optimization/DeepSeek-V3-debug-add-NVFP4A16
0.9B
•
Updated
Jan 23
•
5
inference-optimization/DeepSeek-V3-debug-empty-NVFP4A16
0.9B
•
Updated
Jan 23
•
102
inference-optimization/DeepSeek-V3-debug-add
1B
•
Updated
Jan 23
•
5
inference-optimization/DeepSeek-V3-debug-multiply
1B
•
Updated
Jan 23
•
3
inference-optimization/Qwen3-0.6B-debug-add-FP8_BLOCK
0.6B
•
Updated
Jan 23
inference-optimization/Qwen3-0.6B-debug-multiply-FP8_BLOCK
0.6B
•
Updated
Jan 23
Previous
1
...
4
5
6
7
8
Next