arxiv:2510.01582
Building on HF
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 23 hours ago
inference-optimization/Qwen3-8B-FP8-Dynamic
published a model about 23 hours ago
inference-optimization/Qwen3-8B-FP8-Dynamic
updated a model 4 days ago
RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8