Udbhav Bamba

udbhavbamba

Recent Activity

authored a paper 5 days ago

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

upvoted an article 22 days ago

Your MoE Model Does Not Have to Select Fixed Number of Experts

authored a paper about 1 month ago

S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations

Organizations

authored a paper 5 days ago

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

Paper • 2603.06621 • Published Feb 20

authored 3 papers about 1 month ago

S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations

Paper • 2602.14432 • Published Feb 16

XRPO: Pushing the limits of GRPO with Targeted Exploration and Exploitation

Paper • 2510.06672 • Published Oct 8, 2025

CRoPS: A Training-Free Hallucination Mitigation Framework for Vision-Language Models

Paper • 2601.00659 • Published Jan 2