Rajesh Mehta

since-2010

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

liked a model 3 days ago

MohammadGholizadeh/Phi-3.5-mini-instruct-owl-gen5-s43

liked a model 5 days ago

bhargavi-2005/pcos-model

Organizations

None yet

upvoted a paper 1 day ago

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

Paper • 2605.29156 • Published 9 days ago • 13

upvoted a paper 8 days ago

Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the Wild

Paper • 2605.24213 • Published 14 days ago • 12

upvoted a paper 15 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 24 days ago • 195

upvoted 2 papers 25 days ago

IntentGrasp: A Comprehensive Benchmark for Intent Understanding

Paper • 2605.06832 • Published 29 days ago • 8

Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 29 days ago • 233

upvoted 2 papers about 2 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 506

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 115

upvoted a paper 2 months ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

upvoted 3 papers 3 months ago

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published Mar 19 • 95

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 312

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

upvoted 6 papers 4 months ago

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 245

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 210

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control

Paper • 2602.09070 • Published Feb 9 • 46

The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies

Paper • 2602.09877 • Published Feb 10 • 197

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 154