Aamer Mihaysi

O96a

58 3 2

https://www.mehaisi.com/

AI & ML interests

Ethical AI, NLP & Cognitive architectures

Recent Activity

commentedon a paper about 11 hours ago

Multi-Agent LLMs Fail to Explore Each Other

commentedon a paper 1 day ago

Weak-to-Strong Generalization via Direct On-Policy Distillation

commentedon a paper 2 days ago

KronQ: LLM Quantization via Kronecker-Factored Hessian

View all activity

Organizations

commented a paper about 11 hours ago

Multi-Agent LLMs Fail to Explore Each Other

Paper • 2607.11250 • Published 3 days ago • 8 •

commented a paper 1 day ago

Weak-to-Strong Generalization via Direct On-Policy Distillation

Paper • 2607.05394 • Published 8 days ago • 112 •

commented a paper 2 days ago

KronQ: LLM Quantization via Kronecker-Factored Hessian

Paper • 2607.07964 • Published 8 days ago • 23 •

commented a paper 3 days ago

Remember When It Matters: Proactive Memory Agent for Long-Horizon Agents

Paper • 2607.08716 • Published 7 days ago • 10 •

commented a paper 4 days ago

Why Can't I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition

Paper • 2601.16211 • Published 14 days ago • 50 •

commented a paper 5 days ago

A Quantized Native Runtime for On-Device Semantic Audio Generation

Paper • 2607.08526 • Published 7 days ago • 3 •

commented a paper 6 days ago

Is One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL Training

Paper • 2607.01232 • Published 14 days ago • 6 •

commented a paper 7 days ago

CanvasAgent: Enabling Complex Image Creation and Editing via Visual Tool Orchestration

Paper • 2607.05465 • Published 10 days ago • 11 •

commented a paper 8 days ago

KVpop -- Key-Value Cache Compression with Predictive Online Pruning

Paper • 2607.05061 • Published 10 days ago • 21 •

commented a paper 9 days ago

VLA-Corrector: Lightweight Detect-and-Correct Inference for Adaptive Action Horizon

Paper • 2607.01804 • Published 14 days ago • 30 •

commented a paper 11 days ago

AutoMem: Automated Learning of Memory as a Cognitive Skill

Paper • 2607.01224 • Published 15 days ago • 19 •

commented a paper 12 days ago

AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM Agents

Paper • 2607.02255 • Published 14 days ago • 62 •

commented a paper 13 days ago

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving

Paper • 2607.00466 • Published 15 days ago • 31 •

commented a paper 14 days ago

Managing Procedural Memory in LLM Agents: Control, Adaptation, and Evaluation

Paper • 2606.23127 • Published 24 days ago • 24 •

commented a paper 15 days ago

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

Paper • 2606.28733 • Published 19 days ago • 148 •

commented a paper 16 days ago

Thinking While Speaking: Inference-Time Knowledge Transfer for Responsive and Intelligent Conversational Voice Agents

Paper • 2511.07397 • Published 15 days ago • 12 •

commented a paper 17 days ago

When Does Combining Language Models Help? A Co-Failure Ceiling on Routing, Voting, and Mixture-of-Agents Across 67 Frontier Models

Paper • 2606.27288 • Published 21 days ago • 4 •

commented a paper 18 days ago

Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It

Paper • 2606.26027 • Published 22 days ago • 18 •

commented a paper 19 days ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

Paper • 2606.26300 • Published 22 days ago • 51 •

commented a paper 20 days ago

Constraint Tax in Open-Weight LLMs: An Empirical Study of Tool Calling Suppression Under Structured Output Constraints

Paper • 2606.25605 • Published 22 days ago • 3 •

Aamer Mihaysi

AI & ML interests

Recent Activity

Organizations

O96a's activity