🔄 In a Training Loop

Pratyay Banerjee

Neilblaze · https://neilblaze.live

AI & ML interests

IR, NLP, Pattern Recognition, xAI, Interpretability, Evals

Recent Activity

liked a model 3 days ago

AlexWortega/SIQ-1-35B

liked a model 3 days ago

nvidia/Kimi-K2.6-DFlash

liked a Space 3 days ago

AlexWortega/hermes-agent-zerogpu

upvoted a paper 3 days ago

Are We Ready For An Agent-Native Memory System?

Paper • 2606.24775 • Published 11 days ago • 123

upvoted an article 5 days ago

Prompt Phrasing Shifts Model Performance More Than Tier — and Which LLMs Will Get Your Jokes

Article • Wayfinder6 • 7 days ago • 2

upvoted 5 papers 13 days ago

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

Paper • 2606.06036 • Published about 1 month ago • 75

FastContext: Training Efficient Repository Explorer for Coding Agents

Paper • 2606.14066 • Published 22 days ago • 93

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Paper • 2606.11926 • Published 24 days ago • 126

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 23 days ago • 142

Looped World Models

Paper • 2606.18208 • Published 18 days ago • 476

upvoted 6 papers 15 days ago

Measuring Epistemic Resilience of LLMs Under Misleading Medical Context

Paper • 2606.12291 • Published 24 days ago • 60

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 18 days ago • 76

Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?

Paper • 2606.08063 • Published 28 days ago • 82

Redesign Mixture-of-Experts Routers with Manifold Power Iteration

Paper • 2606.12397 • Published 24 days ago • 89

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 19 days ago • 121

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

Paper • 2606.07502 • Published 29 days ago • 99

upvoted an article 22 days ago

Build Small Hackathon With Cohere Models

Article • CohereLabs • 30 days ago • 5

upvoted 6 papers 23 days ago

OpenSkill: Open-World Self-Evolution for LLM Agents

Paper • 2606.06741 • Published about 1 month ago • 29

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 26 days ago • 33

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 26 days ago • 54

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 30 days ago • 69

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

Paper • 2606.06087 • Published about 1 month ago • 66

KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

Paper • 2606.03458 • Published Jun 2 • 67