Jeongjae Park

jjp97

AI & ML interests

I’m interested in the latest NLP and AI technologies, such as uncertainty, retrieval, agentic approaches, and long-context models!

Recent Activity

upvoted a paper 5 days ago

Trust Region On-Policy Distillation

liked a model 20 days ago

zai-org/GLM-4.7-Flash

upvoted a paper 21 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

View all activity

Organizations

upvoted a paper 5 days ago

Trust Region On-Policy Distillation

Paper • 2606.01249 • Published 9 days ago • 42

upvoted a paper 21 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 22 days ago • 30

upvoted a paper 29 days ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published May 7 • 37

upvoted 3 papers about 2 months ago

upvoted 3 papers 2 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 176

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published Mar 25 • 57

upvoted 6 papers 3 months ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 151

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 187

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 154

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Paper • 2603.09906 • Published Mar 10 • 76

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 54

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

upvoted 5 papers 4 months ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 154

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 29

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Paper • 2602.03442 • Published Feb 3 • 21

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 228

Jeongjae Park

AI & ML interests

Recent Activity

Organizations

jjp97's activity