Julian Kindel's picture

4

Julian Kindel

JulianKindel

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

upvoted a paper 2 months ago

ProAct: Agentic Lookahead in Interactive Environments

upvoted a paper 2 months ago

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

View all activity

Organizations

upvoted a paper 23 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

upvoted 2 papers 2 months ago

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 27

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Paper • 2602.02488 • Published Feb 2 • 36

upvoted an article 3 months ago

Article

Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments

Jan 20

•

12

updated a model 7 months ago

JulianKindel/grpo_outputs

Updated Sep 20, 2025

published 2 models 7 months ago

JulianKindel/grpo_outputs

Updated Sep 20, 2025

JulianKindel/Qwen2.5-VL-3B-Instruct-Thinking

Updated Sep 16, 2025