Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Julian Kindel's picture
4

Julian Kindel

JulianKindel
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
upvoted a paper 2 months ago
ProAct: Agentic Lookahead in Interactive Environments
upvoted a paper 2 months ago
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
View all activity

Organizations

FAU-LM's profile picture

upvoted a paper 23 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220
upvoted 2 papers 2 months ago

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 27

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Paper • 2602.02488 • Published Feb 2 • 36
upvoted an article 3 months ago
view article
Article

Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments

Jan 20
•
12
updated a model 7 months ago

JulianKindel/grpo_outputs

Updated Sep 20, 2025
published 2 models 7 months ago

JulianKindel/grpo_outputs

Updated Sep 20, 2025

JulianKindel/Qwen2.5-VL-3B-Instruct-Thinking

Updated Sep 16, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs