-
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140 -
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 80 -
Expect the Unexpected: FailSafe Long Context QA for Finance
Paper • 2502.06329 • Published • 133 -
Competitive Programming with Large Reasoning Models
Paper • 2502.06807 • Published • 69
Julian Wergieluk
jwergieluk
·
AI & ML interests
machine learning, mathematics, optimization
Recent Activity
liked a model about 1 month ago
google/gemma-4-31B-it liked a model 4 months ago
zai-org/GLM-4.7-Flash liked a model 4 months ago
nvidia/Nemotron-Orchestrator-8B