Lutalica
Lutalica
AI & ML interests
Computer vision, Image Processing
Recent Activity
commented on
a paper
19 days ago
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use upvoted a paper 19 days ago
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use commented on
a paper
5 months ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy
Gradient