Hugging Face
Liv d'Aliberti
od2961
0 followers · 1 following
https://liv-daliberti.github.io/
liv-daliberti
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago: The Illusion of Insight in Reasoning Models
upvoted a paper 2 days ago: The Illusion of Insight in Reasoning Models
updated a dataset 14 days ago: od2961/illusion-of-reasoning-main-traces
Organizations
od2961's models (44)
od2961/Qwen2.5-1.5B-Open-R1-GRPO-math-2k • 2B • Updated Dec 13, 2025 • 55
od2961/Qwen2.5-7B-Open-R1-MaxEnt-GRPO-math-2k • 333k • Updated Dec 9, 2025
od2961/Qwen2.5-7B-Open-R1-GRPO-math-2k • 333k • Updated Dec 7, 2025
od2961/Qwen2.5-1.5B-Open-R1-MaxEnt-GRPO-math-v1 • 2B • Updated Nov 26, 2025
od2961/Qwen2.5-1.5B-Open-R1-MaxEnt-GRPO-BASELINE-math-v1 • 2B • Updated Nov 26, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-math-v1 • Text Generation • 2B • Updated Nov 25, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-math-v1-grpoonly • Updated Nov 25, 2025
od2961/Qwen2.5-1.5B-OpenR1-GRPO-GUN • 2B • Updated Oct 31, 2025
od2961/Qwen2.5-1.5B-OpenR1-GRAIL-WAGE • 2B • Updated Oct 31, 2025
od2961/Qwen2.5-1.5B-OpenR1-GRAIL-GUN • 2B • Updated Oct 31, 2025
od2961/Qwen2.5-1.5B-OpenR1-GRPO • 2B • Updated Oct 31, 2025
od2961/Qwen2.5-1.5B-OpenR1-GRAIL • Text Generation • 2B • Updated Oct 18, 2025 • 138
od2961/Llama-8B-Open-R1-GRPO-math-v2 • 8B • Updated Oct 6, 2025 • 1
od2961/Llama-8B-Open-R1-GRPO-math-v1 • Updated Oct 3, 2025
od2961/Qwen2.5-7B-Open-R1-GRPO-math-7b • Text Generation • 8B • Updated Oct 3, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v03 • 2B • Updated Sep 23, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v04 • Updated Sep 22, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v02 • 2B • Updated Sep 21, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v01 • 2B • Updated Sep 20, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-carpark-v1 • Updated Sep 11, 2025
od2961/Qwen2.5-1.5B-OpenR1-no-GRAIL • 2B • Updated Aug 28, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-math-v2 • 2B • Updated Aug 19, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v1 • 2B • Updated Aug 13, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v11 • Updated Aug 12, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v10 • Updated Aug 9, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v9 • Updated Aug 9, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v7 • 2B • Updated Aug 8, 2025 • 74
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v8 • 2B • Updated Aug 8, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v6 • 2B • Updated Aug 7, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v5 • 2B • Updated Aug 5, 2025