5 1

Aboneda

Abdelrahma

AbdelrahmanAbounida

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

upvoted a paper about 2 months ago

Advancing Open-source World Models

updated a model 7 months ago

Abdelrahma/ppo-LunarLander-v2

View all activity

Organizations

None yet

upvoted 2 papers about 2 months ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 171

Advancing Open-source World Models

Paper • 2601.20540 • Published Jan 28 • 132

updated a model 7 months ago

Abdelrahma/ppo-LunarLander-v2

Reinforcement Learning • Updated Aug 13, 2025 • 2

published a model 7 months ago

Abdelrahma/ppo-LunarLander-v2

Reinforcement Learning • Updated Aug 13, 2025 • 2

upvoted a paper 7 months ago

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do

Paper • 2409.11239 • Published Sep 17, 2024 • 3

updated a model 9 months ago

Abdelrahma/outputs

Updated Jul 7, 2025

published a model 9 months ago

Abdelrahma/outputs

Updated Jul 7, 2025

updated a model 9 months ago

Abdelrahma/Llama-3.2-1B-ladder2

Updated Jul 5, 2025

published 3 models 9 months ago

upvoted an article 9 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

280

updated a model 11 months ago

Abdelrahma/trainer_output

0.5B • Updated May 11, 2025

published a model 11 months ago

Abdelrahma/trainer_output

0.5B • Updated May 11, 2025

updated 3 models 11 months ago

Abdelrahma/lig_tiny_llama_1b

Updated May 7, 2025

Abdelrahma/lig_qwen_0.5b

Updated May 7, 2025

Abdelrahma/lig_qwen_7b

Updated May 7, 2025

published 3 models 11 months ago

Abdelrahma/lig_tiny_llama_1b

Updated May 7, 2025

Abdelrahma/lig_qwen_0.5b

Updated May 7, 2025

Abdelrahma/lig_qwen_7b

Updated May 7, 2025

Aboneda

AI & ML interests

Recent Activity

Organizations

Abdelrahma's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge