AI & ML interests
None yet
Organizations
None yet
models 10
dominicpeel/qwen2-7b-instruct-trl-sft-ChartQA
Updated
dominicpeel/a2c-PandaReachDense-v3
Reinforcement Learning
• Updated • 2
dominicpeel/ppo-PyramidsRND
Reinforcement Learning
• Updated dominicpeel/ppo-SnowballTarget
Reinforcement Learning
• Updated • 1
dominicpeel/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
• Updated dominicpeel/Reinforce-CartPole-v1
Reinforcement Learning
• Updated dominicpeel/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated • 2
Reinforcement Learning
• Updated dominicpeel/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated dominicpeel/ppo-LunarLander-v2
Reinforcement Learning
• Updated • 2