AI & ML interests
None yet
Organizations
None yet
seangogo/summary_from_human_feedback_grpo_100
Feature Extraction
• 0.5B • Updated
seangogo/dpo_summary_from_human_feedback
Feature Extraction
• 0.5B • Updated • 1
seangogo/Qwen2.5-1.5B_reward_model_v2_normalized
Feature Extraction
• 2B • Updated
seangogo/Qwen2.5-1.5B_reward_model_v2
Feature Extraction
• 2B • Updated
seangogo/Qwen2.5-1.5B_reward_model
seangogo/poca-SoccerTwos-v2
Reinforcement Learning
• Updated • 1
seangogo/ppo-SnowballTarget-real
Reinforcement Learning
• Updated • 1
seangogo/ppo-SnowballTarget
Reinforcement Learning
• Updated • 6
seangogo/a2c-PandaReachDense-v3
Reinforcement Learning
• Updated • 4
seangogo/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
• Updated
seangogo/ppo-CartPole-v1-ppo-from-scratch
Reinforcement Learning
• Updated
seangogo/Reinforce-PixelCopter-v2
Reinforcement Learning
• Updated
seangogo/Reinforce-PixelCopter
Reinforcement Learning
• Updated
seangogo/Reinforce-CartPole-v1
Reinforcement Learning
• Updated
seangogo/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated • 10
Reinforcement Learning
• Updated
seangogo/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated
seangogo/ppo-LunarLander-v2
Reinforcement Learning
• Updated • 5