โ๏ธ PPO Agent on SnowballTarget
This repository contains a trained Proximal Policy Optimization (PPO) agent that plays the SnowballTarget environment using the Unity ML-Agents Library.
๐ Model Card
Model Name: ppo-SnowballTarget
Environment: SnowballTarget (Unity ML-Agents)
Algorithm: PPO (Proximal Policy Optimization)
Performance Metric:
- Achieves stable performance in target-hitting tasks
- Demonstrates convergence to an effective policy
๐ Usage (with ML-Agents)
Documentation: ML-Agents Toolkit Docs
mlagents-learn <your_configuration_file_path.yaml> --run-id=<run_id> --resume
# Example: loading the trained PPO model
# (requires Unity ML-Agents setup)
model_id = "KraTUZen/ppo-SnowballTarget"
# Select your .nn or .onnx file from the repo
๐ง Notes
- The agent is trained using PPO, a robust on-policy algorithm widely used in Unity ML-Agents.
- The environment involves throwing snowballs at targets, requiring precision and timing.
- The trained model is stored as
.nnor.onnxfiles for direct Unity integration.
๐ Repository Structure
SnowballTarget.nn/SnowballTarget.onnxโ Trained PPO policyREADME.mdโ Documentation and usage guide
โ Results
- The agent learns to consistently hit targets with snowballs.
- Demonstrates stable training and effective policy convergence using PPO.
๐ Environment Overview
- Observation Space: Continuous (agent position, target position, environment state)
- Action Space: Continuous (throwing angle, force)
- Objective: Maximize hits on targets with snowballs
- Reward: Positive reward for successful hits, penalties for misses
๐ Learning Highlights
- Algorithm: PPO (Proximal Policy Optimization)
- Update Rule: Clipped surrogate objective to ensure stable updates
- Strengths: Robust, stable, widely used in Unity ML-Agents
- Limitations: Requires careful tuning of hyperparameters (clip ratio, learning rate, batch size)
๐ฎ Watch Your Agent Play
You can watch your agent directly in your browser:
- Visit Unity ML-Agents on Hugging Face
- Find your model ID:
KraTUZen/ppo-SnowballTarget - Select your
.nnor.onnxfile - Click Watch the agent play ๐
- Downloads last month
- 41
Evaluation results
- mean_reward on SnowballTargetself-reported3.270
- std_reward on SnowballTargetself-reported1.750