Spaces:

saketh1201
/

inventory_env

Sleeping

saketh1201 commited on Apr 5

Commit

8e28c8f

verified ·

1 Parent(s): ff7be06

Upload folder using huggingface_hub

Files changed (2) hide show

README.md CHANGED Viewed

@@ -183,6 +183,34 @@ docker build -t inventory-env .
 docker run -p 8000:8000 inventory-env
 ```
 ## Step Execution Order
 Each `step()` call processes in this order:

 docker run -p 8000:8000 inventory-env
 ```
+## API Endpoints
+| Endpoint | Method | Description |
+|----------|--------|-------------|
+| `/health` | GET | Health check — returns 200 if server is running |
+| `/reset` | POST | Reset environment, returns initial observation |
+| `/step` | POST | Submit an action (JSON body), returns next observation with reward |
+| `/state` | GET | Get current episode state (day, cash, inventory) |
+| `/tasks` | GET | List all 3 tasks with full config (stock, capacity, demand ranges, events) |
+| `/grader` | POST | Score an episode given task name and agent profit |
+| `/baseline` | GET | Run LLM inference on a task and return the score |
+### Example Queries
+```bash
+# List all tasks with full schemas
+curl http://localhost:8000/tasks
+# Grade a specific profit
+curl -X POST "http://localhost:8000/grader?task_name=easy&agent_profit=5000"
+# → {"task_name":"easy","agent_profit":5000.0,"floor":2200.0,"ceiling":10011.0,"score":0.358}
+# Run baseline inference (requires API keys in container env)
+curl "http://localhost:8000/baseline"
+curl "http://localhost:8000/baseline?task_name=hard"
+# → {"task_name":"easy","score":0.822}
+```
 ## Step Execution Order
 Each `step()` call processes in this order:

server/app.py CHANGED Viewed

@@ -64,27 +64,18 @@ def baseline_endpoint(task_name: str = "easy"):
             env=env,
         )
         output = result.stdout
-        stderr = result.stderr
         # parse score from output
         score = None
-        profit = None
         for line in output.splitlines():
             if task_name + ":" in line and "profit" in line:
                 score_match = re.search(r"(\d+\.\d+)\s*\(profit", line)
-                profit_match = re.search(r"profit:\s*\$([0-9.]+)", line)
                 if score_match:
                     score = float(score_match.group(1))
-                if profit_match:
-                    profit = float(profit_match.group(1))
         return {
             "task_name": task_name,
             "score": score,
-            "profit": profit,
-            "stdout": output[-2000:] if len(output) > 2000 else output,
-            "stderr": stderr[-500:] if stderr else None,
-            "returncode": result.returncode,
         }
     except subprocess.TimeoutExpired:
         return {"error": "Inference timed out (20 min limit)"}

             env=env,
         )
         output = result.stdout
         # parse score from output
         score = None
         for line in output.splitlines():
             if task_name + ":" in line and "profit" in line:
                 score_match = re.search(r"(\d+\.\d+)\s*\(profit", line)
                 if score_match:
                     score = float(score_match.group(1))
         return {
             "task_name": task_name,
             "score": score,
         }
     except subprocess.TimeoutExpired:
         return {"error": "Inference timed out (20 min limit)"}