Upload PPO drone policy + logs
- README.md +46 -0
- best/best_model.zip +3 -0
- eval_logs/evaluations.npz +3 -0
- policy.pth +3 -0
- ppo_policy.zip +3 -0
- safe_policy_meta.json +1 -0
- tb_logs/PPO_1/events.out.tfevents.1756478967.tantrum.1444109.0 +3 -0
- tb_logs/PPO_2/events.out.tfevents.1756479142.tantrum.1445250.0 +3 -0
- tb_logs/PPO_3/events.out.tfevents.1756479433.tantrum.1447230.0 +3 -0
- tb_logs/PPO_4/events.out.tfevents.1756480670.tantrum.1454850.0 +3 -0
README.md
ADDED
@@ -0,0 +1,46 @@
+---
+tags:
+- ppo
+- reinforcement-learning
+- swarm
+- drone
+- bittensor
+license: mit
+language:
+- en
+library_name: stable-baselines3
+---
+
+# Swarm PPO Drone
+
+This repository contains a **Proximal Policy Optimization (PPO)** model trained for **swarm/drone control**.
+The model was trained using **Gymnasium environments** with Stable-Baselines3 and exported for use in **Bittensor Subnet 124 (Swarm)**.
+
+---
+
+## Files
+
+- `policy.pth`: Trained PPO policy weights (PyTorch).
+- `ppo_policy.zip`: Stable-Baselines3 PPO saved model (reload with `PPO.load()`).
+- `safe_policy_meta.json`: Policy architecture metadata (activation function, network layer sizes, `use_sde`) used for policy compliance.
+- `best/`: Best model checkpoint saved during training.
+- `eval_logs/`: Evaluation logs.
+- `tb_logs/`: TensorBoard logs.
+
+---
+
+## Usage
+
+### Load with Stable-Baselines3
+```python
+from stable_baselines3 import PPO
+import gymnasium as gym
+
+# Load model
+model = PPO.load("ppo_policy.zip")
+
+# Example run (CartPole-v1 is a placeholder; substitute the environment the policy was trained on)
+env = gym.make("CartPole-v1")
+obs, _ = env.reset()
+action, _ = model.predict(obs)
+print("Predicted action:", action)
best/best_model.zip
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bd4d36fbc90382f9c3c7600bacf2d1c59a4bde48eeaa8c02f465a54016a5a402
+size 453110
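The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object ID, and the size in bytes. As a minimal sketch, the snippet below parses the pointer for `best/best_model.zip` and shows how a downloaded file could be checked against it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of any library.

```python
import hashlib
from pathlib import Path

# Pointer text for best/best_model.zip, copied from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:bd4d36fbc90382f9c3c7600bacf2d1c59a4bde48eeaa8c02f465a54016a5a402
size 453110
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict (hypothetical helper)."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Check a downloaded file against the pointer's size and SHA-256 digest."""
    data = path.read_bytes()
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer(Path("best/best_model.zip"), pointer)` after fetching the real object would confirm that the download is complete and unmodified; the path is only an example.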
eval_logs/evaluations.npz
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1c543d076208115a61a60d2545fb34d60fa4d7a4b5fb6850fd5fda15f7af0982
+size 1210
policy.pth
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:00e25942efd5a414191650af665a6a0abd9d9380a0262c7b679373fd893a2727
+size 41325
ppo_policy.zip
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9baf2c2bf777d356aaa2302a47a3354e595e1a831485901d5dc2da6d6f3393bb
+size 36137
safe_policy_meta.json
ADDED
@@ -0,0 +1 @@
+{"activation_fn":"relu","net_arch":{"pi":[64,64],"vf":[64,64]},"use_sde":false}
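The metadata file records the policy architecture in three keys. As a minimal sketch, the snippet below parses that JSON and summarizes the layer sizes of each network head. The helper name `summarize_policy_meta` is hypothetical, and what the subnet's compliance check actually requires is not stated in this commit, so no validation rules are assumed here.

```python
import json

# Contents of safe_policy_meta.json, copied from this commit.
META_JSON = '{"activation_fn":"relu","net_arch":{"pi":[64,64],"vf":[64,64]},"use_sde":false}'


def summarize_policy_meta(raw: str) -> dict:
    """Parse the metadata and return a small summary (hypothetical helper)."""
    meta = json.loads(raw)
    net_arch = meta["net_arch"]
    return {
        "activation_fn": meta["activation_fn"],
        "actor_layers": list(net_arch["pi"]),   # hidden layer sizes of the policy head
        "critic_layers": list(net_arch["vf"]),  # hidden layer sizes of the value head
        "use_sde": bool(meta["use_sde"]),
    }


summary = summarize_policy_meta(META_JSON)
print(summary)
```

The keys `net_arch` and `activation_fn` match the names of Stable-Baselines3 `policy_kwargs` entries, and `use_sde` matches a `PPO` constructor argument. Mapping the string `"relu"` to an activation class is left out of this sketch.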
tb_logs/PPO_1/events.out.tfevents.1756478967.tantrum.1444109.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5c4dc0238a2156b584741687919dd50d86e3af016b6f2dc85a1e04b893b6fedf
+size 251
tb_logs/PPO_2/events.out.tfevents.1756479142.tantrum.1445250.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c4a73832683ced8e630acf21120f55a9438eff4a2cebe754e5bdfb7048f7681e
+size 11620
tb_logs/PPO_3/events.out.tfevents.1756479433.tantrum.1447230.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:398c9b0ae6a88ecab8a3b52c9fef8ffaba3ebcd27b0b82e470b6d51f234e3c9e
+size 6682
tb_logs/PPO_4/events.out.tfevents.1756480670.tantrum.1454850.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6804c9d13b2a4137ab43c026568f7e991f9b0482e005aa1050a51bbc113f1871
+size 20118
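The four TensorBoard event files carry run information in their names. As a rough sketch, the snippet below splits each name into a start time, hostname, process ID, and counter. The layout `events.out.tfevents.<unix time>.<hostname>.<pid>.<counter>` is an assumption based on how TensorBoard writers commonly name event files, and `parse_event_name` is a hypothetical helper.

```python
from datetime import datetime, timezone

# File names copied from the tb_logs/ entries in this commit.
EVENT_FILES = [
    "events.out.tfevents.1756478967.tantrum.1444109.0",
    "events.out.tfevents.1756479142.tantrum.1445250.0",
    "events.out.tfevents.1756479433.tantrum.1447230.0",
    "events.out.tfevents.1756480670.tantrum.1454850.0",
]

PREFIX = "events.out.tfevents."


def parse_event_name(name: str) -> dict:
    """Assumed layout: <prefix><unix time>.<hostname>.<pid>.<counter>."""
    rest = name[len(PREFIX):]
    timestamp, remainder = rest.split(".", 1)
    # Split from the right because a hostname may itself contain dots.
    hostname, pid, counter = remainder.rsplit(".", 2)
    return {
        "started": datetime.fromtimestamp(int(timestamp), tz=timezone.utc),
        "hostname": hostname,
        "pid": int(pid),
        "counter": int(counter),
    }


runs = [parse_event_name(name) for name in EVENT_FILES]
for run in runs:
    print(run["started"].isoformat(), run["hostname"], run["pid"])
```

Under that assumption, the gaps between the parsed start times show how far apart the `PPO_1` to `PPO_4` runs were launched.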