lia21 committed
Commit bae773a · 1 Parent(s): 437d478

Upload PPO drone policy + logs

README.md ADDED
@@ -0,0 +1,46 @@
+ ---
+ tags:
+ - ppo
+ - reinforcement-learning
+ - swarm
+ - drone
+ - bittensor
+ license: mit
+ language:
+ - en
+ library_name: stable-baselines3
+ ---
+
+ # 🚀 Swarm PPO Drone
+
+ This repository contains a **Proximal Policy Optimization (PPO)** model trained for **swarm/drone control**.
+ The model was trained using **Gymnasium environments** with Stable-Baselines3 and exported for use in **Bittensor Subnet 124 (Swarm)**.
+
+ ---
+
+ ## 📂 Files
+
+ - `policy.pth` – Trained PPO policy weights (PyTorch).
+ - `ppo_policy.zip` – Stable-Baselines3 PPO saved model (reload with `PPO.load()`).
+ - `safe_policy_meta.json` – Metadata for policy compliance.
+ - `best/` – Best checkpointed model during training.
+ - `eval_logs/` – Evaluation logs.
+ - `tb_logs/` – TensorBoard logs.
+
+ ---
+
+ ## 🛠️ Usage
+
+ ### Load with Stable-Baselines3
+ ```python
+ from stable_baselines3 import PPO
+ import gymnasium as gym
+
+ # Load model
+ model = PPO.load("ppo_policy.zip")
+
+ # Example run
+ env = gym.make("CartPole-v1")
+ obs, _ = env.reset()
+ action, _ = model.predict(obs)
+ print("Predicted action:", action)
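Every binary in this commit is stored through Git LFS, so a clone made without LFS leaves `ppo_policy.zip` as a short text pointer, and `PPO.load()` is likely to fail on it. The following is a minimal sketch, using only the standard library, of a check that could run before loading. The helper name `looks_like_sb3_archive` is hypothetical, and the check only confirms that the file is a zip container, not that it holds a valid model.

```python
import zipfile
from pathlib import Path


def looks_like_sb3_archive(path):
    """Return the archive's member names, or None if `path` is not a zip file.

    Hypothetical helper: it inspects the container format only and does not
    validate the model stored inside.
    """
    p = Path(path)
    if not p.is_file() or not zipfile.is_zipfile(p):
        return None
    with zipfile.ZipFile(p) as zf:
        return zf.namelist()


if __name__ == "__main__":
    members = looks_like_sb3_archive("ppo_policy.zip")
    if members is None:
        print("Not a zip archive; it may be an unresolved Git LFS pointer.")
    else:
        print("Archive members:", members)
```

If the check returns `None`, fetching the LFS objects (for example with `git lfs pull`) before retrying is the usual next step.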
best/best_model.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bd4d36fbc90382f9c3c7600bacf2d1c59a4bde48eeaa8c02f465a54016a5a402
+ size 453110
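The three lines above are a Git LFS pointer: the spec version, the SHA-256 object ID, and the size in bytes of the real file. As a rough sketch, a downloaded file could be checked against its pointer as shown below. The function names `parse_lfs_pointer` and `matches_pointer` are illustrative and not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(blob, pointer):
    """Check that `blob` (bytes) has the size and SHA-256 digest the pointer records."""
    return (
        pointer["algo"] == "sha256"
        and len(blob) == pointer["size"]
        and hashlib.sha256(blob).hexdigest() == pointer["digest"]
    )


if __name__ == "__main__":
    # Pointer text copied from the best/best_model.zip entry in this commit.
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:bd4d36fbc90382f9c3c7600bacf2d1c59a4bde48eeaa8c02f465a54016a5a402\n"
        "size 453110\n"
    )
    print(pointer["size"])
```

To verify a real download, read the file in binary mode and pass its bytes to `matches_pointer`.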
eval_logs/evaluations.npz ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1c543d076208115a61a60d2545fb34d60fa4d7a4b5fb6850fd5fda15f7af0982
+ size 1210
policy.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:00e25942efd5a414191650af665a6a0abd9d9380a0262c7b679373fd893a2727
+ size 41325
ppo_policy.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9baf2c2bf777d356aaa2302a47a3354e595e1a831485901d5dc2da6d6f3393bb
+ size 36137
safe_policy_meta.json ADDED
@@ -0,0 +1 @@
+ {"activation_fn":"relu","net_arch":{"pi":[64,64],"vf":[64,64]},"use_sde":false}
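The metadata above records the network shape: ReLU activations, two 64-unit hidden layers each for the policy (`pi`) and value (`vf`) heads, and no state-dependent exploration. A small sketch of reading it with the standard library follows. The `describe_meta` helper and its output keys are hypothetical, and the commit does not say how the subnet actually consumes this file.

```python
import json

# Contents of safe_policy_meta.json as added in this commit.
META = '{"activation_fn":"relu","net_arch":{"pi":[64,64],"vf":[64,64]},"use_sde":false}'


def describe_meta(raw):
    """Parse the metadata JSON into a flat summary (illustrative key names)."""
    meta = json.loads(raw)
    arch = meta["net_arch"]
    return {
        "activation": meta["activation_fn"],
        "policy_layers": list(arch["pi"]),
        "value_layers": list(arch["vf"]),
        "use_sde": bool(meta["use_sde"]),
    }


if __name__ == "__main__":
    print(describe_meta(META))
```

These fields resemble the `policy_kwargs` options that Stable-Baselines3 accepts (`activation_fn`, `net_arch`) and PPO's `use_sde` flag, though that correspondence is an assumption here.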
tb_logs/PPO_1/events.out.tfevents.1756478967.tantrum.1444109.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5c4dc0238a2156b584741687919dd50d86e3af016b6f2dc85a1e04b893b6fedf
+ size 251
tb_logs/PPO_2/events.out.tfevents.1756479142.tantrum.1445250.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c4a73832683ced8e630acf21120f55a9438eff4a2cebe754e5bdfb7048f7681e
+ size 11620
tb_logs/PPO_3/events.out.tfevents.1756479433.tantrum.1447230.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:398c9b0ae6a88ecab8a3b52c9fef8ffaba3ebcd27b0b82e470b6d51f234e3c9e
+ size 6682
tb_logs/PPO_4/events.out.tfevents.1756480670.tantrum.1454850.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6804c9d13b2a4137ab43c026568f7e991f9b0482e005aa1050a51bbc113f1871
+ size 20118