Commit graph

16 commits

Author SHA1 Message Date
Vasilis Valatsos
8d3f4506ba Updated architecture and default hyperparams 2024-01-21 21:10:27 +01:00
Vasilis Valatsos
6d169e5b3c Found huge bug, updated some basic stuff 2023-12-14 18:28:45 +01:00
Vasilis Valatsos
3d7f973789 Removed comments 2023-12-10 20:16:00 +01:00
Vasilis Valatsos
c278170847 Massive improvement 2023-12-10 20:15:40 +01:00
Vasilis Valatsos
8e2664533f Removed grad clip 2023-12-10 11:59:27 +01:00
Vasilis Valatsos
939a90dd0f Updated grad clips 2023-12-10 06:55:07 +01:00
Vasilis Valatsos
85c1532920 Updated critic to use leaky ReLu 2023-12-09 13:48:16 +01:00
Vasilis Valatsos
948ae9af4f Once again player with rewards, and added clipping to the params 2023-12-08 22:08:25 +01:00
Vasilis Valatsos
84000dd28b Major update, made single main file for both multi and single agent, added argsparse, polished everything 2023-11-29 11:53:30 +01:00
Vasilis Valatsos
0bdcc8ca6f Found bug again, fixed bug again 2023-11-25 00:26:47 +01:00
Vasilis Valatsos
a9868e6c1a Weird bug showed, added diagnostics 2023-11-24 21:15:50 +01:00
Vasilis Valatsos
1f91ec9d5d Training done 2023-11-24 14:23:12 +01:00
Vasilis Valatsos
8809c1b06c Fixed errors for MARL 2023-11-23 16:37:02 +01:00
Vasilis Valatsos
3fb147afff Updated some stuff 2023-11-20 01:51:54 +01:00
Vasilis Valatsos
da649ccca8 Added more rewards 2023-11-19 04:27:47 +01:00
Vasilis Valatsos
115b2e4151 Hopefully implemented PPO 2023-11-17 03:19:03 +01:00