Commit graph

12 commits

Author            SHA1        Date                       Message
Vasilis Valatsos  8d3f4506ba  2024-01-21 21:10:27 +01:00  Updated architecture and default hyperparams
Vasilis Valatsos  4394cc7452  2023-12-22 12:32:35 +02:00  separated reward function
Vasilis Valatsos  6d169e5b3c  2023-12-14 18:28:45 +01:00  Found huge bug, updated some basic stuff
Vasilis Valatsos  939a90dd0f  2023-12-10 06:55:07 +01:00  Updated grad clips
Vasilis Valatsos  948ae9af4f  2023-12-08 22:08:25 +01:00  Once again player with rewards, and added clipping to the params
Vasilis Valatsos  3b9b25441e  2023-12-06 14:14:43 +01:00  removed redundant if is_dead() check
Vasilis Valatsos  6d316834d3  2023-12-06 13:58:00 +01:00  Updated reward and state features
Vasilis Valatsos  0aea150454  2023-12-06 13:03:59 +01:00  exp per player now resets for each episode
Vasilis Valatsos  ca29a0e6dc  2023-12-04 05:08:41 +01:00  Implemented Maxwell distribution for rewards
Vasilis Valatsos  bebc660060  2023-11-30 18:28:17 +01:00  Updated enemy detection
Vasilis Valatsos  c7ddaa630c  2023-11-29 12:10:04 +01:00  fixed three bugs
Vasilis Valatsos  84000dd28b  2023-11-29 11:53:30 +01:00  Major update, made single main file for both multi and single agent, added argsparse, polished everything