| Author | Commit | Message | Date |
|---|---|---|---|
| Vasilis Valatsos | 8d3f4506ba | Updated architecture and default hyperparams | 2024-01-21 21:10:27 +01:00 |
| Vasilis Valatsos | 4394cc7452 | separated reward function | 2023-12-22 12:32:35 +02:00 |
| Vasilis Valatsos | 6d169e5b3c | Found huge bug, updated some basic stuff | 2023-12-14 18:28:45 +01:00 |
| Vasilis Valatsos | c278170847 | Massive improvement | 2023-12-10 20:15:40 +01:00 |
| Vasilis Valatsos | 948ae9af4f | Once again played with rewards, and added clipping to the params | 2023-12-08 22:08:25 +01:00 |
| Vasilis Valatsos | 84000dd28b | Major update, made single main file for both multi and single agent, added argparse, polished everything | 2023-11-29 11:53:30 +01:00 |
| Vasilis Valatsos | 8809c1b06c | Fixed errors for MARL | 2023-11-23 16:37:02 +01:00 |
| Vasilis Valatsos | 1a6ed25673 | Update reward structure (fixed major bugs) | 2023-11-23 12:44:23 +01:00 |
| Vasilis Valatsos | 115b2e4151 | Hopefully implemented PPO | 2023-11-17 03:19:03 +01:00 |
| Vasilis Valatsos | b4a6e99fce | Implemented (badly) agent | 2023-11-14 22:44:43 +01:00 |