Fixed bug in from_checkpoint.py for recurrent PG/PPO models (!4) · Merge requests · CHIP-GT / AntiPoachingGame

Merged Maddila Siva Sri Prasanna requested to merge fix-recurrent-replay into dev 10 months ago

This bug was linked to the use of compute_single_action, where the seq_lens and the state parameters were empty. This bugged out the script, preventing us from simulating learned policies using from_checkpoint.py. This has since been fixed. The QMIX LSTM model does not apparently suffer from this bug, therefore it is untouched for now.

Activity

Please register or sign in to reply

Fixed bug in from_checkpoint.py for recurrent PG/PPO models

Merge request reports

Activity