Open
Description
Hello,
I tried to reproduce the result (with n_action_repeat 1) on the computer with GTX 1080, however the performance is not as good as shown in the figure. After 2.88 M steps the average reward is 0.0174,
the average ep_reward is 3.1071, and the max ep_reward is 7.
Maybe I did something wrong in the setting or misread some information. Could you give me some suggestions? Thanks a lot!
Chih-Chieh
Metadata
Metadata
Assignees
Labels
No labels