deecf3c0d7452276a87eaeeee533fad8c4ba24b5,ch06/01_dqn_pong.py,,,#,146
Before Change
losses.append(loss_v.data.cpu().numpy())
epsilon *= 0.99
epsilon = max(epsilon, 0.1)
print("Loss %.6f, epsilon=%f" % (np.mean(losses), epsilon))
pass
After Change
losses.append(loss_v.data.cpu().numpy())
epsilon *= 0.99
epsilon = max(epsilon, 0.1)
writer.add_scalar("epsilon", epsilon, iter_idx)
writer.add_scalar("loss", np.mean(losses), iter_idx)
if rewards:
    writer.add_scalar("reward", np.mean(rewards))
iter_idx += 1
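The change above replaces a per-iteration `print` with scalar logging keyed by an iteration counter. A minimal sketch of that pattern follows, under stated assumptions: `RecordingWriter` and `log_iteration` are hypothetical names introduced here, and the writer is an in-memory stand-in that only mimics the `add_scalar(tag, scalar_value, global_step)` call shape of a TensorBoard-style `SummaryWriter`, so the example runs without TensorBoard installed. Unlike the recorded change, the sketch also passes the step index for the "reward" tag; that is an illustrative choice, not what the commit does.

```python
from statistics import fmean


class RecordingWriter:
    """Hypothetical in-memory stand-in for a TensorBoard-style writer."""

    def __init__(self):
        self.scalars = []  # list of (tag, value, global_step) tuples

    def add_scalar(self, tag, scalar_value, global_step=None):
        self.scalars.append((tag, float(scalar_value), global_step))


def log_iteration(writer, iter_idx, epsilon, losses, rewards):
    """Decay epsilon, log this iteration's metrics, return the updated state.

    `losses` is assumed to be non-empty, as in the recorded code where a
    loss is appended before logging.
    """
    # Multiplicative decay with a floor of 0.1, as in the recorded change.
    epsilon = max(epsilon * 0.99, 0.1)
    writer.add_scalar("epsilon", epsilon, iter_idx)
    writer.add_scalar("loss", fmean(losses), iter_idx)
    if rewards:
        # Assumption: the step index is passed here as well, so rewards
        # share an x-axis with the other two series.
        writer.add_scalar("reward", fmean(rewards), iter_idx)
    return epsilon, iter_idx + 1
```

With a real writer, the same function would be called once per training iteration, carrying `epsilon` and `iter_idx` forward from the returned tuple.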
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 11
Instances
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: deecf3c0d7452276a87eaeeee533fad8c4ba24b5
Time: 2017-10-17
Author: max.lapan@gmail.com
File Name: ch06/01_dqn_pong.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126
Time: 2017-10-22
Author: max.lapan@gmail.com
File Name: ch06/04_dqn_pong_ptan.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: f33446be0e9e8deb631477db30f20ac436491f24
Time: 2018-02-17
Author: max.lapan@gmail.com
File Name: ch16/01_cartpole_es.py
Class Name:
Method Name: