deecf3c0d7452276a87eaeeee533fad8c4ba24b5,ch06/01_dqn_pong.py,,,#,146

Before Change


            losses.append(loss_v.data.cpu().numpy())
        epsilon *= 0.99
        epsilon = max(epsilon, 0.1)
        print("Loss %.6f, epsilon=%f" % (np.mean(losses), epsilon))

    pass

After Change


            losses.append(loss_v.data.cpu().numpy())
        epsilon *= 0.99
        epsilon = max(epsilon, 0.1)
        writer.add_scalar("epsilon", epsilon, iter_idx)
        writer.add_scalar("loss", np.mean(losses), iter_idx)
        if rewards:
            writer.add_scalar("reward", np.mean(rewards))
        iter_idx += 1
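
The change above replaces a per-iteration print with scalar logging keyed by an iteration counter. A minimal sketch of that pattern follows, using only the standard library. It assumes `writer` is a TensorBoard-style object with an `add_scalar(tag, scalar_value, global_step)` method; `RecordingWriter` and `log_iteration` are hypothetical names introduced here for illustration and are not part of the commit. Unlike the recorded change, the sketch also passes `iter_idx` for the "reward" scalar, on the assumption that all three curves should share one x-axis.

```python
from statistics import mean


class RecordingWriter:
    """Hypothetical stand-in for a TensorBoard-style writer (illustration only)."""

    def __init__(self):
        self.records = []

    def add_scalar(self, tag, scalar_value, global_step=None):
        # Store each scalar instead of writing an event file.
        self.records.append((tag, float(scalar_value), global_step))


def log_iteration(writer, epsilon, losses, rewards, iter_idx):
    """Decay epsilon, log this iteration's metrics, return (epsilon, next index)."""
    epsilon = max(epsilon * 0.99, 0.1)  # same decay and floor as the snippet above
    writer.add_scalar("epsilon", epsilon, iter_idx)
    writer.add_scalar("loss", mean(losses), iter_idx)
    if rewards:
        # Assumption: the step is passed here too, which the recorded change omits.
        writer.add_scalar("reward", mean(rewards), iter_idx)
    return epsilon, iter_idx + 1


writer = RecordingWriter()
epsilon, iter_idx = 1.0, 0
# First iteration: no finished episodes yet, so no reward is logged.
epsilon, iter_idx = log_iteration(writer, epsilon, [0.5, 0.3], [], iter_idx)
# Second iteration: two episodes finished, so their mean reward is logged.
epsilon, iter_idx = log_iteration(writer, epsilon, [0.2, 0.2], [-21.0, -19.0], iter_idx)
```

Returning the updated `epsilon` and `iter_idx` keeps the helper free of global state; in a training script the same calls could equally stay inline in the loop, as they do in the recorded change.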
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 11

Instances


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: deecf3c0d7452276a87eaeeee533fad8c4ba24b5
Time: 2017-10-17
Author: max.lapan@gmail.com
File Name: ch06/01_dqn_pong.py
Class Name:
Method Name:


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126
Time: 2017-10-22
Author: max.lapan@gmail.com
File Name: ch06/04_dqn_pong_ptan.py
Class Name:
Method Name:


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: f33446be0e9e8deb631477db30f20ac436491f24
Time: 2018-02-17
Author: max.lapan@gmail.com
File Name: ch16/01_cartpole_es.py
Class Name:
Method Name: