8acf099847ebf73ad8cdae1341d0f768dbe1c094,ch09/04_pong_pg.py,,,#,28

Before Change



            if train_step_idx % 100 == 0:
                writer.add_scalar("baseline", baseline, step_idx)
                writer.add_scalar("batch_scales", np.mean(batch_scales), step_idx)
                writer.add_scalar("loss_entropy", entropy_loss_v.data.cpu().numpy()[0], step_idx)
                writer.add_scalar("loss_policy", loss_policy_v.data.cpu().numpy()[0], step_idx)
                writer.add_scalar("loss_total", loss_v.data.cpu().numpy()[0], step_idx)

After Change


            entropy_loss_v = ENTROPY_BETA * (prob_v * log_prob_v).sum()
            loss_v = loss_policy_v + entropy_loss_v

            m_baseline.append(baseline)
            m_batch_scales.append(np.mean(batch_scales))
            m_loss_entropy.append(entropy_loss_v.data.cpu().numpy()[0])
            m_loss_policy.append(loss_policy_v.data.cpu().numpy()[0])
            m_loss_total.append(loss_v.data.cpu().numpy()[0])
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 2

Instances


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: 8acf099847ebf73ad8cdae1341d0f768dbe1c094
Time: 2017-12-04
Author: max.lapan@gmail.com
File Name: ch09/04_pong_pg.py
Class Name:
Method Name:


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: e70bdb2d089ae283781c45b8d97963823a984baa
Time: 2017-12-15
Author: max.lapan@gmail.com
File Name: ch10/00_pong_pg.py
Class Name:
Method Name:


Project Name: lufficc/SSD
Commit Name: 94a995defe223eed0898f25d2332ba6178a92abe
Time: 2018-12-19
Author: luffy.lcc@gmail.com
File Name: ssd/engine/trainer.py
Class Name:
Method Name: do_train