8acf099847ebf73ad8cdae1341d0f768dbe1c094,ch09/04_pong_pg.py,,,#,28
Before Change
if train_step_idx % 100 == 0:
writer.add_scalar("baseline", baseline, step_idx)
writer.add_scalar("batch_scales", np.mean(batch_scales), step_idx)
writer.add_scalar("loss_entropy", entropy_loss_v.data.cpu().numpy()[0], step_idx)
writer.add_scalar("loss_policy", loss_policy_v.data.cpu().numpy()[0], step_idx)
writer.add_scalar("loss_total", loss_v.data.cpu().numpy()[0], step_idx)
After Change
entropy_loss_v = ENTROPY_BETA * (prob_v * log_prob_v).sum()
loss_v = loss_policy_v + entropy_loss_v
m_baseline.append(baseline)
m_batch_scales.append(np.mean(batch_scales))
m_loss_entropy.append(entropy_loss_v.data.cpu().numpy()[0])
m_loss_policy.append(loss_policy_v.data.cpu().numpy()[0])
m_loss_total.append(loss_v.data.cpu().numpy()[0])
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 2
Instances
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: 8acf099847ebf73ad8cdae1341d0f768dbe1c094
Time: 2017-12-04
Author: max.lapan@gmail.com
File Name: ch09/04_pong_pg.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: e70bdb2d089ae283781c45b8d97963823a984baa
Time: 2017-12-15
Author: max.lapan@gmail.com
File Name: ch10/00_pong_pg.py
Class Name:
Method Name:
Project Name: lufficc/SSD
Commit Name: 94a995defe223eed0898f25d2332ba6178a92abe
Time: 2018-12-19
Author: luffy.lcc@gmail.com
File Name: ssd/engine/trainer.py
Class Name:
Method Name: do_train