c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126,ch06/04_dqn_pong_ptan.py,,,#,64
Before Change
epsilon_greedy_selector.epsilon = max(EPSILON_FINAL, EPSILON_START - frame_idx / EPSILON_DECAY_LAST_FRAME)
new_rewards = exp_source.pop_total_rewards()
if new_rewards:
total_rewards.extend(new_rewards)
print("%d: %s" % (frame_idx, new_rewards))
if len(buffer) < REPLAY_START_SIZE:
continue
optimizer.zero_grad()
After Change
env = ptan.common.wrappers.wrap_dqn(env)
env = ptan.common.wrappers.ScaledFloatFrame(env)
writer = SummaryWriter(comment="-pong-ptan")
net = dqn_model.DQN(env.observation_space.shape, env.action_space.n)
if args.cuda:
net.cuda()
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 6
Instances
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126
Time: 2017-10-22
Author: max.lapan@gmail.com
File Name: ch06/04_dqn_pong_ptan.py
Class Name:
Method Name:
Project Name: dmlc/dgl
Commit Name: ff345c2e2210061141653a19ef1431c6866cbf68
Time: 2021-02-03
Author: wcy_james@outlook.com
File Name: examples/pytorch/pointcloud/pointnet/train_partseg.py
Class Name:
Method Name:
Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: c829d1a0bc7ded09ed0057ad4bc8493b3ab12fa0
Time: 2018-02-20
Author: max.lapan@gmail.com
File Name: ch16/02_breakout_es.py
Class Name:
Method Name: