c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126,ch06/04_dqn_pong_ptan.py,,,#,64

Before Change


        epsilon_greedy_selector.epsilon = max(EPSILON_FINAL, EPSILON_START - frame_idx / EPSILON_DECAY_LAST_FRAME)

        new_rewards = exp_source.pop_total_rewards()
        if new_rewards:
            total_rewards.extend(new_rewards)
            print("%d: %s" % (frame_idx, new_rewards))

        if len(buffer) < REPLAY_START_SIZE:
            continue

        optimizer.zero_grad()
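
The epsilon update in the excerpt above is a linear decay clamped at a floor. A minimal standalone sketch of that schedule follows; the three constant values are assumptions chosen for illustration (the excerpt does not show them), and `epsilon_at` is a hypothetical helper name, not part of the original file.

```python
# Hypothetical values: the excerpt does not show how these constants are set.
EPSILON_START = 1.0
EPSILON_FINAL = 0.02
EPSILON_DECAY_LAST_FRAME = 100_000


def epsilon_at(frame_idx: int) -> float:
    """Return epsilon for a given frame: linear decay from EPSILON_START,
    never dropping below EPSILON_FINAL (same expression as in the excerpt)."""
    return max(EPSILON_FINAL, EPSILON_START - frame_idx / EPSILON_DECAY_LAST_FRAME)


if __name__ == "__main__":
    # Print epsilon at a few sample frames to show the decay and the clamp.
    for frame in (0, 50_000, 1_000_000):
        print(frame, epsilon_at(frame))
```

Under these assumed values, epsilon falls by 1.0 over `EPSILON_DECAY_LAST_FRAME` frames and then stays at `EPSILON_FINAL`.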

After Change


    env = ptan.common.wrappers.wrap_dqn(env)
    env = ptan.common.wrappers.ScaledFloatFrame(env)

    writer = SummaryWriter(comment="-pong-ptan")
    net = dqn_model.DQN(env.observation_space.shape, env.action_space.n)
    if args.cuda:
        net.cuda()
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 6

Instances


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126
Time: 2017-10-22
Author: max.lapan@gmail.com
File Name: ch06/04_dqn_pong_ptan.py
Class Name:
Method Name:


Project Name: dmlc/dgl
Commit Name: ff345c2e2210061141653a19ef1431c6866cbf68
Time: 2021-02-03
Author: wcy_james@outlook.com
File Name: examples/pytorch/pointcloud/pointnet/train_partseg.py
Class Name:
Method Name:


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: c829d1a0bc7ded09ed0057ad4bc8493b3ab12fa0
Time: 2018-02-20
Author: max.lapan@gmail.com
File Name: ch16/02_breakout_es.py
Class Name:
Method Name: