c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126,ch06/04_dqn_pong_ptan.py,,,#,64

Before Change


        epsilon_greedy_selector.epsilon = max(EPSILON_FINAL, EPSILON_START - frame_idx / EPSILON_DECAY_LAST_FRAME)

        new_rewards = exp_source.pop_total_rewards()
        if new_rewards:
            total_rewards.extend(new_rewards)
            print("%d: %s" % (frame_idx, new_rewards))

        if len(buffer) < REPLAY_START_SIZE:
            continue

        optimizer.zero_grad()
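
The epsilon update at the top of the snippet above is a linear decay clamped at a floor. A minimal standalone sketch of that schedule follows. The three constant values are assumptions chosen for illustration, since the snippet does not show their definitions, and `epsilon_at` is a hypothetical helper name that does not appear in the original file.

```python
# Assumed values for illustration only; the snippet does not define them.
EPSILON_START = 1.0
EPSILON_FINAL = 0.02
EPSILON_DECAY_LAST_FRAME = 100_000


def epsilon_at(frame_idx: int) -> float:
    """Return epsilon for a given frame (hypothetical helper).

    Epsilon falls linearly from EPSILON_START as frame_idx grows and is
    clamped so it never drops below EPSILON_FINAL.
    """
    return max(EPSILON_FINAL, EPSILON_START - frame_idx / EPSILON_DECAY_LAST_FRAME)


if __name__ == "__main__":
    for frame in (0, 50_000, 100_000, 200_000):
        print(frame, epsilon_at(frame))
```

Under these assumed constants the clamp takes over slightly before `EPSILON_DECAY_LAST_FRAME`, because the linear term is subtracted from `EPSILON_START` rather than interpolated between the start and final values.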

After Change


    env = ptan.common.wrappers.wrap_dqn(env)
    env = ptan.common.wrappers.ScaledFloatFrame(env)

    writer = SummaryWriter(comment="-pong-ptan")
    net = dqn_model.DQN(env.observation_space.shape, env.action_space.n)
    if args.cuda:
        net.cuda()

In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 6

Instances

Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On

Commit Name: c4bfe2ab1d355f8ed0c9881fd4f9d114cc627126

Time: 2017-10-22

Author: max.lapan@gmail.com

File Name: ch06/04_dqn_pong_ptan.py

Class Name:

Method Name:

Project Name: dmlc/dgl

Commit Name: ff345c2e2210061141653a19ef1431c6866cbf68

Time: 2021-02-03

Author: wcy_james@outlook.com

File Name: examples/pytorch/pointcloud/pointnet/train_partseg.py

Class Name:

Method Name:

Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On

Commit Name: c829d1a0bc7ded09ed0057ad4bc8493b3ab12fa0

Time: 2018-02-20

Author: max.lapan@gmail.com

File Name: ch16/02_breakout_es.py

Class Name:

Method Name: