73b81f0b1909341c1f5633242c101498b12e84c9,chapter06/maximization_bias.py,,figure_6_7,#,103

Before Change


        for ep in range(0, episodes):
            left_counts_q[run, ep] = q_learning(q)
            left_counts_double_q[run, ep] = q_learning(q1, q2)
    left_counts_q = np.add.accumulate(left_counts_q, axis=1).mean(axis=0) / np.arange(1, episodes + 1)
    left_counts_double_q = np.add.accumulate(left_counts_double_q, axis=1).mean(axis=0) / np.arange(1, episodes + 1)

    plt.plot(left_counts_q, label="Q-Learning")
    plt.plot(left_counts_double_q, label="Double Q-Learning")

After Change


            left_counts_q[run, ep] = q_learning(q)
            left_counts_double_q[run, ep] = q_learning(q1, q2)
    left_counts_q = left_counts_q.mean(axis=0)
    left_counts_double_q = left_counts_double_q.mean(axis=0)

    plt.plot(left_counts_q, label="Q-Learning")
    plt.plot(left_counts_double_q, label="Double Q-Learning")
    plt.plot(np.ones(episodes) * 0.05, label="Optimal")
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 4

Instances


Project Name: ShangtongZhang/reinforcement-learning-an-introduction
Commit Name: 73b81f0b1909341c1f5633242c101498b12e84c9
Time: 2018-09-13
Author: lbin_sjtu@sjtu.edu.cn
File Name: chapter06/maximization_bias.py
Class Name:
Method Name: figure_6_7


Project Name: dmlc/dgl
Commit Name: 927d2b319bc7a6c31c63ad64e6124477c2a5b583
Time: 2020-12-28
Author: mufeili1996@gmail.com
File Name: examples/pytorch/ogb/ogbn-products/graphsage/main.py
Class Name: SAGE
Method Name: inference


Project Name: brainiak/brainiak
Commit Name: 74dae883a6f0bb31f55b8c1bd426a832b429ff4a
Time: 2017-11-09
Author: cellis@princeton.edu
File Name: brainiak/utils/fmrisim.py
Class Name:
Method Name: calc_noise