1646ac6e24a0c760f7e2d232169dbf42dc4acfb8,ch13/wob_click_train.py,,,#,20

Before Change


                # calculate policy gradients only
                loss_policy_v.backward(retain_graph=True)
                grads = np.concatenate([p.grad.data.cpu().numpy().flatten()
                                        for p in net.parameters()
                                        if p.grad is not None])

                # apply entropy and value gradients
                loss_v = entropy_loss_v + loss_value_v
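
The fragment above is cut from a larger training loop, so a minimal self-contained sketch of the same two-pass backward pattern may help. Everything except the names `net`, `loss_policy_v`, `entropy_loss_v`, `loss_value_v`, `loss_v` and `grads` is an assumption made for illustration: the `TinyA2C` network, the random batch and the `0.01` entropy coefficient are hypothetical and do not come from the commit.

```python
import numpy as np
import torch
import torch.nn as nn

torch.manual_seed(0)


class TinyA2C(nn.Module):
    """Hypothetical actor-critic net: shared body, policy head, value head."""

    def __init__(self, obs_size=4, n_actions=2):
        super().__init__()
        self.body = nn.Linear(obs_size, 8)
        self.policy = nn.Linear(8, n_actions)
        self.value = nn.Linear(8, 1)

    def forward(self, x):
        h = torch.relu(self.body(x))
        return self.policy(h), self.value(h)


net = TinyA2C()
obs = torch.randn(5, 4)              # made-up batch of 5 observations
actions = torch.randint(0, 2, (5,))  # made-up actions
returns = torch.randn(5)             # made-up discounted returns

logits, values = net(obs)
values = values.squeeze(-1)
log_probs = torch.log_softmax(logits, dim=1)
probs = torch.softmax(logits, dim=1)
adv = returns - values.detach()      # advantage, detached from the value head

loss_policy_v = -(adv * log_probs[torch.arange(5), actions]).mean()
entropy_loss_v = 0.01 * (probs * log_probs).sum(dim=1).mean()
loss_value_v = nn.functional.mse_loss(values, returns)

net.zero_grad()
# calculate policy gradients only; retain_graph keeps the graph for the next pass
loss_policy_v.backward(retain_graph=True)
grads = np.concatenate([p.grad.data.cpu().numpy().flatten()
                        for p in net.parameters()
                        if p.grad is not None])

# apply entropy and value gradients; they accumulate on top of the policy ones
loss_v = entropy_loss_v + loss_value_v
loss_v.backward()
```

In this sketch the value head receives no gradient from the first pass, because the advantage is detached, so the `p.grad is not None` filter leaves it out of `grads`. Only the second `backward()` call fills it in.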

After Change


    parser.add_argument("--port-ofs", type=int, default=0, help="Offset for container's ports, default=0")
    parser.add_argument("--env", default=ENV_NAME, help="Environment name to solve, default=" + ENV_NAME)
    parser.add_argument("--demo", help="Demo dir to load. Default=No demo")
    parser.add_argument("--host", default="localhost", help="Host with docker containers")
    args = parser.parse_args()

    env_name = args.env
    if not env_name.startswith("wob.mini."):
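
The snippet above stops at the `if` line, so the following is only a sketch of how these arguments could be parsed and checked in isolation. The value of `ENV_NAME`, the helper names `build_parser` and `normalize_env_name`, and the choice to prepend the `wob.mini.` prefix are assumptions for illustration. The commit excerpt does not show what the original branch does.

```python
import argparse

# Assumed placeholder; the real constant is defined elsewhere in the script.
ENV_NAME = "wob.mini.ClickDialog-v0"


def build_parser():
    """Build a parser with the four options shown in the snippet."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--port-ofs", type=int, default=0,
                        help="Offset for container's ports, default=0")
    parser.add_argument("--env", default=ENV_NAME,
                        help="Environment name to solve, default=" + ENV_NAME)
    parser.add_argument("--demo", help="Demo dir to load. Default=No demo")
    parser.add_argument("--host", default="localhost",
                        help="Host with docker containers")
    return parser


def normalize_env_name(name, prefix="wob.mini."):
    """Hypothetical helper: add the prefix when the name lacks it."""
    return name if name.startswith(prefix) else prefix + name


# Parse an explicit argument list so the example does not depend on sys.argv.
args = build_parser().parse_args(["--port-ofs", "2", "--env", "ClickTest-v0"])
env_name = normalize_env_name(args.env)
print(args.port_ofs, args.host, args.demo, env_name)
```

Note that argparse exposes `--port-ofs` as `args.port_ofs`, since dashes in option names become underscores in the attribute name.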
In pattern: SUPERPATTERN

Frequency: 4

Non-data size: 2

Instances


Project Name: PacktPublishing/Deep-Reinforcement-Learning-Hands-On
Commit Name: 1646ac6e24a0c760f7e2d232169dbf42dc4acfb8
Time: 2018-01-29
Author: max.lapan@gmail.com
File Name: ch13/wob_click_train.py
Class Name:
Method Name:


Project Name: pyprob/pyprob
Commit Name: 5ac32fa1aa7633276f3e16a5e81c204661a92567
Time: 2017-05-16
Author: atilimgunes.baydin@gmail.com
File Name: infcomp/compile.py
Class Name:
Method Name: main


Project Name: lcswillems/torch-rl
Commit Name: d664fab69fcdb71d2e919112a45169fa937ef7ea
Time: 2017-10-24
Author: khrylx@gmail.com
File Name: examples/trpo_gym.py
Class Name:
Method Name:


Project Name: pyprob/pyprob
Commit Name: 1b9d55d574175553bf9a6959f6c8f8222c24fd32
Time: 2017-05-16
Author: atilimgunes.baydin@gmail.com
File Name: infcomp/compile.py
Class Name:
Method Name: main