998c532248615e7ee5fcc974e0b77036764b6a3d,deepchem/rl/tests/test_ppo.py,TestPPO,test_roulette,#TestPPO#,14
Before Change
env,
TestPolicy(),
max_rollout_length=20,
optimizer=dc.models.tensorgraph.TFWrapper(
tf.train.AdamOptimizer, learning_rate=0.001))
ppo.fit(30000)
# It should have learned that the expected value is very close to zero, and that the best
After Change
env,
TestPolicy(),
max_rollout_length=20,
optimizer=Adam(learning_rate=0.001))
ppo.fit(30000)
# It should have learned that the expected value is very close to zero, and that the best
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 6
Instances Project Name: deepchem/deepchem
Commit Name: 998c532248615e7ee5fcc974e0b77036764b6a3d
Time: 2017-07-28
Author: peastman@stanford.edu
File Name: deepchem/rl/tests/test_ppo.py
Class Name: TestPPO
Method Name: test_roulette
Project Name: deepchem/deepchem
Commit Name: 998c532248615e7ee5fcc974e0b77036764b6a3d
Time: 2017-07-28
Author: peastman@stanford.edu
File Name: deepchem/rl/tests/test_a3c.py
Class Name: TestA3C
Method Name: test_roulette
Project Name: deepchem/deepchem
Commit Name: 998c532248615e7ee5fcc974e0b77036764b6a3d
Time: 2017-07-28
Author: peastman@stanford.edu
File Name: contrib/rl/tictactoe.py
Class Name:
Method Name: eval_tic_tac_toe
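
The change recorded in all three instances replaces a generic wrapper around a backend optimizer class with a dedicated optimizer object that carries only the hyperparameters. The following is a minimal stand-alone sketch of that pattern, not DeepChem's implementation: `DeferredWrapper`, `AdamSettings`, and `FakeBackendAdam` are hypothetical stand-ins introduced here so the example runs without DeepChem or TensorFlow installed.

```python
from dataclasses import dataclass


class FakeBackendAdam:
    """Hypothetical backend optimizer, present only so the sketch is runnable."""

    def __init__(self, learning_rate):
        self.learning_rate = learning_rate


class DeferredWrapper:
    """Hypothetical 'before' style: keep a class and its kwargs, construct later."""

    def __init__(self, cls, **kwargs):
        self.cls = cls
        self.kwargs = kwargs

    def build(self):
        # The caller must know the backend class and its keyword names.
        return self.cls(**self.kwargs)


@dataclass
class AdamSettings:
    """Hypothetical 'after' style: a named optimizer holding only hyperparameters."""

    learning_rate: float = 0.001

    def build(self):
        # The backend class is chosen here, hidden from the call site.
        return FakeBackendAdam(learning_rate=self.learning_rate)


# Before: the call site names the backend class explicitly.
before = DeferredWrapper(FakeBackendAdam, learning_rate=0.001)

# After: the call site states only which optimizer and which hyperparameters.
after = AdamSettings(learning_rate=0.001)
```

Both objects build an optimizer with the same learning rate. The difference is that the second form keeps the backend class out of the test code, which is presumably why the same edit could be applied uniformly across the PPO, A3C, and tic-tac-toe call sites.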