9abf7a10295c16cecb76acb9472fadbfe4ca7c9a,safe_learning/tests/test_rl.py,TestPolicyIteration,test_integration,#TestPolicyIteration#,30
Before Change
value_iter = rl.value_iteration()
adapt_policy = tf.train.GradientDescentOptimizer(0.01).minimize(
-tf.reduce_sum(rl.future_values(rl.state_space)),
var_list=rl.policy.parameters)
After Change
value_iter = rl.value_iteration()
loss = -tf.reduce_sum(rl.future_values(rl.state_space))
optimizer = tf.train.GradientDescentOptimizer(0.01)
adapt_policy = optimizer.minimize(loss,
var_list=rl.policy.parameters)
sess.run(tf.global_variables_initializer())
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 3
Instances
Project Name: befelix/safe_learning
Commit Name: 9abf7a10295c16cecb76acb9472fadbfe4ca7c9a
Time: 2017-04-26
Author: fberkenkamp@gmail.com
File Name: safe_learning/tests/test_rl.py
Class Name: TestPolicyIteration
Method Name: test_integration
Project Name: HyperGAN/HyperGAN
Commit Name: c8f075268f7e5645a77eef21591a62a07e7e8baa
Time: 2017-02-28
Author: mikkel@255bits.com
File Name: hypergan/trainers/sgd_trainer.py
Class Name:
Method Name: create
Project Name: p2irc/deepplantphenomics
Commit Name: c4225216a131206747cdf5ca05cb1d4ef6fa3ac9
Time: 2018-05-22
Author: nicoreekohiggs@gmail.com
File Name: deepplantphenomics/deepplantpheno.py
Class Name: DPPModel
Method Name: __assemble_graph