fbf3bafd970d3539ad774bae5f46a4b730901cb5,tensorforce/updater/trpo_updater.py,TRPOUpdater,create_training_operations,#TRPOUpdater#,99

Before Change


            mean_kl_divergence = get_kl_divergence_gaussian(self.prev_action_means, self.prev_action_log_stds,
                                                            self.action_means, self.action_log_stds) / float(
                self.batch_size)
            mean_entropy = get_entropy_gaussian(self.action_log_stds) / float(self.batch_size)

            self.losses = [surrogate_loss, mean_kl_divergence, mean_entropy]

            // Get symbolic gradient expressions

After Change



            surrogate_loss = -tf.reduce_mean(prob_ratio * self.advantage)
            variables = tf.trainable_variables()
            batch_float = tf.cast(self.batch_size, tf.float32)

            mean_kl_divergence = get_kl_divergence_gaussian(self.prev_action_means, self.prev_action_log_stds,
                                                            self.action_means, self.action_log_stds) / batch_float
            mean_entropy = get_entropy_gaussian(self.action_log_stds) / batch_float
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 4

Instances


Project Name: reinforceio/tensorforce
Commit Name: fbf3bafd970d3539ad774bae5f46a4b730901cb5
Time: 2017-01-15
Author: mi.schaarschmidt@gmail.com
File Name: tensorforce/updater/trpo_updater.py
Class Name: TRPOUpdater
Method Name: create_training_operations


Project Name: tensorflow/tpu
Commit Name: 21cd0774c8c3d41a8464427c81629075c953e7e3
Time: 2018-09-05
Author: xiejw0217@gmail.com
File Name: models/official/retinanet/retinanet_model.py
Class Name:
Method Name: learning_rate_schedule


Project Name: tensorflow/mesh
Commit Name: 5527dea6aa673f05fed6061255340a9e96e3fc90
Time: 2020-02-12
Author: marcvanzee@google.com
File Name: mesh_tensorflow/transformer/learning_rate_schedules.py
Class Name:
Method Name: linear_decay_learning_rate