78c163f37be165673b040cff49e1ee98eb89ca0f,tf_agents/bandits/agents/greedy_reward_prediction_agent_test.py,AgentTest,testTrainAgentWithConstraint,#AgentTest#,253

Before Change


        constraints=[neural_constraint])
    observations = np.array([[1, 2], [3, 4]], dtype=np.float32)
    actions = np.array([0, 1], dtype=np.int32)
    rewards = np.array([[0.5, 6.0], [3.0, 4.0]], dtype=np.float32)
    initial_step, final_step = _get_initial_and_final_steps(
        observations, rewards)
    action_step = _get_action_step(actions)
    experience = _get_experience(initial_step, action_step, final_step)

After Change


        self._action_spec,
        constraint_network=constraint_net)

    reward_spec = {
        "reward": tensor_spec.TensorSpec(
            shape=(), dtype=tf.float32, name="reward"),
        "constraint": tensor_spec.TensorSpec(
            shape=(), dtype=tf.float32, name="constraint")
    }
    self._time_step_spec = ts.time_step_spec(self._obs_spec, reward_spec)

    agent = greedy_agent.GreedyRewardPredictionAgent(
        self._time_step_spec,
        self._action_spec,
        reward_network=reward_net,
        optimizer=optimizer,
        constraints=[neural_constraint])
    observations = np.array([[1, 2], [3, 4]], dtype=np.float32)
    actions = np.array([0, 1], dtype=np.int32)
    rewards = {
        "reward": np.array([0.5, 3.0], dtype=np.float32),
        "constraint": np.array([6.0, 4.0], dtype=np.float32)
    }
    initial_step, final_step = _get_initial_and_final_steps_nested_rewards(
        observations, rewards)
    action_step = _get_action_step(actions)
    experience = _get_experience(initial_step, action_step, final_step)
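
The change above replaces a single stacked `rewards` array with a dict keyed by `"reward"` and `"constraint"`, matching the nested `reward_spec`. Comparing the values in the two snippets suggests that column 0 of the old array held the reward and column 1 held the constraint signal. The sketch below illustrates that mapping with NumPy only; `split_rewards` is a hypothetical helper introduced for illustration and is not part of tf_agents or of this commit.

```python
import numpy as np

# Old layout from the "Before Change" snippet: one row per batch element.
# Assumption (inferred from the values, not stated in the commit):
# column 0 is the reward, column 1 is the constraint signal.
stacked_rewards = np.array([[0.5, 6.0], [3.0, 4.0]], dtype=np.float32)


def split_rewards(stacked):
    """Hypothetical helper: split a [batch, 2] array into a nested reward dict."""
    return {
        "reward": stacked[:, 0],
        "constraint": stacked[:, 1],
    }


nested_rewards = split_rewards(stacked_rewards)
```

Each dict entry has shape `[batch]`, which lines up with the scalar `shape=()` per-step specs declared in `reward_spec`.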

In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 12

Instances

Project Name: tensorflow/agents

Commit Name: 78c163f37be165673b040cff49e1ee98eb89ca0f

Time: 2020-06-12

Author: kokiopou@google.com

File Name: tf_agents/bandits/agents/greedy_reward_prediction_agent_test.py

Class Name: AgentTest

Method Name: testTrainAgentWithConstraint

Project Name: tensorflow/agents

Commit Name: 78c163f37be165673b040cff49e1ee98eb89ca0f

Time: 2020-06-12

Author: kokiopou@google.com

File Name: tf_agents/bandits/agents/greedy_reward_prediction_agent_test.py

Class Name: AgentTest

Method Name: testTrainAgentWithMaskAndConstraint

Project Name: tensorflow/agents

Commit Name: 78c163f37be165673b040cff49e1ee98eb89ca0f

Time: 2020-06-12

Author: kokiopou@google.com

File Name: tf_agents/bandits/agents/greedy_reward_prediction_agent_test.py

Class Name: AgentTest

Method Name: testTrainPerArmAgentWithConstraint