b2aad723220e31bc8e950c112b557732b608b97a,PyPi/algorithms/td.py,DoubleQLearning,fit,#DoubleQLearning#Any#Any#,86
Before Change
assert n_fit_iterations == 1
state, action, reward, next_state, absorbing, _ =\
    parse_dataset(dataset[-1],
                  self.mdp_info["observation_space"].dim,
                  self.mdp_info["action_space"].dim)
sa = (state, action)
sa_idx = np.append(self.mdp_info["observation_space"].get_idx(state),
                   self.mdp_info["action_space"].get_idx(action))
After Change
assert n_fit_iterations == 1
state, action, reward, next_state, absorbing, _ = parse_dataset(
    dataset[-1])
sa = [state, action]
sa_idx = np.append(self.mdp_info["observation_space"].get_idx(state),
                   self.mdp_info["action_space"].get_idx(action))
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 8
Instances
Project Name: AIRLab-POLIMI/mushroom
Commit Name: b2aad723220e31bc8e950c112b557732b608b97a
Time: 2017-06-04
Author: carlo.deramo@gmail.com
File Name: PyPi/algorithms/td.py
Class Name: DoubleQLearning
Method Name: fit
Project Name: AIRLab-POLIMI/mushroom
Commit Name: b2aad723220e31bc8e950c112b557732b608b97a
Time: 2017-06-04
Author: carlo.deramo@gmail.com
File Name: PyPi/algorithms/batch_td.py
Class Name: FQI
Method Name: partial_fit
Project Name: AIRLab-POLIMI/mushroom
Commit Name: b2aad723220e31bc8e950c112b557732b608b97a
Time: 2017-06-04
Author: carlo.deramo@gmail.com
File Name: PyPi/algorithms/td.py
Class Name: TD
Method Name: fit