06279f054fcfa03517b1bd51ef156a9182af146f,PyPi/algorithms/td.py,DoubleQLearning,fit,#DoubleQLearning#Any#Any#,83

Before Change


        state, action, reward, next_state, absorbing, _ = parse_dataset(
            [dataset[-1]])

        sa = [state, action]

        approximator_idx = 0 if np.random.uniform() < 0.5 else 1

        q_current = self.approximator[approximator_idx].predict(sa)

After Change


        state, action, reward, next_state, absorbing, _ = parse_dataset(
            [dataset[-1]])

        sa = state_action(state, action)

        approximator_idx = 0 if np.random.uniform() < 0.5 else 1

        q_current = self.approximator[approximator_idx].predict(sa)
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 5

Non-data size: 4

Instances


Project Name: AIRLab-POLIMI/mushroom
Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f
Time: 2017-06-16
Author: carlo.deramo@gmail.com
File Name: PyPi/algorithms/td.py
Class Name: DoubleQLearning
Method Name: fit


Project Name: AIRLab-POLIMI/mushroom
Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f
Time: 2017-06-16
Author: carlo.deramo@gmail.com
File Name: PyPi/algorithms/td.py
Class Name: DoubleQLearning
Method Name: _next_q


Project Name: AIRLab-POLIMI/mushroom
Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f
Time: 2017-06-16
Author: carlo.deramo@gmail.com
File Name: PyPi/utils/dataset.py
Class Name:
Method Name: max_QA


Project Name: AIRLab-POLIMI/mushroom
Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f
Time: 2017-06-16
Author: carlo.deramo@gmail.com
File Name: PyPi/algorithms/td.py
Class Name: WeightedQLearning
Method Name: fit


Project Name: AIRLab-POLIMI/mushroom
Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f
Time: 2017-06-16
Author: carlo.deramo@gmail.com
File Name: PyPi/algorithms/td.py
Class Name: TD
Method Name: fit