06279f054fcfa03517b1bd51ef156a9182af146f,PyPi/algorithms/td.py,DoubleQLearning,fit,#DoubleQLearning#Any#Any#,83

Before Change


        state, action, reward, next_state, absorbing, _ = parse_dataset(
            [dataset[-1]])

        sa = [state, action]

        approximator_idx = 0 if np.random.uniform() < 0.5 else 1

        q_current = self.approximator[approximator_idx].predict(sa)

After Change


        state, action, reward, next_state, absorbing, _ = parse_dataset(
            [dataset[-1]])

        sa = state_action(state, action)

        approximator_idx = 0 if np.random.uniform() < 0.5 else 1

        q_current = self.approximator[approximator_idx].predict(sa)
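
The change replaces the ad hoc list `[state, action]` with a call to a shared `state_action` helper, so every caller builds the approximator input the same way. The body of `state_action` is not shown in this record. The sketch below is only an assumption about what such a helper might do (concatenate states and actions column-wise into one 2-D array), and the sample values are made up for illustration.

```python
import numpy as np


def state_action(state, action):
    """Hypothetical helper (not the project's actual code):
    join states and actions into a single 2-D input array."""
    # Promote 1-D inputs to shape (1, n) so one sample and a batch
    # are handled the same way.
    state = np.atleast_2d(state)
    action = np.atleast_2d(action)
    # One row per sample: state features followed by action features.
    return np.concatenate((state, action), axis=1)


# Illustrative sample: a 2-dimensional state and a 1-dimensional action.
state = np.array([[0.5, -1.0]])
action = np.array([[1.0]])
sa = state_action(state, action)
```

Under this assumption `sa` is a single array rather than a Python list of two arrays, which is the kind of input a regressor's `predict` method typically expects.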

In pattern: SUPERPATTERN

Frequency: 5

Non-data size: 4

Instances

Project Name: AIRLab-POLIMI/mushroom

Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f

Time: 2017-06-16

Author: carlo.deramo@gmail.com

File Name: PyPi/algorithms/td.py

Class Name: DoubleQLearning

Method Name: fit

Project Name: AIRLab-POLIMI/mushroom

Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f

Time: 2017-06-16

Author: carlo.deramo@gmail.com

File Name: PyPi/algorithms/td.py

Class Name: DoubleQLearning

Method Name: _next_q

Project Name: AIRLab-POLIMI/mushroom

Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f

Time: 2017-06-16

Author: carlo.deramo@gmail.com

File Name: PyPi/utils/dataset.py

Class Name:

Method Name: max_QA

Project Name: AIRLab-POLIMI/mushroom

Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f

Time: 2017-06-16

Author: carlo.deramo@gmail.com

File Name: PyPi/algorithms/td.py

Class Name: WeightedQLearning

Method Name: fit

Project Name: AIRLab-POLIMI/mushroom

Commit Name: 06279f054fcfa03517b1bd51ef156a9182af146f

Time: 2017-06-16

Author: carlo.deramo@gmail.com

File Name: PyPi/algorithms/td.py

Class Name: TD

Method Name: fit