3ce4fd552a89c6fd5e479ad3713418b036337ff6,mushroom_rl/algorithms/value/batch_td/fqi.py,FQI,fit,#FQI#Any#,52
Before Change
for _ in trange(self._n_iterations, dynamic_ncols=True,
disable=self._quiet, leave=False):
fit(dataset)
def _fit(self, x):
Single fit iteration.
After Change
if self._target is None:
self._target = reward
else:
q = self.approximator.predict(next_state)
if np.any(absorbing):
q *= 1 - absorbing.reshape(-1, 1)
max_q = np.max(q, axis=1)
self._target = reward + self.mdp_info.gamma * max_q
self.approximator.fit(state, action, self._target, **self._fit_params)
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 6
Instances
Project Name: AIRLab-POLIMI/mushroom
Commit Name: 3ce4fd552a89c6fd5e479ad3713418b036337ff6
Time: 2021-01-08
Author: carlo.deramo@gmail.com
File Name: mushroom_rl/algorithms/value/batch_td/fqi.py
Class Name: FQI
Method Name: fit
Project Name: IndicoDataSolutions/finetune
Commit Name: 6d72949e9931bfef846ca8a42fc6e24f573e1de3
Time: 2020-04-23
Author: benlt@hotmail.co.uk
File Name: finetune/datasets/reuters.py
Class Name:
Method Name:
Project Name: ncullen93/torchsample
Commit Name: 1344dee35dbacaaaaabdaf452f0dfe74e3ab50e4
Time: 2017-04-19
Author: ncullen@modv-vlan533.0288.apn.wlan.wireless-pennnet.upenn.edu
File Name: torchsample/modules/example.py
Class Name:
Method Name: