bafeed46fb76fa337a771ebb41d65bb95039565a,fairseq/trainer.py,Trainer,train_step,#Trainer#Any#Any#Any#,261
Before Change
# printed out if another exception happens.
# NB(jerry): added a flush to mitigate this
print(msg, file=sys.stderr)
if torch.cuda.is_available() and hasattr(torch.cuda, "memory_summary"):
    for device_idx in range(torch.cuda.device_count()):
        print(torch.cuda.memory_summary(device=device_idx),
              file=sys.stderr)
sys.stderr.flush()
if raise_oom:
    raise ValueError(msg)
After Change
    self.zero_grad()
    logging_output = None
except RuntimeError as e:
    if "out of memory" in str(e):
        self._log_oom(e)
        print("| ERROR: OOM during optimization, irrecoverable")
    raise e
if self.args.fp16:
    self.meters["loss_scale"].reset()
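
Note: the After Change hunk calls self._log_oom(e), but this record does not show that helper's body. The sketch below is one plausible shape for it, assuming it simply wraps the logging that the Before Change hunk performed inline. The names log_oom and guarded_step, the stream parameter, and the exact message text are hypothetical and are not taken from the fairseq source. The torch import is optional so the sketch also runs without PyTorch installed.

```python
import sys

try:
    import torch  # optional; only needed for the CUDA memory summary
except ImportError:
    torch = None


def log_oom(exc, stream=sys.stderr):
    """Hypothetical stand-in for Trainer._log_oom; not the fairseq source."""
    print("| OOM: ran out of memory with exception: {}".format(exc), file=stream)
    # torch.cuda.memory_summary is missing from older PyTorch builds, hence hasattr.
    if (torch is not None and torch.cuda.is_available()
            and hasattr(torch.cuda, "memory_summary")):
        for device_idx in range(torch.cuda.device_count()):
            print(torch.cuda.memory_summary(device=device_idx), file=stream)
    # stderr may be buffered; flush so the report survives a later exception.
    stream.flush()


def guarded_step(step_fn, stream=sys.stderr):
    """Run step_fn; report OOM errors, then re-raise every RuntimeError."""
    try:
        return step_fn()
    except RuntimeError as e:
        if "out of memory" in str(e):
            log_oom(e, stream=stream)
            # The recorded change prints this line to stdout; the sketch
            # sends it to the same stream as the report for simplicity.
            print("| ERROR: OOM during optimization, irrecoverable", file=stream)
        raise
```

As in the After Change hunk, the error is always re-raised: an OOM during the optimizer step is treated as unrecoverable, so the helper only reports it and does not swallow it. A bare raise is used here in place of raise e; both propagate the same exception.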
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 5
Instances
Project Name: elbayadm/attn2d
Commit Name: bafeed46fb76fa337a771ebb41d65bb95039565a
Time: 2019-11-06
Author: noreplyspamblackhole@gmail.com
File Name: fairseq/trainer.py
Class Name: Trainer
Method Name: train_step
Project Name: ultralytics/yolov3
Commit Name: 3825e99ee36bd237d11f702af93a2acba4496c9b
Time: 2019-04-08
Author: glenn.jocher@ultralytics.com
File Name: utils/torch_utils.py
Class Name:
Method Name: select_device