serhii.net

In the middle of the desert you can say anything you want

11 Mar 2021

Day 800

Detectron2 dataloader training in parallel num_workers (“process exited unexpectedly”)

When training on different GPUs on the same server, I get errors like RuntimeError: DataLoader worker (pid 30141) exited unexpectedly with exit code 1.

The fix was to set the number of workers to 0: 1

cfg.DATALOADER.NUM_WORKERS = 2
Nel mezzo del deserto posso dire tutto quello che voglio.