I'm learning how to use fairseq to implement a simple translation model based on the Transformer.
I would like to train on 2 GeForce RTX 3090 GPUs on my lab server. Which option should I select for the --ddp-backend flag of fairseq-train?
Furthermore, could you explain the meaning of each of the following options for --ddp-backend, and when to use each of them?
From the fairseq documentation (Command-line Tools → fairseq-train → distributed_training):

--ddp-backend: Possible choices: c10d, fully_sharded, legacy_ddp, no_c10d, pytorch_ddp, slowmo
DistributedDataParallel backend
Default: "pytorch_ddp"
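For context, this is roughly the command I plan to run, adapted from the fairseq IWSLT'14 translation example (the dataset path and architecture are placeholders from that example, not necessarily the right choices for my setup):

```sh
# Expose only the two 3090s to fairseq (assuming they are devices 0 and 1);
# fairseq-train will launch one worker per visible GPU automatically.
CUDA_VISIBLE_DEVICES=0,1 fairseq-train data-bin/iwslt14.tokenized.de-en \
    --arch transformer_iwslt_de_en \
    --optimizer adam --adam-betas '(0.9, 0.98)' \
    --lr 5e-4 --lr-scheduler inverse_sqrt --warmup-updates 4000 \
    --criterion label_smoothed_cross_entropy --label-smoothing 0.1 \
    --max-tokens 4096 \
    --ddp-backend pytorch_ddp  # <- the option I'm unsure about
```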
I'm new to the Stack Exchange community, so apologies in advance if I've done anything inappropriate here.