
Conversation

@EquationWalker (Contributor)
In FSDP2, the model (an FSDPModule) does not have no_sync(); instead, call model.set_requires_gradient_sync(False) to turn off gradient synchronization. See torch.distributed.fsdp.FSDPModule.set_requires_gradient_sync.
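
A minimal sketch of how this replaces FSDP1's no_sync() in a gradient-accumulation loop, assuming `model` is an FSDPModule returned by `fully_shard`, and with `dataloader`, `accumulation_steps`, and `optimizer` as placeholder names:

```python
# Sketch only: gradient accumulation with FSDP2. FSDP1's
# `with model.no_sync():` context manager is replaced by toggling sync.
for step, batch in enumerate(dataloader):
    sync_now = (step + 1) % accumulation_steps == 0
    # Skip reduce-scatter/all-reduce on intermediate micro-batches;
    # re-enable it for the step that calls optimizer.step().
    model.set_requires_gradient_sync(sync_now)
    loss = model(batch).mean()
    loss.backward()
    if sync_now:
        optimizer.step()
        optimizer.zero_grad()
```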

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@S1ro1 (Contributor) left a comment:

Thanks!

@S1ro1 merged commit ec92b1a into huggingface:main on Sep 6, 2025
25 checks passed
@bghira commented Nov 7, 2025

@EquationWalker this surfaces a possible user-side bug:

 'list' object has no attribute 'set_requires_gradient_sync' 

This did not happen before because everything inside the function silently falls through when whatever it's looking for isn't there; see the DeepSpeed ZeRO 3 check earlier, which uses getattr with a default that returns a null context.
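
For illustration, a hypothetical, runnable sketch of that fallthrough pattern (not the actual Accelerate source; the list stands in for the user mistake):

```python
from contextlib import nullcontext

model = [object()]  # user mistakenly passed a list instead of a model

# Old pattern: getattr with a null-context default silently falls through,
# so gradient sync is never actually disabled and no error is raised.
no_sync_ctx = getattr(model, "no_sync", nullcontext)
with no_sync_ctx():
    pass  # the training step would run here with sync still enabled

# The FSDP2 path calls the method directly, which fails loudly:
model.set_requires_gradient_sync(False)
# AttributeError: 'list' object has no attribute 'set_requires_gradient_sync'
```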

Not sure that it helps anyone to know this, or whether some earlier check should occur when a user passes a list (instead of unpacking it) to accumulate().

@EquationWalker (Contributor, Author) commented Dec 20, 2025

> @EquationWalker this surfaces a possible user-side bug:
>
>     'list' object has no attribute 'set_requires_gradient_sync'
>
> This did not happen before because everything inside the function silently falls through when whatever it's looking for isn't there; see the DeepSpeed ZeRO 3 check earlier, which uses getattr with a default that returns a null context.
>
> Not sure that it helps anyone to know this, or whether some earlier check should occur when a user passes a list (instead of unpacking it) to accumulate().

I think this check should occur in Accelerator.accumulate(). If the user passes a list object and we fall back to nullcontext, the user will not know that gradient synchronization is not actually turned off. Instead, we should either unpack the list in Accelerator.accumulate() or raise an exception when the user passes a list object.
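
A hypothetical sketch of such an early check (illustrative only, not the actual Accelerate implementation; Accelerator.accumulate does take its models as varargs):

```python
def validate_accumulate_args(*models):
    """Hypothetical early check for Accelerator.accumulate(*models)."""
    for model in models:
        if isinstance(model, (list, tuple)):
            raise TypeError(
                "accumulate() received a list/tuple of models; pass them "
                "unpacked instead, e.g. accelerator.accumulate(*models)."
            )

# The mistake this would catch:
#   accelerator.accumulate(models)    -> TypeError with a clear message
# Correct usage:
#   accelerator.accumulate(*models)
```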
