Commit
Force param sync when using distributed optimizer and overlap_param_gather (NVIDIA#11486)

* Add disable/enable forward pre hook for DDP and overlap param gather

Signed-off-by: Hemil Desai <[email protected]>

* Fix

Signed-off-by: Hemil Desai <[email protected]>

* Force param sync before saving checkpoint

Signed-off-by: Hemil Desai <[email protected]>

* fix

Signed-off-by: Hemil Desai <[email protected]>

* Apply isort and black reformatting

Signed-off-by: hemildesai <[email protected]>

---------

Signed-off-by: Hemil Desai <[email protected]>
Signed-off-by: hemildesai <[email protected]>
Co-authored-by: hemildesai <[email protected]>
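
The reason for forcing a param sync before saving a checkpoint can be pictured with a toy model. This is only a sketch under a stated assumption: that with overlap_param_gather enabled, updated parameters are gathered lazily (normally completed by a forward pre-hook), so a checkpoint taken right after an optimizer step could otherwise capture stale values. Every name below (`ToyShardedModel`, `force_param_sync`, `save_checkpoint`) is hypothetical and is not the NeMo or Megatron-LM API.

```python
# Toy model only: all class and method names are hypothetical illustrations,
# not the actual NeMo / Megatron-LM interfaces touched by this commit.


class ToyShardedModel:
    """Simulates parameters whose updated values arrive via a deferred gather."""

    def __init__(self, params):
        self.params = dict(params)  # values visible to forward() and checkpoints
        self._pending = {}  # updated values whose gather has not completed
        self.overlap_param_gather = True

    def optimizer_step(self, updates):
        # Assumption: with overlap enabled, the gather is deferred until the
        # next forward pass instead of completing inside the optimizer step.
        if self.overlap_param_gather:
            self._pending.update(updates)
        else:
            self.params.update(updates)

    def force_param_sync(self):
        # Complete any deferred gather immediately.
        self.params.update(self._pending)
        self._pending.clear()

    def forward(self):
        # Stand-in for the forward pre-hook that normally finishes the gather.
        self.force_param_sync()
        return sum(self.params.values())

    def state_dict(self):
        return dict(self.params)


def save_checkpoint(model, force_sync=True):
    """Return a snapshot of the parameters, optionally syncing first."""
    if force_sync:
        model.force_param_sync()
    return model.state_dict()


model = ToyShardedModel({"w": 1.0})
model.optimizer_step({"w": 2.0})

stale = save_checkpoint(model, force_sync=False)  # gather still pending
fresh = save_checkpoint(model)  # sync forced before the snapshot
print(stale, fresh)
```

In this sketch the unsynced snapshot still holds the pre-step value of `w`, while the snapshot taken after the forced sync holds the updated value.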