Merge non dist and dist train (#2440)
* Merge _dist_train and _non_dist_train
* Add missing distributed arg to Fp16OptimizerHook