1. 03 12月, 2020 3 次提交
    • L
      fix shape of tile_grad op (#29289) (#29324) · 8cd8cd53
      Leo Chen 提交于
      8cd8cd53
    • S
      [cherry-pick]Change the api of DataParallel and Fleet (#29288) · ec57656e
      ShenLiang 提交于
      * Change the api of DataParallel and Fleet (#29224)
      ec57656e
    • Z
      [Cherry-pick] Add pure fp16 training with master weights. (#29301) · d8ea8a06
      Zhen Wang 提交于
      * Add pure fp16 training with master weights. (#27712)
      
      * add the weight decay func for the momentum op
      
      * Add the multi_precision function in Momentum Optimizer.
      
      * Make sure that the initial value of master weights are same with the fp16 weights.
      
      * add static loss scaling.
      
      * add the rescale_grad function in the pure fp16 training.
      
      * use the original momentum updating method.
      
      * Polish some codes, such as variable names.
      
      * add docstring for apis.
      
      * update the var creation details of _create_master_weight.
      
      * not modify codes about imperative momentum updating.
      
      * Fix the error of test_dist_sparse_tensor_load_momentum UT.
      
      * add unit test for multi precision fp16 training.
      
      * add more unit tests for CI.
      
      * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
      d8ea8a06
  2. 02 12月, 2020 2 次提交
  3. 01 12月, 2020 3 次提交
  4. 30 11月, 2020 10 次提交
  5. 28 11月, 2020 1 次提交
  6. 27 11月, 2020 8 次提交
  7. 26 11月, 2020 9 次提交
  8. 25 11月, 2020 4 次提交