1. 20 12月, 2021 1 次提交
    • S
      Support FP16 for more ops (#38123) · 1f445bf3
      sneaxiy 提交于
      * support FP16 for more ops
      
      * add amp list tests
      
      * refine reduce_mean_grad
      
      * fix OP benchmark ci
      
      * fix fp16 reduce_mean
      
      * updat ut, but still have some problems
      
      * remove mean/reduce_mean fp16 kernel
      1f445bf3
  2. 15 9月, 2021 1 次提交
  3. 17 6月, 2021 1 次提交
  4. 03 6月, 2021 1 次提交
  5. 07 5月, 2021 1 次提交
  6. 28 4月, 2021 1 次提交
  7. 22 4月, 2021 1 次提交
  8. 08 4月, 2021 1 次提交
  9. 22 3月, 2021 1 次提交
  10. 05 2月, 2021 1 次提交
  11. 08 1月, 2021 1 次提交
    • Z
      Support pure fp16 training for AMP API. (#29544) · 7f7dfccf
      Zhen Wang 提交于
      * add cast ops before and after unsupported fp16 ops.
      
      * Keep partial net in FP32 pattern.
      
      * Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.
      
      * Add fp16 support for adam op.
      
      * add multi precision attr for adam.
      
      * Fix the bug of test_multi_precision_fp16_train UT.
      
      * Code format for CI.
      
      * Fix the redefine error about MPTypeTrait on windows.
      
      * fix bugs of the _create_accumulators func in Momentum.
      
      * fix bug when inserting post cast op.
      
      * Add the update_loss_scaling op in allow_set of UnusedVarCheck.
      
      * Update for ci coverage.
      
      * Add some doc for OptimizerWithMixedPrecision.
      
      * Fix the code style.
      
      * Imporve the doc of `amp_init`.
      
      * Change for fp16 testing if users have the infer program defined in separate way.
      7f7dfccf
  12. 21 12月, 2020 2 次提交
  13. 16 12月, 2020 1 次提交
  14. 02 12月, 2020 1 次提交
    • Z
      Add pure fp16 training with master weights. (#27712) · be3777a5
      Zhen Wang 提交于
      * add the weight decay func for the momentum op
      
      * Add the multi_precision function in Momentum Optimizer.
      
      * Make sure that the initial value of master weights are same with the fp16 weights.
      
      * add static loss scaling.
      
      * add the rescale_grad function in the pure fp16 training.
      
      * use the original momentum updating method.
      
      * Polish some codes, such as variable names.
      
      * add docstring for apis.
      
      * update the var creation details of _create_master_weight.
      
      * not modify codes about imperative momentum updating.
      
      * Fix the error of test_dist_sparse_tensor_load_momentum UT.
      
      * add unit test for multi precision fp16 training.
      
      * add more unit tests for CI.
      
      * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
      
      * For CI Coverage Checking.
      be3777a5
  15. 18 11月, 2020 1 次提交
  16. 08 11月, 2020 1 次提交
    • Y
      exec ut no more than 15s 1 (#28439) · ba075632
      YUNSHEN XIE 提交于
      * disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
      
      * test for limiting ut exec time as 15S
      
      * fix an error caused by cannot find ut
      
      * fix some error
      
      * can not find test_transformer
      
      * fix error caused by ut not run in windows
      
      * fix error caused by Compiler Options
      
      * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
      
      * setting timeout value to 120s for old ut
      
      * add the timeout value setting
      
      * fix error caused by ut only run in coverage_ci
      
      * add analyzer_transformer_profile_tester
      
      * fix some error
      
      * fix some error
      
      * fix error with inference option
      
      * fix error with inference option setting as ON_INFER
      
      * add some ut to set timeout
      
      * modified some option
      
      * fix error
      
      * fix some timeout error
      
      * fix error
      
      * fix error
      
      * fix timeout for test_analyzer_bfloat16_resnet50
      
      * fix error
      
      * setting timeout properity for some ut
      
      * first pr for new ut timeout as 15S
      ba075632
  17. 04 11月, 2020 1 次提交
  18. 21 10月, 2020 2 次提交
    • C
      fix test_weight_decay_extend error (#28178) · 5d73bfdb
      Chen Weihang 提交于
      5d73bfdb
    • C
      2.0rc api rename (#28088) · 7c1aa0d6
      cnn 提交于
      * rename manual_seed to seed
      
      * rename xxx1d-->xxx1D, xxx2d-->xxx2D, xxx3d-->xxx3D
      
      * rename manual_seed --> seed
      
      * do not rename .cc, .cu and .h file
      
      * rename manual_seed --> seed
      
      * rename manual_seed --> seed
      
      * rename manual_seed --> seed
      
      * rename manual_seed --> seed
      
      * disable_static on doc example code
      
      * donot change manual_seed on generator
      
      * add enable_static on sample code
      
      * convert python/paddle/fluid/layers/nn.py to bak
      
      * fix typo
      
      * fix code style
      
      * fix seed to manual_seed when call functions of Generator()
      
      * fix bug
      7c1aa0d6
  19. 24 9月, 2020 1 次提交
  20. 22 9月, 2020 1 次提交
    • P
      Use dygraph mode by default (#27443) · 827ac36f
      pangyoki 提交于
      * default open dygraph mode
      
      * fix CI-Mac
      
      * fix Mac-CI other unittest file
      
      * fix CI-Py3
      
      * fix test_communicator_geo and test_buffer_shared_memory_reuse_pass
      
      * add enable_static to fix CI-Py3
      
      * add enable_static to fix CI-coverage
      
      * delete try except
      827ac36f
  21. 08 9月, 2020 1 次提交
  22. 28 8月, 2020 1 次提交
    • L
      Refine paddle.manual_seed (#26496) · 844583c8
      Leo Chen 提交于
      * refine manual seed
      
      * fix ci problem
      
      * fix unittests
      
      * fix unittest
      
      * set is_init_py=false in manual_seed
      
      * fix unittest
      
      * fix bernoulli_op
      
      * fix(unittest): change random_seed to manual_seed
      
      * 🐞fix(unittest): fix manual_seed
      
      * trigger ci
      
      * fix test_sentiment
      
      * fix test_imperative_save_load
      
      * fix test_uniform_random_op
      
      * fix test_uniform_random_op
      
      * fix test_jit_save_load
      
      * merge develop
      
      * fix manual_seed
      
      * fix manual_seed
      
      * use global engine
      
      * use shared_ptr
      
      * fix double free
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix test bug
      
      * fix test bug
      
      * fix test bug
      
      * fix ci
      844583c8
  23. 17 8月, 2020 1 次提交
    • Z
      set default python3,fix incompatible,cache dir for third party,unify error... · 5017aa76
      Zhou Wei 提交于
      set default python3,fix incompatible,cache dir for third party,unify error code,for windows (#26178)
      
      * set default python3 for paddle windows,test=win
      
      * set default python3,cache dir for third party,error code,test=win
      
      * fix some incompatible
      
      * fix some error
      
      * set virtual environment,test=win
      5017aa76
  24. 15 4月, 2020 1 次提交
  25. 28 2月, 2020 1 次提交
  26. 30 10月, 2019 1 次提交
  27. 15 10月, 2019 1 次提交
  28. 19 9月, 2019 1 次提交
  29. 10 9月, 2019 1 次提交
  30. 06 9月, 2019 1 次提交
  31. 16 8月, 2019 1 次提交
  32. 27 6月, 2019 1 次提交
  33. 19 6月, 2019 1 次提交
  34. 12 6月, 2019 1 次提交
    • H
      Cherry-pick: fix random CI failure. (#18011) · 0bf25351
      Huihuang Zheng 提交于
      * Cherry-pick fix random Python3 CI failure.
      
      In some tests, SWEs used "print('xxx').format('xxx')". The syntax
      is only supported in Python2, not python3. However, since those
      lines are related to data download, if the CI machines already have
      the data, it passes CI tests. That causes random failure.
      
      * Cherry-pick: disable CUDNN case of test_warpctc_op
      
      Also temporary disable a unit test. The test will be fixed under high priority.
      0bf25351
  35. 25 5月, 2019 1 次提交
  36. 21 5月, 2019 1 次提交
    • Y
      Add LAMB Optimizer support (#17489) · f9796b12
      Yibing Liu 提交于
      * Add LAMB optimizer
      
      * Expose LAMB Optimizer's APIs
      
      test=develop, test=document_preview
      
      * Cleanup code & doc
      
      test=develop, test=document_preview
      
      * Update lamb optimizer's formula
      
      test=develop
      f9796b12
  37. 16 5月, 2019 1 次提交
    • J
      init auto loss scaling (#17194) · 30e178fa
      Jie Fang 提交于
      * init auto loss scaling
      
      test=develop
      
      * change API.spec
      
      * change ifelse to switch and use reduce_sum to optimize checking isfinite
      
      test=develop
      
      * Remove redundant code
      
      test=develop
      30e178fa
  38. 30 4月, 2019 1 次提交
    • G
      resolve #17159 (#17172) · e4a52e08
      guomingz 提交于
      Update the folder name generation mechanism for saving the quantized model and weights.
      The folder name would be unique by adding the timestamp postfix.
      
      test=develop
      e4a52e08