- 20 Dec, 2021 1 commit
-
-
Committed by sneaxiy
* support FP16 for more ops * add amp list tests * refine reduce_mean_grad * fix OP benchmark ci * fix fp16 reduce_mean * update ut, but still have some problems * remove mean/reduce_mean fp16 kernel
-
- 15 Sep, 2021 1 commit
-
-
Committed by 王明冬
* clip op extra information when exporting model, test=ocr * rename clip_extra parameter to kwargs in save_inference_model, test=ocr
-
- 17 Jun, 2021 1 commit
-
-
Committed by Zhen Wang
-
- 03 Jun, 2021 1 commit
-
-
Committed by Yuang Liu
-
- 07 May, 2021 1 commit
-
-
Committed by joanna.wozna.intel
* Add casting initializers for bf16 training * Changes after review * Correct test and add comment
-
- 28 Apr, 2021 1 commit
-
-
Committed by arlesniak
-
- 22 Apr, 2021 1 commit
-
-
Committed by tianshuo78520a
-
- 08 Apr, 2021 1 commit
-
-
Committed by Zhen Wang
* Use the runtime to create the unsupported_fp16_list used in AMP. * Add more info about supported ops. * Add some comments for the function of OpSupportedInfos. * Fix the unit test of test_multi_precision_fp16_train.
-
- 22 Mar, 2021 1 commit
-
-
Committed by arlesniak
-
- 05 Feb, 2021 1 commit
-
-
Committed by Zhen Wang
* Use correct master weights in AdamW. * Just modify the master weight. * Update for CI Coverage.
-
- 08 Jan, 2021 1 commit
-
-
Committed by Zhen Wang
* add cast ops before and after unsupported fp16 ops. * Keep partial net in FP32 pattern. * Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode. * Add fp16 support for adam op. * add multi precision attr for adam. * Fix the bug of test_multi_precision_fp16_train UT. * Code format for CI. * Fix the redefine error about MPTypeTrait on windows. * fix bugs of the _create_accumulators func in Momentum. * fix bug when inserting post cast op. * Add the update_loss_scaling op in allow_set of UnusedVarCheck. * Update for ci coverage. * Add some doc for OptimizerWithMixedPrecision. * Fix the code style. * Improve the doc of `amp_init`. * Change for fp16 testing if users have the infer program defined in a separate way.
-
- 21 Dec, 2020 2 commits
- 16 Dec, 2020 1 commit
-
-
Committed by huangxu96
* add static.amp into setup.py.in * add unittest for api
-
- 02 Dec, 2020 1 commit
-
-
Committed by Zhen Wang
* add the weight decay func for the momentum op * Add the multi_precision function in Momentum Optimizer. * Make sure that the initial value of master weights is the same as the fp16 weights. * add static loss scaling. * add the rescale_grad function in the pure fp16 training. * use the original momentum updating method. * Polish some codes, such as variable names. * add docstring for apis. * update the var creation details of _create_master_weight. * not modify codes about imperative momentum updating. * Fix the error of test_dist_sparse_tensor_load_momentum UT. * add unit test for multi precision fp16 training. * add more unit tests for CI. * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT. * For CI Coverage Checking.
-
- 18 Nov, 2020 1 commit
-
-
Committed by Chen Weihang
* add debugging code * change seed & add debug message
-
- 08 Nov, 2020 1 commit
-
-
Committed by YUNSHEN XIE
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix * test for limiting ut exec time as 15S * fix an error caused by cannot find ut * fix some error * can not find test_transformer * fix error caused by ut not run in windows * fix error caused by Compiler Options * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt * setting timeout value to 120s for old ut * add the timeout value setting * fix error caused by ut only run in coverage_ci * add analyzer_transformer_profile_tester * fix some error * fix some error * fix error with inference option * fix error with inference option setting as ON_INFER * add some ut to set timeout * modified some option * fix error * fix some timeout error * fix error * fix error * fix timeout for test_analyzer_bfloat16_resnet50 * fix error * setting timeout property for some ut * first pr for new ut timeout as 15S
-
- 04 Nov, 2020 1 commit
-
-
Committed by Leo Chen
* skip reader op in mixed_precision decorator * add ut
-
- 21 Oct, 2020 2 commits
-
-
Committed by Chen Weihang
-
Committed by cnn
* rename manual_seed to seed * rename xxx1d-->xxx1D, xxx2d-->xxx2D, xxx3d-->xxx3D * rename manual_seed --> seed * do not rename .cc, .cu and .h file * rename manual_seed --> seed * rename manual_seed --> seed * rename manual_seed --> seed * rename manual_seed --> seed * disable_static on doc example code * do not change manual_seed on generator * add enable_static on sample code * convert python/paddle/fluid/layers/nn.py to bak * fix typo * fix code style * fix seed to manual_seed when call functions of Generator() * fix bug
-
- 24 Sep, 2020 1 commit
-
-
Committed by Chen Weihang
-
- 22 Sep, 2020 1 commit
-
-
Committed by pangyoki
* default open dygraph mode * fix CI-Mac * fix Mac-CI other unittest file * fix CI-Py3 * fix test_communicator_geo and test_buffer_shared_memory_reuse_pass * add enable_static to fix CI-Py3 * add enable_static to fix CI-coverage * delete try except
-
- 08 Sep, 2020 1 commit
-
-
Committed by LielinJiang
* add correlation api to contrib
-
- 28 Aug, 2020 1 commit
-
-
Committed by Leo Chen
* refine manual seed * fix ci problem * fix unittests * fix unittest * set is_init_py=false in manual_seed * fix unittest * fix bernoulli_op * fix(unittest): change random_seed to manual_seed * 🐞 fix(unittest): fix manual_seed * trigger ci * fix test_sentiment * fix test_imperative_save_load * fix test_uniform_random_op * fix test_uniform_random_op * fix test_jit_save_load * merge develop * fix manual_seed * fix manual_seed * use global engine * use shared_ptr * fix double free * fix bug * fix bug * fix bug * fix test bug * fix test bug * fix test bug * fix ci
-
- 17 Aug, 2020 1 commit
-
-
Committed by Zhou Wei
set default python3, fix incompatible, cache dir for third party, unify error code, for windows (#26178) * set default python3 for paddle windows, test=win * set default python3, cache dir for third party, error code, test=win * fix some incompatible * fix some error * set virtual environment, test=win
-
- 15 Apr, 2020 1 commit
-
-
Committed by mapingshuo
* allow amp and recompute working together
-
- 28 Feb, 2020 1 commit
-
-
Committed by tianshuo78520a
-
- 30 Oct, 2019 1 commit
-
-
Committed by gongweibao
* add custom black varname test=develop * fix dtype test=develop * fix num test=develop * fix ut test=develop * fix coverage test=develop * fix blackvar names test=develop
-
- 15 Oct, 2019 1 commit
-
-
Committed by gongweibao
-
- 19 Sep, 2019 1 commit
-
-
Committed by Jie Fang
Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus
-
- 10 Sep, 2019 1 commit
-
-
Committed by gongweibao
Fix float16 optimizer
-
- 06 Sep, 2019 1 commit
-
-
Committed by Jie Fang
init new amp, optimize inserting cast op for batchnorm
-
- 16 Aug, 2019 1 commit
-
-
Committed by Tao Luo
* remove unused inference_transpiler unit-tests test=develop * remove InferenceTranspiler usage in quantize_transpiler.py test=develop
-
- 27 Jun, 2019 1 commit
-
-
Committed by 翟飞跃
-
- 19 Jun, 2019 1 commit
-
-
Committed by chengduo
* add multi process reader test=develop
-
- 12 Jun, 2019 1 commit
-
-
Committed by Huihuang Zheng
* Cherry-pick fix random Python3 CI failure. In some tests, SWEs used "print('xxx').format('xxx')". The syntax is only supported in Python2, not Python3. However, since those lines are related to data download, if the CI machines already have the data, it passes CI tests. That causes random failure. * Cherry-pick: disable CUDNN case of test_warpctc_op. Also temporarily disable a unit test. The test will be fixed under high priority.
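The Python 2 versus Python 3 difference described in this commit message can be reproduced in a few lines. This is a minimal sketch, not code from the commit: the function names `py3_safe_message` and `py2_only_pattern` are hypothetical. In Python 3, `print(...)` is a function that returns `None`, so chaining `.format(...)` onto it raises `AttributeError`; formatting the string before printing avoids the problem.

```python
def py3_safe_message(template, value):
    """Format first, then pass the finished string to print()."""
    message = template.format(value)
    print(message)
    return message


def py2_only_pattern(template, value):
    """Run the print(...).format(...) pattern under Python 3 and report the error type."""
    try:
        # print() returns None in Python 3, so .format is looked up on None.
        print(template).format(value)
    except AttributeError as exc:
        return type(exc).__name__
    return None
```

Because the failing line only runs when the data has to be downloaded, a machine with cached data never reaches it, which matches the intermittent CI failures the message describes.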
-
- 25 May, 2019 1 commit
-
-
Committed by chengduo
* add data parallel batch
-
- 21 May, 2019 1 commit
-
-
Committed by Yibing Liu
* Add LAMB optimizer * Expose LAMB Optimizer's APIs test=develop, test=document_preview * Cleanup code & doc test=develop, test=document_preview * Update lamb optimizer's formula test=develop
-
- 16 May, 2019 1 commit
-
-
Committed by Jie Fang
* init auto loss scaling test=develop * change API.spec * change ifelse to switch and use reduce_sum to optimize checking isfinite test=develop * Remove redundant code test=develop
-
- 30 Apr, 2019 1 commit
-
-
Committed by guomingz
Update the folder name generation mechanism for saving the quantized model and weights. The folder name would be unique by adding the timestamp postfix. test=develop
-