- 07 Jan, 2021 1 commit
Committed by furnace
* Layer norm fp16 (#29169)
* add fp16 for layer_norm op
* revert layernorm api
* fix forward
* fix forward
* fix backward for layernorm with fp16
* fix unit test for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
* 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
* fix with_mkldnn compile error for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
* fix layer_norm accuracy (#29434)
* Layernorm opt (#29522)
* layernorm fw opt
* layernorm bw opt
* fix typo, test=develop
* remove const dim3 for windows CI compatibility
* merge develop
Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
* Fix compile problem when cuda_arch < 6000 (#29576)
* fix compile problem when cuda_arch < 6000
* refine code
* refine code
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
-
- 05 Jan, 2021 1 commit
Committed by cc
* fix infinite scale values (#29386)
* Support dygraph quant model (#29927)
* Avoid the scale to be infinity in quant2_int8_mkldnn_pass, test=develop
* support quantized model for paddle2.0 dygraph, test=develop
Co-authored-by: Wojciech Uss <wojciech.uss@intel.com>
-
- 29 Dec, 2020 1 commit
Committed by XiaoguangHu
* [cherry-pick] cherry-pick of PR#29928
* delete paddle.metric.chunk_eval and paddle.metric.mean_iou
* delete paddle.nn.clip and paddle.nn.clip_by_norm
* delete paddle.nn.functional.activation.hard_sigmoid and paddle.nn.functional.activation.hard_swish
* [cherry-pick] cherry-pick of PR#29928
* fix extension import error
-
- 09 Dec, 2020 1 commit
Committed by Aurelius84
-
- 03 Dec, 2020 1 commit
Committed by Zhen Wang
* Add pure fp16 training with master weights. (#27712)
* add the weight decay func for the momentum op
* Add the multi_precision function in Momentum Optimizer.
* Make sure that the initial values of master weights are the same as the fp16 weights.
* add static loss scaling.
* add the rescale_grad function in the pure fp16 training.
* use the original momentum updating method.
* Polish some codes, such as variable names.
* add docstring for apis.
* update the var creation details of _create_master_weight.
* not modify codes about imperative momentum updating.
* Fix the error of test_dist_sparse_tensor_load_momentum UT.
* add unit test for multi precision fp16 training.
* add more unit tests for CI.
* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
-
- 01 Dec, 2020 1 commit
Committed by Leo Chen
-
- 30 Nov, 2020 2 commits
Committed by WangXi
-
Committed by Wojciech Uss
-
- 27 Nov, 2020 1 commit
Committed by guofei
* Optimize the unittest test_imperative_out_scale test=develop
-
- 26 Nov, 2020 1 commit
Committed by Aurelius84
-
- 25 Nov, 2020 1 commit
Committed by huangxu96
* Implement 2.0 API version Conv2d and Linear layer quantization in imperative mode.
* use cudnn softmax in static Lenet
* Modified ChannelwiseQAT Unittest for 2.0 API.
* For CI python coverage.
-
- 24 Nov, 2020 1 commit
Committed by Leo Chen
* upgrade comment string to raw string
* fix string in
* fix string with ' '
* revert update on comments
* upgrade only necessary
* fix sample code checker
* fix comments with '''
-
- 23 Nov, 2020 1 commit
Committed by furnace
* refactor momentum op to combine weight_decay (scale op and sum op)
-
- 18 Nov, 2020 3 commits
Committed by Chen Weihang
* add debugging code
* change seed & add debug message
-
Committed by Bai Yifan
* support user-defined quant and preprocess
-
Committed by Leo Chen
* add matmul_v2 to amp list
* support dygraph
-
- 16 Nov, 2020 1 commit
Committed by cc
-
- 08 Nov, 2020 1 commit
Committed by YUNSHEN XIE
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
* test for limiting ut exec time as 15S
* fix an error caused by cannot find ut
* fix some error
* can not find test_transformer
* fix error caused by ut not run in windows
* fix error caused by Compiler Options
* fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
* setting timeout value to 120s for old ut
* add the timeout value setting
* fix error caused by ut only run in coverage_ci
* add analyzer_transformer_profile_tester
* fix some error
* fix some error
* fix error with inference option
* fix error with inference option setting as ON_INFER
* add some ut to set timeout
* modified some option
* fix error
* fix some timeout error
* fix error
* fix error
* fix timeout for test_analyzer_bfloat16_resnet50
* fix error
* setting timeout property for some ut
* first pr for new ut timeout as 15S
-
- 04 Nov, 2020 1 commit
Committed by Leo Chen
* skip reader op in mixed_precision decorator
* add ut
-
- 21 Oct, 2020 2 commits
Committed by Chen Weihang
-
Committed by cnn
* rename manual_seed to seed
* rename xxx1d-->xxx1D, xxx2d-->xxx2D, xxx3d-->xxx3D
* rename manual_seed --> seed
* do not rename .cc, .cu and .h file
* rename manual_seed --> seed
* rename manual_seed --> seed
* rename manual_seed --> seed
* rename manual_seed --> seed
* disable_static on doc example code
* do not change manual_seed on generator
* add enable_static on sample code
* convert python/paddle/fluid/layers/nn.py to bak
* fix typo
* fix code style
* fix seed to manual_seed when call functions of Generator()
* fix bug
-
- 14 Oct, 2020 1 commit
Committed by guofei
* Implement the function of OutScaleForTraining/OutScaleForInference in dygraph test=develop
-
- 12 Oct, 2020 2 commits
- 11 Oct, 2020 1 commit
Committed by Chen Weihang
* replace config by kwargs
* change save path from dir to prefix
* fix failed unittests
* revert unittest name change
* polish en docs
* add more tests for coverage
-
- 09 Oct, 2020 1 commit
Committed by LielinJiang
* fix bilateral bug
-
- 01 Oct, 2020 1 commit
Committed by Wojciech Uss
-
- 24 Sep, 2020 1 commit
Committed by Chen Weihang
-
- 23 Sep, 2020 2 commits
Committed by YUNSHEN XIE
-
Committed by Zhang Ting
* add fused_bn_add_relu op
-
- 22 Sep, 2020 1 commit
Committed by pangyoki
* default open dygraph mode
* fix CI-Mac
* fix Mac-CI other unittest file
* fix CI-Py3
* fix test_communicator_geo and test_buffer_shared_memory_reuse_pass
* add enable_static to fix CI-Py3
* add enable_static to fix CI-coverage
* delete try except
-
- 21 Sep, 2020 1 commit
Committed by huangxu96
* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.
* Finished channel-wise quantize strategy in imperative quantization.
* Added Cuda code of ChannelWiseQuantDequantMaxAbsOP Add Cuda code of ChannelWiseQuantDequantMaxAbsOp
* Add quant_axis for channel_wise quant.
* fixed a bug in unittests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement.
* Added some assert information and fixed some coding style mistakes.
-
- 18 Sep, 2020 1 commit
Committed by Zhen Wang
-
- 15 Sep, 2020 1 commit
Committed by cc
* Remove the cache in post_training_quantization, test=develop
-
- 14 Sep, 2020 1 commit
Committed by Zhen Wang
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)
* update amp_check_finite_and_scale_op for static_amp.
* use amp_check_finite_and_scale in static graph amp.
* update grads to zero when grads own infinite values (as for amp_check_finite_and_scale op).
* add update_loss_scaling op in cpp.
* add update_loss_scaling_op unit test.
* update the doc of the check_finite_and_unscale op
* Update the process of gradients updating skipping if the gradients have infinite values.
* update the way to zero grads.
* update test_update_loss_scaling_op.py
* add log info when find infinite grads.
* add the unit test for UpdateLossScaling Layer.
-
- 10 Sep, 2020 1 commit
Committed by Zhen Wang
-
- 08 Sep, 2020 1 commit
Committed by LielinJiang
* add correlation api to contrib
-
- 07 Sep, 2020 1 commit
Committed by Sylwester Fraczek
* fix dimensions error for mobilenetv1_KL_quant fixes AssertionError: The size of weight scales vector (1000) does not match the number of output channels (1024) in the weights tensor fc7_weights. add mul test
* remove comment
* add third case unit test
-
- 03 Sep, 2020 1 commit
Committed by Zhen Wang
-
- 02 Sep, 2020 1 commit
Committed by YUNSHEN XIE
-