- 08 1月, 2021 13 次提交
-
-
由 huangxu96 提交于
-
由 Chen Weihang 提交于
-
由 Zhen Wang 提交于
* add cast ops before and after unsupported fp16 ops. * Keep partial net in FP32 pattern. * Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode. * Add fp16 support for adam op. * add multi precision attr for adam. * Fix the bug of test_multi_precision_fp16_train UT. * Code format for CI. * Fix the redefine error about MPTypeTrait on windows. * fix bugs of the _create_accumulators func in Momentum. * fix bug when inserting post cast op. * Add the update_loss_scaling op in allow_set of UnusedVarCheck. * Update for ci coverage. * Add some doc for OptimizerWithMixedPrecision. * Fix the code style. * Imporve the doc of `amp_init`. * Change for fp16 testing if users have the infer program defined in separate way.
-
由 Leo Chen 提交于
* fix dtype of ungenerated grad var * update ut * refine code * set default dtype * fix could_use_cudnn bug * remove debug code * re-implement * fix bug
-
由 Aurelius84 提交于
* fix tensor shape bug * fix op_num * clean code
-
由 liym27 提交于
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid maintain two code (#30227)
-
由 littletomatodonkey 提交于
-
由 liym27 提交于
-
由 liym27 提交于
1. When x is Variable, call nn.shape(x) only in following cases: 1)The shape of x is used in control flow condition. 2)The dim to be used is negetive 2. When x is Variable, but x.shape or x.shape[idx] doesn't contain negetive value, don't convert to paddle.shape()
-
由 Leo Chen 提交于
* change to tensor copy sync * change to tensor copy sync * make copy_to safe when use TensorCopy * refine code * add ut * add cudapinned garbagecollector * add testcase: cpu place -> cuda pinned place
-
由 liym27 提交于
In control flow, don't copy TensorArray from subblock to parent block when TensorArray is created in parent block.
-
由 Chengmo 提交于
* add tensor table
-
由 guofei 提交于
* Quantization supports 2.0 APIs * Fix the error of save_quantized_model
-
- 07 1月, 2021 8 次提交
-
-
由 Chen Weihang 提交于
* simplify prepared op impl to improve performance * fix kunlun compile error * continue fix kunlun compile error * only transform diff place when dtype diff * fix failed unittests * remove useless file * polish impl by review comment
-
由 Chen Weihang 提交于
-
由 WangXi 提交于
-
由 LutaoChu 提交于
-
由 wangchaochaohu 提交于
-
由 Chen Weihang 提交于
* Simplify the options of spawn based on fleetrun * polish details * polish doc details
-
由 tangwei12 提交于
Change-Id: Ia5279b0cbb6a5b3970aff66e9510e0d85efa70ce
-
由 123malin 提交于
* test=develop, add model_average and lookahead
-
- 06 1月, 2021 10 次提交
-
-
由 ceci3 提交于
* fix syncbn convet * add unittest
-
由 Leo Chen 提交于
* add dispenable input 'shape' for core.ops.reshape2 * add dispenable inputs for core.ops.reshape2/expand/slice * add ut
-
由 Zhou Wei 提交于
* Polish and Optimize the print/repr message of all layer * fix some code format
-
由 WeiXin 提交于
-
由 Zhou Wei 提交于
-
由 liym27 提交于
1. when slice_item is a slice: 1) the start of __getitem__ should be std::max(start, 0) if slice 2) the start of __getitem__ should be std::min(end, dim) 2. when slice_item is an integer, it should be in [-dim_len, dim_len) 3. Fix error message to use accurate data
-
由 gongweibao 提交于
-
由 ceci3 提交于
-
由 ceci3 提交于
* add attribute for batch_norm
-
由 Jiaqi Liu 提交于
* fix beam search bug * add dygraph unittest * update dynamic_decode argument doc * add warning info for state which has no lengths attribute
-
- 05 1月, 2021 8 次提交
-
-
由 WeiXin 提交于
* Support storage of large parameters * Reduce the complexity of the unittest * Reduce the complexity of the unittest,commented out unittest for * add unittest for static.save/load * Increase the timeout threshold of 'test_static_save_load' * Increase the timeout threshold of 'test_static_save_load' * Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load' * Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load'
-
由 chentianyu03 提交于
-
由 WangXi 提交于
-
由 wanghuancoder 提交于
-
由 Thunderbrook 提交于
* add topo aware * resource.h * topo aware * format
-
由 gongweibao 提交于
-
由 cc 提交于
* Avoid the scale to be infinity in quant2_int8_mkldnn_pass, test=develop * support quantized model for paddle2.0 dygraph, test=develop
-
由 Chen Weihang 提交于
* set flags_selectedd_gpus for spawn * add cond for unittest * Delete test_no_single_process_using_multi_gpus_in_spawn.py * Update spawn.py * Update nccl_context.cc
-
- 04 1月, 2021 1 次提交
-
-
由 WangXi 提交于
-