- 08 12月, 2020 4 次提交
 - 
- 
由 liuyuhui 提交于
* add deformable_conv op on xpu (#29234) * rebase develop * update deformable_conv op on xpu * update deformable_conv op on xpu * update kunlun conv2d/softmax/elementwise implemetation (#29229) * update conv2d & softmax to new xpu api * test=kunlun * remove useless comments * test=kunlun * remote softmax xpu op * test=kunlun * update kunlun softmax * test=kunlun * update xpu unitest * test=kunlun * fix elementwise_grad bug for kunlun *test=kunlun * support global pooling for kunlun (#29293) * test=kunlun * update reduce_sum op on xpu (#29367) * update reduce_sum op on xpu * update reduce_sum op on xpu * support running on xpu * fix expand/uniform_random && concat/transpose to new api on xpu (#29280) * fix expand && concat/transpose to new api * update uniform_random_op * update xpu_header * 1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448) Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com> Co-authored-by: N卖鱼的哲学 <tangzhiyi11@users.noreply.github.com> Co-authored-by: NQingshuChen <qingshu.chen714@gmail.com> Co-authored-by: Ntaixiurong <taixiurong@126.com> Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>
 - 
由 ShenLiang 提交于
* fix the bug of reducer in embedding
 - 
由 lilong12 提交于
* update, test=develop (#29331)
 - 
由 Zhang Ting 提交于
 
 - 
 - 07 12月, 2020 4 次提交
 - 
- 
由 Shang Zhizhou 提交于
* fix tensorrt unittest precision error * fix unittest precision error. test_trt_subgraph_pass && test_trt_dynamic_shape_transformer_prune
 - 
由 Bai Yifan 提交于
* add deform_conv2d,DeformConv2D
 - 
由 chajchaj 提交于
 - 
由 chentianyu03 提交于
* rm complexvariable * modify test_var_base unittest * remove duplicated codes
 
 - 
 - 05 12月, 2020 2 次提交
 - 
- 
由 liym27 提交于
[cherri-pick] Fix bug: delete wrong check_type of paddle.concat and support LoDTensorArray (#29306) (#29368)
 - 
由 chentianyu03 提交于
* fix random failed of complex matmul * Make transpose, trace, kron, reshape, sum op support complex type (#29321) * add complex64 and complex128 type; add +-*/@ and slice opreator for complex types * add test cases for complex elementwise, matmul and getitem unittest * add test cases for complex types * add test cases for complex matmul unittest * kron, reshape, transpose support complex types * sum and trace op support complex types * add test case of sum and trace op * fix the bug of imag part of complex not initialized * format file * format code style * kron support type promotion; modify test cases
 
 - 
 - 04 12月, 2020 6 次提交
 - 
- 
由 Huihuang Zheng 提交于
Reduce exception type so that if covert_to_static failed, it reports right error message.
 - 
由 liym27 提交于
[cherry-pick 2.0rc1][inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267) (#29359)
 - 
由 liym27 提交于
[cherry-pick 2.0rc1][Dy2Stat] Fix bug: Do not use gast.Subscript to replace gast.Name in when transforming for_enumerate_loop (#29310) (#29361)
 - 
由 Chen Weihang 提交于
* basic impl of type promote * add comment & another testcase * fix complex bugs & support python op promote type * fix failed unittests & polish code * add unittest for coverage * change to only promote complex type * polish code details * polish several comments
 - 
由 liym27 提交于
[Cheery-Pick 2.0.0-rc1][Dy2stat] Add a decorator paddle.jit.not_to_static to support that not to convert a function in Dynamic-to-Static. (#29253) (#29340) Usage scenarios:A function could have run successfully in static mode, you can use it to decorate a function in the following cases: 1. An unknown error occurs in the dynamic-to-static conversion process of the function; 2. In the internal implementation of the function, it has two branches: dynamic branch and static branch; 3. Users don't want to convert the function in the process of dynamic to static.
 - 
由 Leo Chen 提交于
* use has_grad instead of train_mode * add vlog for debug * fix ut * fix ut
 
 - 
 - 03 12月, 2020 4 次提交
 - 
- 
由 LielinJiang 提交于
* move temporal_shift to functional
 - 
由 ShenLiang 提交于
* Change the api of DataParallel and Fleet (#29224)
 - 
由 Zhen Wang 提交于
* Add pure fp16 training with master weights. (#27712) * add the weight decay func for the momentum op * Add the multi_precision function in Momentum Optimizer. * Make sure that the initial value of master weights are same with the fp16 weights. * add static loss scaling. * add the rescale_grad function in the pure fp16 training. * use the original momentum updating method. * Polish some codes, such as variable names. * add docstring for apis. * update the var creation details of _create_master_weight. * not modify codes about imperative momentum updating. * Fix the error of test_dist_sparse_tensor_load_momentum UT. * add unit test for multi precision fp16 training. * add more unit tests for CI. * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
 - 
由 Huihuang Zheng 提交于
Cherry-pick of PR #29226
 
 - 
 - 02 12月, 2020 2 次提交
 - 
- 
由 chentianyu03 提交于
 - 
由 Chen Weihang 提交于
* hot fix complle failed in gcc4.8 * fix failed unittest
 
 - 
 - 01 12月, 2020 4 次提交
 - 
- 
由 Jiawei Wang 提交于
* add lamb optimizer and unittest * fix momentum resume training * fix momentum acc
 - 
由 wanghuancoder 提交于
 - 
由 chentianyu03 提交于
* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types * add test cases for complex elementwise, matmul and getitem unittest * add test cases for complex types * add test cases for complex matmul unittest
 - 
由 Zhou Wei 提交于
* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor * The leaf tensor concept is exposed and the gradient accumulation of leaf tensor * fix coverage * fix api doc * fix CI unittest * fix CI unittest * fix unitest * empty tensor does’t need inner_var_ * fix some error message
 
 - 
 - 30 11月, 2020 11 次提交
 - 
- 
由 Chen Weihang 提交于
 - 
由 hong19860320 提交于
 - 
由 123malin 提交于
* fix paramete prefetch & device guard Co-authored-by: NMrChengmo <cmchengmo@163.com> Co-authored-by: Nchengmo <chengmo@baidu.com>
 - 
由 liym27 提交于
* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable. * Add a new attribute `_inplace_version` for VarBase. * Raise exception if an inplace operation can result in incorrect gradient computation. * Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation. * For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode. * Use original var_wrapper if the inplace_version is not changed. * Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.
 - 
由 joejiong 提交于
As the title
 - 
由 WangXi 提交于
 - 
由 Chen Weihang 提交于
* fix failed tests in yingchun gived list * add unittests into static_mode_white_list * add enable static * fix dist unittest * skip test_sigmoid_focal_loss_op & add gym * revert no need skip unittests * remove gym
 - 
由 Thunderbrook 提交于
 - 
由 liym27 提交于
 - 
由 WeiXin 提交于
* Changed a variable name error * Add comments * Move member functions of TranslatedLayer out of function * edit code according to review * Edit input argument of '_run_static_graph' * reset due to Segmentation fault * rename variables when stitching graph * modify code according CI * Add comments to '__i_m_p_l__' * remove blanks befor 'Get...' * edit code according to review * Add a comment to '_execution_method_creator' * Edit a comment to '_execution_method_creator'
 - 
由 wanghuancoder 提交于
* Generate code coverage reports only for incremental files, test=develop * Generate code coverage reports only for incremental files, test=develop * Generate code coverage reports only for incremental files, test=develop * test for diff python file, test=develop * fix no python diff report, test=develop * add cc test file, test=develop * fix bug in generic.cmake, test=develop * for debug no cc report, test=develp * modify compire branch form test_pr to test, test=develop * fix bug, test=develop * test for h file changed, test=develop * debug for redefinition of argument optimize error, test=develop * close -o3 for test, test=develop * remove -o3 for test, test=develop * remove coverage option for nvcc, test=develop * use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop * reopen -o3, test=develop * remove debug code, test=develop * remove unused code, test=develop
 
 - 
 - 28 11月, 2020 3 次提交
 - 
- 
由 Huihuang Zheng 提交于
test_mnist failed on CUDA11. We found that it is due to PaddleInference IR Optimization after debugging. We disable it in this PR and we will re-enable it after PaddleInference fixes it.
 - 
由 liym27 提交于
 - 
由 liym27 提交于
[Dy2Stat] Fix bug: the return statement should be transformed to an equivalent Paddle/Python if statement, which depends on if conditions of the return stmt. (#29165)
 
 -