- 03 8月, 2021 3 次提交
-
-
由 WangXi 提交于
-
由 QingshuChen 提交于
* support Kunlun2 * support KL2 * support KL2
-
由 wanghuancoder 提交于
-
- 09 7月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 02 7月, 2021 1 次提交
-
-
由 houj04 提交于
-
- 30 6月, 2021 1 次提交
-
-
由 houj04 提交于
* support set_device for NPU. * minor update doc and add more unit test.
-
- 29 6月, 2021 1 次提交
-
-
由 taixiurong 提交于
-
- 24 6月, 2021 1 次提交
-
-
由 houj04 提交于
* in NPU environment, use CPUPlace for missing operators. * in NPU environment, use CPUPlace for missing operators. * fix TensorCopy bug and add unit test. * fix code style. * add more unit tests.
-
- 23 6月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* optimize attr default value, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * fix bug in AttrReader, test=develop * fix bug, test=develop * fix double_grad, test=develop * refine, test=develop * refine, test=develop * fix checker null, test=develop * for test, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
- 21 6月, 2021 1 次提交
-
-
由 cc 提交于
* Combine amp and qat * add unit test
-
- 10 6月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* add check nan of inf for dygraph * add unittest for dygraph * revert error change
-
- 08 6月, 2021 1 次提交
-
-
由 WeiXin 提交于
* replace 'InnerSetOverridedStopGradient' with 'SetOverridedStopGradient'. * improve coverage. * polish error message.
-
- 26 5月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* modify matmul Op to complex template types * remove complex64/128 head file
-
- 12 5月, 2021 1 次提交
-
-
由 liym27 提交于
-
- 11 5月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix find_unused_parameters default value
-
- 10 5月, 2021 1 次提交
-
-
由 Roc 提交于
-
- 01 5月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 30 4月, 2021 2 次提交
-
-
由 WeiXin 提交于
-
由 pangyoki 提交于
* add relu6_ hardsigmoid_ leaky_relu_ Inplace APIs * add softmax_with_cross_entropy_ Inplace API * add clip_ scale_ add_ subtract_ Inplace APIs * add wlist * fix parameter of scale api * add add_n_ Inplace API and remove log_ Inplace API * fix elementwise_add_ and elementwise_sub_ broadcast problem * elementwise inplace api give error message before run the op * use broadcast_shape in elementwise inplace op * add 8 inplace apis that is auto generated * add unittest for all inplace apis * add decorator for inplace apis in static mode * fix windows blas fail of exp inplace api, change array_equal to allclose * add flatten inplace api * add flatten unittest * fix flatten unittest * add decorator * fix grad.numpy in test_pylayer_op * unsupport softmax_with_cross_entropy_ * add test_inplace_softmax_with_cross_entropy to static_mode_white_list * delete __all__ in inplace_utils * delete activation inplace function and add Tensor.inplace_func * change paddle.inplace_ to Tensor.inplace_ * fix little problem * add paddle in inplace_utils
-
- 29 4月, 2021 2 次提交
- 28 4月, 2021 1 次提交
-
-
由 zhulei 提交于
-
- 27 4月, 2021 1 次提交
-
-
由 WeiXin 提交于
* clear 'BasicEngine' when an exception occurs in the backward. * deal with conflict. * deal with conflict.
-
- 26 4月, 2021 2 次提交
- 25 4月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 21 4月, 2021 1 次提交
-
-
由 石晓伟 提交于
* flush denormal in the tracer op, test=develop * add cmake dependencies, test=develop * add a macro, test=develop * fix the windows case, test=develop
-
- 15 4月, 2021 1 次提交
-
-
由 WeiXin 提交于
* custom python backward * polish up the code * polish up the code * polish up the code. * Fix code format and comments. * Delete redundant files. * add unnittest. * edit unnittest. * edit unnittest. * Remove redundant header files. * Improve coverage and remove redundant code. * support saving for backward. * polish code according to comments. * Add support type for PyLayer. * Modify the DOC. * polish Doc. * polish Doc. * polish Doc. * polish Doc. * polish Doc. * polish Doc. * polish code and make the code robust. * Modify the code format.
-
- 14 4月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* add register backward hook method * add leaf grad accumullated test
-
- 13 4月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add layer.to api * add layer.to api * add layer.to api * add the doc for Layer.to * add input type checking * modify assert and import bug * format code style * format code style * make place support str type * add SetGradVarBase method to set the gradient after conversion * modify argument palce to device * modify argument palce to device * modify doc of layers.to API * add xpuplace to device argument
-
- 09 4月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* [feature] support npu allocator (#30840) [feature] support npu allocator * [feature] support npu operator (#30951) [feature] support npu operator * [feature] support npu allocator, part 2 (#30972) * support npu allocator * add npu device context * fix some compile problem * fix some compile problem * add npu info * compile ok * fix include dir * support naive_best_fit_allocator * run ut ok, bug failed to exit * call aclrtResetDevice before exit * fix aclFinilize * add system allocatot test * add selected_gpus in gtest * add tensor_test for npu * support npu op, initial commit * add npu stream * add elementwise_add_op * compile ok * fix typo * fix elementwise_add_op_npu_test * support op run * test can run but failed * change aclopExecuteV2 to aclopCompileAndExecute * support parsing ascend rank table file (#31000) support parsing ascend rank table file * Fix reshape on GE graph. (#31084) Fix reshape on GE graph * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973) * add npu sub op * fix typo * rename test * fix bug * fix bug * add fp16 kernel * fix typo * support sub grad op * support elementwise_sub_grad op Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com> * Fix compilation problem (#31100) Fix compilation problem (#31100) * fix compile * fix code stype * remove const_cast * support adding correct npu op in pybind.h (#31143) * support adding correct npu op in pybind.h * refine code * [NPU] Support executor with NPU (#31057) * [NPU] Support executor with NPU * Fix code according to reviews * Fix code * Add unittest for sub op npu * refactor npu device manager (#31154) refactor npu device manager (#31154) * fix selected npus * fix compile * fix reading flags from env * format Co-authored-by: Nxiayanming <41795079@qq.com> Co-authored-by: Ngongweibao <weibao.gong@gmail.com> Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com> Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
-
- 01 4月, 2021 4 次提交
-
-
由 ShenLiang 提交于
* support control flow * supoort sync_parameters_buffers * fix the bug of sparse embedding
-
由 chentianyu03 提交于
* add custom init grad for backward function * add custom init grad for backward function * handle when the grad_tensor is none * handle when the grad_tensor is none * fix the args type error on windows platform * modify the args order and doc * format code * add grad_tensor to xpu * modify the grad_tensor type check * add paddle.backward api to support multi tensors gradient compute * add paddle.backward api to support multi tensors gradient compute * add paddle.atuograd module and backward api * change tensor.backward func args * modify tensor backward api * remove create_graph intputs args * add doc and examplex code for backward api * when have the same tensor, throw error * modify test Init func args * modify the execute.Init func args in test files * add paddle.autograd package in setup.py.in * modify error msg, remove _run_backward method in class Tensor * add test cases for backward api
-
由 kuizhiqing 提交于
* new group * ci compatible fix * assert nccl
-
由 Chen Weihang 提交于
* refactor and simplify hook design * fix reducer add hook error * add Tensor.register_hook basic impl * refine prepare data impl * revert prepare data change * support register_hook for Tensor * add hook test in model * polish tests and doc example * fix double grad test failed * remove reduce hook func * fix set empty error * polish code by comments * change reduce_hook to mutable_hook * remove useless tmp_ins * fix shape code format error * fix shape code format error
-
- 26 3月, 2021 1 次提交
-
-
由 tianshuo78520a 提交于
* delete include framework.pb.h * fix error
-
- 15 3月, 2021 1 次提交
-
-
由 Kaipeng Deng 提交于
* add dict/str/list supprot for DataLoader. test=develop
-
- 12 3月, 2021 1 次提交
-
-
由 whs 提交于
-
- 09 3月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 05 3月, 2021 1 次提交
-
-
由 liuyuhui 提交于
[Kunlun]Multi xpu dygraph performance optimization , add distributed.spawn support for multi xpu and some bug-fixes (#31130)
-