- 07 Dec, 2020 2 commits
- 05 Dec, 2020 1 commit
-
-
Committed by myq406450149
* enhance array_to_lod_tensor_op and lod_tensor_to_array_op error information. test=develop
* fix format. test=develop
* format fix. test=develop
* add lod_rank_table. test=develop
* fix format. test=develop
* fix doc info. test=develop
* fix np error
* add unbind dygraph api. test=develop
* fix unbind doc. test=develop
-
- 04 Dec, 2020 6 commits
-
-
Committed by chentianyu03
* add complex64 and complex128 types; add +-*/@ and slice operators for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
* kron, reshape, transpose support complex types
* sum and trace op support complex types
* add test case of sum and trace op
* fix the bug of imag part of complex not initialized
* format file
* format code style
* kron support type promotion; modify test cases
-
Committed by 卖鱼的哲学
* fix expand && concat/transpose to new api
* update uniform_random_op
* update xpu_header
-
Committed by lilong12
-
Committed by QingshuChen
* test=kunlun
-
Committed by liym27
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)
-
Committed by Chen Weihang
* basic impl of type promote
* add comment & another testcase
* fix complex bugs & support python op promote type
* fix failed unittests & polish code
* add unittest for coverage
* change to only promote complex type
* polish code details
* polish several comments
-
- 03 Dec, 2020 9 commits
-
-
Committed by tangwei12
* fix gpu emb out of range Change-Id: I5794ac73bd634d5ea069a6fbbd914274b6d6b7bf
* fix doc Change-Id: I5a3350b2930a9ab2f52116c192b087307faf8fdf
-
Committed by Leo Chen
* use has_grad instead of train_mode
* add vlog for debug
* fix ut
* fix ut
-
Committed by Zhang Ting
* improve performance of elementwise_sum_grad
-
Committed by Shang Zhizhou
* fix tensorrt output shape error
* fix unittest tensorrt_engine_op_test
* fix code style for unittest
-
Committed by Aurelius84
-
Committed by ShenLiang
-
Committed by wangchaochaohu
-
Committed by ShenLiang
-
Committed by wanghuancoder
-
- 02 Dec, 2020 8 commits
-
-
Committed by Wilber
-
Committed by ShenLiang
-
Committed by Zhen Wang
-
Committed by Leo Chen
-
Committed by Zhen Wang
* add the weight decay func for the momentum op
* Add the multi_precision function in Momentum Optimizer.
* Make sure that the initial values of the master weights are the same as the fp16 weights.
* add static loss scaling.
* add the rescale_grad function in the pure fp16 training.
* use the original momentum updating method.
* Polish some code, such as variable names.
* add docstring for apis.
* update the var creation details of _create_master_weight.
* do not modify code for imperative momentum updating.
* Fix the error of test_dist_sparse_tensor_load_momentum UT.
* add unit test for multi precision fp16 training.
* add more unit tests for CI.
* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
* For CI Coverage Checking.
-
Committed by Wojciech Uss
-
Committed by furnace
* add fp16 for layer_norm op
* revert layernorm api
* fix forward
* fix forward
* fix backward for layernorm with fp16
* fix unit test for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
* 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
* fix with_mkldnn compile error for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16

Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
-
Committed by Shang Zhizhou
-
- 01 Dec, 2020 10 commits
-
-
Committed by Shang Zhizhou
* add compile option WITH_TENSORRT
* add WITH_TENSORRT to ci paddle_build.sh
* add WITH_TENSORRT to paddle_build.sh
* change FATAL to WARNING when TensorRT is not found and WITH_TENSORRT=ON, just to pass ci-py3 temporarily
-
Committed by Leo Chen
* pass stop_gradient for cast op
* improve performance of elementwise_add grad
* use tensor copy async
* dygraph branch
* fix dygraph branch
* add ut
-
Committed by 卖鱼的哲学
* rebase develop
* update deformable_conv op on xpu
* update deformable_conv op on xpu
-
Committed by Chen Weihang
* hot fix compile failed in gcc4.8
* fix failed unittest
-
Committed by GeminiCarrie
* Fix a bug when running on an operating system without "bash."
* add execution condition
* for ci-coverage
-
Committed by ShenLiang
-
Committed by QingshuChen
* update conv2d & softmax to new xpu api
* test=kunlun
* remove useless comments
* test=kunlun
* remote softmax xpu op
* test=kunlun
* update kunlun softmax
* test=kunlun
* update xpu unittest
* test=kunlun
* fix elementwise_grad bug for kunlun
* test=kunlun
-
Committed by chentianyu03
* add complex64 and complex128 types; add +-*/@ and slice operators for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
-
Committed by Zhou Wei
* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor
* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor
* fix coverage
* fix api doc
* fix CI unittest
* fix CI unittest
* fix unittest
* empty tensor doesn't need inner_var_
* fix some error message
-
Committed by Wilber
-
- 30 Nov, 2020 4 commits
-
-
Committed by Adam Osewski
- Make sure that oneDNN memory descriptors are created only once at first iteration.
-
Committed by joanna.wozna.intel
-
Committed by Wilber
-
Committed by 123malin
* fix parameter prefetch & device guard

Co-authored-by: MrChengmo <cmchengmo@163.com>
Co-authored-by: chengmo <chengmo@baidu.com>
-