- 19 Jan, 2021 7 commits
-
-
Committed by ykkk2333
* add rmsprop_op_xpu test=kunlun
* modified rmsprop_op_xpu error code. test=kunlun
-
Committed by Zhou Wei
-
Committed by Zhen Wang
-
Committed by Leo Chen
* unify calling cudaSetDevice
* fix compile
-
Committed by pangyoki
-
Committed by Leo Chen
* support layer_norm fp16 in dygraph amp
* add ut
* refine code
-
Committed by wanghuancoder
-
- 18 Jan, 2021 7 commits
-
-
Committed by hutuxian
-
Committed by hutuxian
-
Committed by liuyuhui
-
Committed by QingshuChen
-
Committed by wanghuancoder
-
Committed by taixiurong
-
Committed by JZ-LIANG
-
- 17 Jan, 2021 1 commit
-
-
Committed by guofei
* Modify the calculation logic of LambOptimizer
-
- 16 Jan, 2021 1 commit
-
-
Committed by Adam Osewski
* Move pass tester helper functions to single common place.
* Use helper functions in two more fuse pass tests.
-
- 15 Jan, 2021 6 commits
-
-
Committed by Zhang Ting
-
Committed by pangyoki
* add view strategy on squeeze,unsqueeze,reshape,flatten
* add squeeze unittest
* add unittests
* use View strategy as name rather than Reuse Allocation
* fix view api doc
* fix format
* use core.ops when input of reshape2 is Tensor
* fix test_cross_entropy_loss error because of reshape2
* fix test_cross_entropy_loss error because of reshape2
* add inplace strategy
* add elementwise_add sub
* let backward op not use inplace
* grad op do not use inplace
* fix memory increase error and add leaf error message
* delete selected_rows
* change op_function
* little change
* solve HandleViewBetweenInputAndOutput
* add unittest and leaf error message
* merge view error
* optimize op_function_generator format and support sum inplace op
* fix format of basic_engine
* fix format for framework
* little change of variable wrapper
* add reshape, squeeze, unsqueeze, scatter api
* add relu elu tanh softmax inplace api
* fix test_squeeze_op unittest
* fix test_relu_op unittest
* fix comment problems
* delete sample code of inplace api
* add reference of grad_pending_nodes in basic_engine
* fix unittest name
* add inplace apis into wlist
* fix error message
* add PADDLE_ENFORCE for set grad op twice
* fix head file error
-
Committed by Yang Zhang
built-in `rsqrt` is shadowed
-
Committed by 石晓伟
-
Committed by Wojciech Uss
-
Committed by wawltor
* fix the rnn mask memory bug for out of read
* update the code for the rnn
-
- 14 Jan, 2021 5 commits
-
-
Committed by taixiurong
-
Committed by ShenLiang
-
Committed by yaoxuefeng
-
Committed by 123malin
* test=develop, add distributed_infer
-
Committed by QingshuChen
-
- 13 Jan, 2021 8 commits
-
-
Committed by cc
* skip quantizing ops in cpu inference, test=develop
-
Committed by alncat
* added support for inference using quantization aware trained dygraph
* added support for inference using quantization aware trained dygraph correct boost get usage
* Delete incorrect warning message (#30196)
* fix warning and no grad
* clean redundant API alias in 2.0 - part 2 (#30013)
* delete paddle.nn.functional.assign
* fix dynamic to static error
* just add the op error message for the matmul xpu (#30246) add the op error message for the matmul xpu
* Add Static Variable Clone (#30208) Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat
* use wget to replace curl to download the lcov file (#30229)
* use wget to replace curl to download the lcov file
* add cache for lcov
* fix test_pool3d_op timeout issue (#30248)
* Fix unittests bugs. (#30250)
* modify error message based on comments (#30189)
* modify error message based on comments
* edit code according to review.
* Correct spelling according to review.
* Fix bug for 'save multiple method' (#30218)
* Fix bug for 'save multiple method'
* To pass coverage.
* edit code to pass coverage.
* edit code to pass coverage.
* add unittest for coverage.
* change for coverage.
* edit for coverage.
* added support for inference using quantization aware trained dygraph
* Alias from paddle.fluid.layers.auc to paddle.static.auc (#30206)
* add alias from fluid.layers.auc to static.auc
* Update __init__.py
* added support for inference using quantization aware trained dygraph correct boost get usage
* corrected boost get usage
* corrected naming issues and enforcing zero check
* correct paddle enforce message
* added more error checkings
* corrected error report message and optimized code
* corrected findvar usage
* corrected paddle_enforce in scope
* correct error messages
* correct error reporting format
Co-authored-by: LielinJiang <50691816+LielinJiang@users.noreply.github.com>
Co-authored-by: XiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
Co-authored-by: Huihuang Zheng <zhhsplendid@gmail.com>
Co-authored-by: YUNSHEN XIE <1084314248@qq.com>
Co-authored-by: Bai Yifan <me@ethanbai.com>
Co-authored-by: gongweibao <weibao.gong@gmail.com>
Co-authored-by: WeiXin <weixin10@baidu.com>
Co-authored-by: Jiaqi Liu <liujiaqi06@baidu.com>
-
Committed by GaoWei8
* softmax backward optimize
-
Committed by Zhang Jun
* fix bug on compiling inference shared lib with crypto;test=develop
* fix cmake bug when build inference lib using -DWITH_CRYPTO=OFF
* update cmake
* remove unnecessary enforce message
-
Committed by Huihuang Zheng
The usleep function in <unistd.h> only accepts an argument less than 1,000,000. The current call can exceed this limit, so it has to be fixed. This PR can fix a random CI error.
-
Committed by Leo Chen
Set expected place in child thread for dataloader to avoid consuming CUDA memory on other cards (#30338)
* set expected place in child thread for dataloader
* set device id when set tensor from numpy
* revert tensor_py change
* add compile guard
* fix ci
* fix bug
-
Committed by QingshuChen
* optimize memcpy perf for kunlun
* remove useless unit test for kunlun mean
* minor
-
Committed by ShenLiang
-
- 12 Jan, 2021 5 commits
-
-
Committed by JZ-LIANG
-
Committed by lidanqing
-
Committed by Chen Weihang
-
Committed by tangwei12
* add sparse embedding & load vars for 2.0
Change-Id: I36b59ed5f015189dc9d9d2e34a9357722d369f1b
* fix hdfs gloo
Change-Id: Ia84d579053720ad804183e54c9a04b4f031c79c6
* fix gloo hdfs
Change-Id: I5ab982fd483cddc10adcdef0b8aa83aca976cb9e
* move loadvar/sparse embedding from incubate to static
Change-Id: I57081d3545ad2efab78c72420d2162c0eacaf3a0
-
Committed by tangwei12
* rename sendrecv.proto to namespace paddle.distributed
* split ps with distributed
-