- 31 8月, 2021 2 次提交
-
-
由 Yuang Liu 提交于
[cherry-pick][hybrid performance] Grad fuse for gradient merge under pipeline mode (#35004) (#35299)
-
由 Yuang Liu 提交于
[cherry-pick][Hybrid Performance] Move the cast op of AMP which cast fp32 param to fp16 param to the optimizer (#34965) (#35296) Co-authored-by: NWangXi <wangxi16@baidu.com>
-
- 17 8月, 2021 2 次提交
-
-
由 chentianyu03 提交于
* copy boost optional.hpp to paddle * copy boost optional.hpp to paddle * move directions * del fluid/utils * modify .hpp to .h * move directions * modify to paddle::optional * add modification description * format code stype for the files in paddle/utils * format code stype
-
由 Zeng Jinle 提交于
* add inplace passes and tests * update * fix use_cuda undefined fix compile error of op compat * add more ut * fix CPU CI error * check adam unique * fix mac/windows ci, improve coverage * fix ci error * follow weihang's comment * fix BlockDesc::MoveFrom * follow qiuliang's comment * update * follow huihuang's comments
-
- 16 8月, 2021 2 次提交
-
-
由 Fan Zhang 提交于
-
由 joanna.wozna.intel 提交于
* Remove force_fp32_output from elementwise_add quantization * Fix cpu_quantize_placement test * Review related changes
-
- 13 8月, 2021 2 次提交
-
-
由 zyfncg 提交于
* Fix a bug : can't load more than one custom op module * Fix a bug : can't load more than one custom op module * add test for load multiple modules of custom c++ op * add config for Coverage CI
-
由 Zeng Jinle 提交于
-
- 11 8月, 2021 4 次提交
-
-
由 Wangzheee 提交于
* fix_fc_reshape_convert * fix
-
由 Hao Lin 提交于
* Add ext_tensor.slice() API, test=develop * Call Tensor::mutable_data first to fix bugs and add test for writing to sliced tensor * Fix unit test bug * Fix code format problem, test=develop * Fix code format problem * Fix code format problem * strengthen unit test * Use CustomTensorUtils::ShareDataFrom to simplify codes
-
由 lilong12 提交于
* add auto_parallel apis
-
由 hong 提交于
* add not used output var to gc_check_list; test=develop * add useless output to gc check list; test=develop
-
- 10 8月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add any.hpp to utils and replace boost::any with self defined paddle::any * add copy any.hpp to custom op depends * modify any.hpp include path * remove boost from setup.py.in * add copy any.hpp to custom op depends * move any.hpp to paddle/utils/ dirs * move any.h to extension/include direction * copy utils to right directions
-
- 06 8月, 2021 3 次提交
-
-
由 houj04 提交于
-
由 QingshuChen 提交于
* support kunlun black list and add kl1 op * xpu_op_list add device_context dependence
-
由 Qi Li 提交于
-
- 05 8月, 2021 3 次提交
-
-
由 hong 提交于
* first test version * add test exec; * add data transfer; test=develop * add new exec head; * add memcpy; test=develop * add python fetch * add new test * add graph node; test=develop * remove useless new executor test; test=develop * remove gperf dependency; test=develop * fix compile bugs; test=develop * remove useless code; test=develop * remove useless code; test=develop * add uni test; test=develop * polish code; test=develop * polish code; test=develop * add interpreter cmakefile; test=develop * remove useless code; test=develop
-
由 chentianyu03 提交于
* replace boost::algorithm::ends_with with self define ends_with function * remove BOOST macro in certain operators * remove boost::lexical_cast * add test for string_helper * add more test case for string_helper * modify join_string func and test case * fix build_strategy_test failed bug * remove string_helper_test from parallel_UT_rule.py
-
由 王明冬 提交于
-
- 04 8月, 2021 2 次提交
- 03 8月, 2021 2 次提交
-
-
由 QingshuChen 提交于
* support Kunlun2 * support KL2 * support KL2
-
由 zhouweiwei2014 提交于
-
- 02 8月, 2021 2 次提交
-
-
由 Zeng Jinle 提交于
* add basic APIs * add attr_types * follow comments * change pass attr types * add set pass attribute codes * refine PADDLE_THROW
-
由 Huihuang Zheng 提交于
The comment background message is too long, see details at https://github.com/PaddlePaddle/Paddle/pull/34521
-
- 30 7月, 2021 3 次提交
-
-
由 Huihuang Zheng 提交于
-
由 jakpiase 提交于
* test version of matmul_v2 * added matmul_v2 grad kernel * minor changes * minor changes * minor change for CI approval * CI fix * CI fix * added squeeze and squeeze2 kernels * CI fix * CI fix * CI fix * disabled tests when compiled with cuda * added setting format_tag by strides * added sigmoid BF16 FWD/BWD and gelu BF16 BWD * changes after review * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD" This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae. * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op" This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae. * minor change * added reshape1/2 kernels * moved some functions into private block * CI fix * CI fix * CI fix
-
由 wangguanqun 提交于
* add trainer desc config to distributed strategy * code style modified
-
- 29 7月, 2021 5 次提交
-
-
由 Zeng Jinle 提交于
* add fix op run order pass * add ut for fix_op_run_order * fix ci error * improve coverage * improve coverge again and fix cpu test case * follow some comments
-
由 gongweibao 提交于
-
由 Yuang Liu 提交于
-
由 Huihuang Zheng 提交于
As the title
-
由 Leo Chen 提交于
-
- 28 7月, 2021 4 次提交
-
-
由 jiangcheng 提交于
See https://github.com/PaddlePaddle/Paddle/pull/33949 for details
-
由 jiangcheng 提交于
This PR added optional boolean is_parameter and stop_gradient in the VarDesc proto, and remove them during save_inference_model
-
由 Wangzheee 提交于
-
由 jiangcheng 提交于
When Graph has sub-graph, apply pass to it and all sub-graph. And add single test script .
-
- 27 7月, 2021 1 次提交
-
-
由 Aurelius84 提交于
Revert "Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348)" (#34384) This reverts commit 577fdde5.
-
- 26 7月, 2021 1 次提交
-
-
由 danleifeng 提交于
* psgpu:edit cuda remote_streams; test=develop
-
- 23 7月, 2021 1 次提交
-
-
由 Aurelius84 提交于
Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348) This reverts commit 609f8225.
-