- 02 Aug, 2021 1 commit
-
-
Committed by Huihuang Zheng
The comment background message is too long, see details at https://github.com/PaddlePaddle/Paddle/pull/34521
-
- 30 Jul, 2021 3 commits
-
-
Committed by Huihuang Zheng
-
Committed by jakpiase
* test version of matmul_v2 * added matmul_v2 grad kernel * minor changes * minor changes * minor change for CI approval * CI fix * CI fix * added squeeze and squeeze2 kernels * CI fix * CI fix * CI fix * disabled tests when compiled with cuda * added setting format_tag by strides * added sigmoid BF16 FWD/BWD and gelu BF16 BWD * changes after review * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD" This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae. * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op" This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae. * minor change * added reshape1/2 kernels * moved some functions into private block * CI fix * CI fix * CI fix
-
Committed by wangguanqun
* add trainer desc config to distributed strategy * code style modified
-
- 29 Jul, 2021 5 commits
-
-
Committed by Zeng Jinle
* add fix op run order pass * add ut for fix_op_run_order * fix ci error * improve coverage * improve coverage again and fix cpu test case * follow some comments
-
Committed by gongweibao
-
Committed by Yuang Liu
-
Committed by Huihuang Zheng
As the title
-
Committed by Leo Chen
-
- 28 Jul, 2021 4 commits
-
-
Committed by jiangcheng
See https://github.com/PaddlePaddle/Paddle/pull/33949 for details
-
Committed by jiangcheng
This PR adds optional boolean fields is_parameter and stop_gradient to the VarDesc proto, and removes them during save_inference_model
-
Committed by Wangzheee
-
Committed by jiangcheng
When a Graph has sub-graphs, apply the pass to it and to all of its sub-graphs. Also add a single test script.
-
- 27 Jul, 2021 1 commit
-
-
Committed by Aurelius84
Revert "Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348)" (#34384) This reverts commit 577fdde5.
-
- 26 Jul, 2021 1 commit
-
-
Committed by danleifeng
* psgpu:edit cuda remote_streams; test=develop
-
- 23 Jul, 2021 1 commit
-
-
Committed by Aurelius84
Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348) This reverts commit 609f8225.
-
- 22 Jul, 2021 2 commits
-
-
Committed by Aurelius84
* modify into program_id * fix cache_info declare problem * fix python int to C long problem * modify point to reference * add ENVS
-
Committed by 王明冬
-
- 21 Jul, 2021 1 commit
-
-
Committed by 王明冬
-
- 20 Jul, 2021 3 commits
-
-
Committed by Pei Yang
-
Committed by Huihuang Zheng
Add boost as a dependency to fix a random compilation failure. The failure occurred because program_processing.cc used boost, but boost was not listed in DEPS in the CMakeLists.txt
-
Committed by WangXi
-
- 19 Jul, 2021 2 commits
-
-
Committed by 李季
-
Committed by yaoxuefeng
-
- 16 Jul, 2021 3 commits
-
-
Committed by levi131
As the title says, this PR converts all blocks in the program into SSA sub-graphs; the conversion is guarded by a flag
-
Committed by Aurelius84
* Add NoNeedBufferVarsInferer * fix code style
-
Committed by Fan Zhang
-
- 15 Jul, 2021 5 commits
-
-
Committed by danleifeng
-
Committed by huangxu96
This PR creates a class to process the program at the C++ level. Currently, this class has one class method: GetInputsOutputsInBlock()
-
Committed by Zhanlue Yang
* Add DCU backend support for custom ops * Added checks for DeviceCopy and renamed some macros
-
Committed by 王明冬
[pass enhance] make the attribute check apply only to attributes defined in the op proto. test=develop (#34146)
-
Committed by Aurelius84
* Refine Constructor logic of ParallelExecutor * Replace executor with ParallelExecutor in run_program_op
-
- 14 Jul, 2021 2 commits
-
-
Committed by Leo Chen
* adam add input SkipUpdate * add unittest * add npu unittest * fix xpu compile * remove param stream
-
Committed by zhouweiwei2014
* Support sccache to speed up compilation on Windows * Support sccache to speed up compilation on Windows
-
- 13 Jul, 2021 3 commits
-
-
Committed by 王明冬
-
Committed by Zeng Jinle
-
Committed by jakpiase
* added printing tensor's format * added suggested changes
-
- 09 Jul, 2021 2 commits
-
-
Committed by Yuang Liu
-
Committed by feng_shuai
-
- 08 Jul, 2021 1 commit
-
-
Committed by dyning
-