- 30 Jul, 2021 2 commits
-
-
Committed by Huihuang Zheng
-
Committed by jakpiase
* test version of matmul_v2
* added matmul_v2 grad kernel
* minor changes
* minor changes
* minor change for CI approval
* CI fix
* CI fix
* added squeeze and squeeze2 kernels
* CI fix
* CI fix
* CI fix
* disabled tests when compiled with cuda
* added setting format_tag by strides
* added sigmoid BF16 FWD/BWD and gelu BF16 BWD
* changes after review
* Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD" This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae.
* Revert "Merge branch 'matmul_v2_grad' into squeeze2_op" This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae.
* minor change
* added reshape1/2 kernels
* moved some functions into private block
* CI fix
* CI fix
* CI fix
-
- 29 Jul, 2021 2 commits
-
-
Committed by Zeng Jinle
* add fix op run order pass
* add ut for fix_op_run_order
* fix ci error
* improve coverage
* improve coverage again and fix cpu test case
* follow some comments
-
Committed by Huihuang Zheng
As the title
-
- 28 Jul, 2021 2 commits
-
-
Committed by jiangcheng
See https://github.com/PaddlePaddle/Paddle/pull/33949 for details
-
Committed by Wangzheee
-
- 22 Jul, 2021 1 commit
-
-
Committed by 王明冬
-
- 21 Jul, 2021 1 commit
-
-
Committed by 王明冬
-
- 20 Jul, 2021 1 commit
-
-
Committed by Pei Yang
-
- 16 Jul, 2021 1 commit
-
-
Committed by levi131
As the title says, this PR converts all blocks in the program into SSA subgraphs; the conversion is guarded by a flag
-
- 15 Jul, 2021 2 commits
-
-
Committed by 王明冬
[pass enhance] make the attribute check apply only to attributes defined in the op proto. test=develop (#34146)
-
Committed by Aurelius84
* Refine constructor logic of ParallelExecutor
* Replace executor with ParallelExecutor in run_program_op
-
- 13 Jul, 2021 1 commit
-
-
Committed by 王明冬
-
- 09 Jul, 2021 1 commit
-
-
Committed by feng_shuai
-
- 08 Jul, 2021 2 commits
- 07 Jul, 2021 2 commits
- 06 Jul, 2021 2 commits
- 05 Jul, 2021 1 commit
-
-
Committed by Wangzheee
-
- 02 Jul, 2021 1 commit
-
-
Committed by Wangzheee
-
- 01 Jul, 2021 5 commits
-
-
Committed by Wangzheee
-
Committed by 王明冬
-
Committed by wenbin
-
Committed by feng_shuai
-
Committed by feng_shuai
-
- 30 Jun, 2021 4 commits
-
-
Committed by jakpiase
* added matmul_v2 bf16/fp32 FWD kernel
* added formatting
* removed some tests due to timeout in CI
* refactored tests
* merged tests classes into one file
* minor change
* removed test guard for CUDA
* remove skipIf
* changes after review
* formatted one file
* minor change
* added skipping UT in CUDA place
-
Committed by wenbin
-
Committed by Wangzheee
-
Committed by Wangzheee
-
- 29 Jun, 2021 5 commits
-
-
Committed by feng_shuai
-
Committed by feng_shuai
-
Committed by feng_shuai
* scale_matmul_fuse_pass_init
* enhance scale_matmul_fuse_pass
* change scale_matmul_fuse_pass unittest
-
Committed by 王明冬
test=develop.
-
Committed by Wangzheee
-
- 28 Jun, 2021 2 commits
-
-
Committed by Double_V
* add transpose transpose opdef, test=develop
* add line, test=develop
* fix wrong name, test=develop
* add pass, test=develop
* fix bug, test=develop
* fix bug, test=develop
* delete limit about alpha, test=develop
* add mul to reshape2MatmulFusePass, test=develop
* add limit about alpha, test=develop
* fix bug,test=develop
* set adaptive as false and global_pooling as True, test=develop
* set x_num_col_dims as 1, test=develop
* fix reshape, add attr limit, test=develop
* fix conflict,test=develop
* fix comment, test=develop
* fix comment,test=develop
* fix comment,test=develop
* ,test=develop
* add IsType, test=develop
* add IsType, test=develop
-
Committed by 王明冬
-
- 25 Jun, 2021 2 commits
-
-
Committed by MissPenguin
-
Committed by Wangzheee
-