- 18 5月, 2023 8 次提交
-
-
由 Hulek 提交于
* Fused elementwises kernels and ops * change fuse pass name * adjust .pbtxt files * adjust quantization attributes * add missing arguments and fix others, review fixed * simplify fused kernel registration * fix elementwise unit tests * reuse one fused elementwise op * adjust proto * Add supported datatypes * Change 'Scale' to 'scale' in tests, change some tests to onednn * Revert breaking changes * Fix unit tests * Delete obsolete test cases * Delete commented out code * Fix codestyle * delete temporary condition * fix conflicts and delete duplicate fusing * Fix code after merge * Move tests to new directory * fix tests volatility * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py * Update CMakeLists.txt add mkldnn op test --------- Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 huangjiyi 提交于
-
由 co63oc 提交于
-
由 tianshuo78520a 提交于
* fix * fix
-
由 xiongkun 提交于
* [Dy2static-Fallback] add set_eval_frame function in pybind. 1. add set_eval_frame function in pybind. * add unittest for eval frame hooker. * [support py38] * fix-GeneratorExit error in eval frame hooker * support python == 3.9 * support 3.10 * fix some comments
-
由 HongyuJia 提交于
-
由 RedContritio 提交于
* simplify layer_norm_op.cc * support auto generate for op layer_norm * update unittest for composite_layer_norm * remove layer_norm_op.cc from scripts * replace layer_norm_op with generated_op * add get_expected_kernel for layer_norm * update cmake kernel register function for layer_norm_mkldnn_op
-
由 shaojie_wang 提交于
* add master gradients on static graph * add unit test for bf16 master grad static graph * use float16 as v100 test dtype * only skip GPU which do not support bf16 * use linear layer to test master grad * 1.push master grad creation before all optimizer ops; 2.remove useless unittest; 3.use a function to create master grad states
-
- 17 5月, 2023 2 次提交
-
-
由 zhangbo9674 提交于
* add program parameter dialect_interface * fix op create bug * add ir parameter convert pd variable methods * refine code * fix bug * refine by ut * refine ut * delete unused code * refine code * refine code by comment * reset WITH_NEW_IR * refine op attribute map * refine program and op create * refine program and op create
-
由 6clc 提交于
-
- 16 5月, 2023 6 次提交
-
-
由 xiaoguoguo626807 提交于
* add rules * modify no kernel yaml parse * success op generate * success test_silu_double * modify bug * modify static error * modify silu_grad input * modify kernel signature * modify kernel signature * code style * code style * review * delete opinfo modify * modify gradOpMaker * modify gradOpMaker * modify genarated-j2 * add approve rules * modify aytograd_functional_static_test
-
由 sprouteer 提交于
-
由 niuliling123 提交于
-
由 Wang Xin 提交于
* static graph autogen code support for softmax op * bug fixed * fix PR-CI-Windows error * fix CI error * bug fixed * fix conflicts
-
由 meteor135 提交于
* remove _in_eager_mode_ * remove _in_eager_mode_
-
由 Yiqun Liu 提交于
* Allow to switch whether to use promote strategy to choose kernel for O2 training. * Fix comparing error and add unittest.
-
- 15 5月, 2023 6 次提交
-
-
由 shaojie_wang 提交于
* fix embedding model weight type mismatch error * Update fp16_utils.py --------- Co-authored-by: NZhang Ting <zhangting_2017@163.com>
-
由 bukejiyu 提交于
[inference Zero-Dim][trt] Add Zero-Dim tensor support for clip, cast, flatten_contiguous_range (#53769) * [inference Zero-Dim][trt]clip,cast,flatten_contiguous_range trt op converter support zero dim
-
由 xiaoguoguo626807 提交于
* add rules * modify no kernel yaml parse * success op generate * success test_silu_double * modify bug * modify static error * modify silu_grad input * modify kernel signature * modify kernel signature * code style * code style * review * delete opinfo modify
-
由 Aurelius84 提交于
-
由 Aurelius84 提交于
-
由 LoneRanger 提交于
* relocate regularizer.py * fix bug * fix bug * fix bug * relocate the import * replace _regularization_coeff with coeff * remove the L1DecayRegularizer and L2DecayRegularizer
-
- 13 5月, 2023 1 次提交
-
-
由 xiaoguoguo626807 提交于
* modify concat_grad add sum comp rule * delete default mul_double_grad * delete high grad test * recover yaml * modify yaml * recover add_double_grad prim
-
- 12 5月, 2023 7 次提交
-
-
由 6clc 提交于
* test(prim-cinn): split test_resnet and test_bert into three tests * test(prim-cinn): fix cmake file to run prim test in CINN-CI
-
由 houj04 提交于
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
* softmax support * support stack
-
由 Wang Xin 提交于
* sequence_mask functionalization * fix sequence_mask test
-
由 Aurelius84 提交于
This reverts commit 45ce0ad5.
-
由 Xiaoxu Chen 提交于
* [Dy2St]Fix x grad names when high order gradient * Polish error msg * Add inputs var to backward in dy2st * Fix error * Get grad names for backward API * Fix save load * Polish code * Add ut * [prim] fix not support optional grad bugs in higher order autodiff * [prim] remove duplicate fill_any_like caused by infershape_for_composite * fix _strip_grad_suffix_ bugs in higher-order autodiff * [prim] create output for test_static_prim.cc --------- Co-authored-by: N0x45f <wangzhen45@baidu.com>
-
- 11 5月, 2023 10 次提交
-
-
由 xiaoxiaohehe001 提交于
* support_act * delete_silu
-
由 Yuanle Liu 提交于
* prelu op trt converter support zero dim
-
由 Zhang Jun 提交于
-
由 Zhang Jun 提交于
-
由 Kaipeng Deng 提交于
* move DataLoader to paddle.io. test=develop
-
由 lijin23 提交于
* add pad op for xpu * add pad op for xpu * add pad op for xpu
-
由 xiaoguoguo626807 提交于
* modify concat_grad add sum comp rule * delete default mul_double_grad * delete high grad test * recover yaml * modify yaml
-
由 zhoutianzi666 提交于
-
由 gouzil 提交于
* [test]mv fluid controlflow detection dlnne tensorrt tests to tests * [test]clean dlnne * [test] fix test_tensorrt_engine_op * [test] try fix path error * [test] RollBACK test_tensorrt_engine_op * [test] RollBACK test_tensorrt_engine_op * [test]add todo * Empty-Commit; test=document_fix
-
由 xiaoxiaohehe001 提交于
* support_expand_mish
-