- 26 5月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* [IR&PASS] part 1: add pass base, pass manager, adaptor pass, ut * include cstdint
-
- 25 5月, 2023 3 次提交
-
-
由 zhangbo9674 提交于
* refine code * delete some unused code * refine code of build * refine code of build * add block * refine builder * refine code * refine code by comment * fix compiler bug
-
由 zhouweiwei2014 提交于
-
由 zhoutianzi666 提交于
* add tranfer_elim * transfer layout elimination
-
- 24 5月, 2023 4 次提交
-
-
由 zhangyuqin1998 提交于
-
由 wz1qqx 提交于
-
由 kangguangli 提交于
* add vector type support for program translator * polish * resolve conflicts * add verify for combine/slice and unittests * polish
-
由 lijin23 提交于
[XPU][PHI Kernels] bind bitwise_add kernel & add int32/int64 support to scatter_nd_add kernel for xpu (#54066) * bind new kernels to xpu * refine code * fix bugs in unittest
-
- 23 5月, 2023 6 次提交
-
-
由 Leo Chen 提交于
-
由 RuohengMa 提交于
-
由 zhangbo9674 提交于
* Use copy_if_different to avoid recompilation of generated cutlass kernels. * add program parameter dialect_interface * fix op create bug * add conv2d * draft of paddle converter * fix CI * fix windows CI * fix program destructor * printer draft * fix bug * printer draft finish * fix windows CI * reserve inplace semantics * revert program::destroy since no need to do topology sort * revert * modify by reviews * commit printer and resnet50 related ops * fix * fix * fix op definition * refine op dyn_cast * fix bug * refine code * refine code * refine code * refine code * add code gen * refine code * refine code * refine code --------- Co-authored-by: Numiswing <umiswing@foxmail.com> Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
-
由 niuliling123 提交于
-
由 zhangyikun02 提交于
-
由 kangguangli 提交于
* Use copy_if_different to avoid recompilation of generated cutlass kernels. * add program parameter dialect_interface * fix op create bug * add conv2d * draft of paddle converter * fix CI * fix windows CI * fix program destructor * printer draft * fix bug * printer draft finish * fix windows CI * reserve inplace semantics * revert program::destroy since no need to do topology sort * revert * modify by reviews * polish * fix op definition * fix CI * refresh file changes --------- Co-authored-by: Numiswing <umiswing@foxmail.com> Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
-
- 22 5月, 2023 11 次提交
-
-
由 Meteor Liu 提交于
* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * fixed cyclic reference that caused patial import * fixed bad change * fix bad import * fix bad import * fix bad import * fix ut failed caused by change in_dynamic_mode * fix ut failed caused by change in_dynamic_mode * fixed usage of in_dynamic_mode() or in_dygraph_mode() * revert python3 to python in .pre-commit-config.yaml * fix merge conflicts
-
由 risemeup1 提交于
* update_c++14_to_c++17_on_windows * disable test_audio_logmel_feature and test_audio_mel_feature
-
由 niuliling123 提交于
-
由 xiongkun 提交于
* [Dy2static-Fallback] add set_eval_frame function in pybind. 1. add set_eval_frame function in pybind. * add unittest for eval frame hooker. * [support py38] * fix-GeneratorExit error in eval frame hooker * support python == 3.9 * support 3.10 * fix some comments * speed up eval frame for cache hitted code. * code format * fix unittest --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 zhangbo9674 提交于
* refine op dyn_cast * fix bug * refine code * refine code * refine code * refine code
-
由 cyber-pioneer 提交于
* recompute bn grad * fix test case --------- Co-authored-by: Nsunli <466530738@qq.com>
-
由 lijin23 提交于
* fix empty bugs for xpu * fix empty bugs for xpu
-
由 zhupengyang 提交于
-
由 Yuanle Liu 提交于
[Inference] add config.enable_low_precision_io api and remove rely on AnalysisConfig::Precison in trt (#52485)
-
由 zhangyikun02 提交于
-
由 wangshengxiang 提交于
* bind xpu op: 3D grid sample * fix edge cases in xpu op: reshape & slice
-
- 20 5月, 2023 1 次提交
-
-
由 zhangbo9674 提交于
* add types and attributes * remove some const_cast * refine code
-
- 19 5月, 2023 4 次提交
-
-
由 shentanyue 提交于
-
由 Frank Lin 提交于
* Improve Readability and Overall Clarity of Logging * Adds the set_input_type API for specifying input data types * Specifying input data types
-
由 王明冬 提交于
-
由 GGBond8488 提交于
* remove user define grad * fix errors * remove unused self.x_grad, self.out_grad
-
- 18 5月, 2023 9 次提交
-
-
由 Hulek 提交于
* Fused elementwises kernels and ops * change fuse pass name * adjust .pbtxt files * adjust quantization attributes * add missing arguments and fix others, review fixed * simplify fused kernel registration * fix elementwise unit tests * reuse one fused elementwise op * adjust proto * Add supported datatypes * Change 'Scale' to 'scale' in tests, change some tests to onednn * Revert breaking changes * Fix unit tests * Delete obsolete test cases * Delete commented out code * Fix codestyle * delete temporary condition * fix conflicts and delete duplicate fusing * Fix code after merge * Move tests to new directory * fix tests volatility * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py * Update CMakeLists.txt add mkldnn op test --------- Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 huangjiyi 提交于
-
由 co63oc 提交于
-
由 tianshuo78520a 提交于
* fix * fix
-
由 xiongkun 提交于
* [Dy2static-Fallback] add set_eval_frame function in pybind. 1. add set_eval_frame function in pybind. * add unittest for eval frame hooker. * [support py38] * fix-GeneratorExit error in eval frame hooker * support python == 3.9 * support 3.10 * fix some comments
-
由 HongyuJia 提交于
-
由 RedContritio 提交于
* simplify layer_norm_op.cc * support auto generate for op layer_norm * update unittest for composite_layer_norm * remove layer_norm_op.cc from scripts * replace layer_norm_op with generated_op * add get_expected_kernel for layer_norm * update cmake kernel register function for layer_norm_mkldnn_op
-
由 shaojie_wang 提交于
* add master gradients on static graph * add unit test for bf16 master grad static graph * use float16 as v100 test dtype * only skip GPU which do not support bf16 * use linear layer to test master grad * 1.push master grad creation before all optimizer ops; 2.remove useless unittest; 3.use a function to create master grad states
- 17 5月, 2023 1 次提交
-
-
由 zhangbo9674 提交于
* add program parameter dialect_interface * fix op create bug * add ir parameter convert pd variable methods * refine code * fix bug * refine by ut * refine ut * delete unused code * refine code * refine code by comment * reset WITH_NEW_IR * refine op attribute map * refine program and op create * refine program and op create
-