- 23 5月, 2023 2 次提交
-
-
由 zhangyikun02 提交于
-
由 kangguangli 提交于
* Use copy_if_different to avoid recompilation of generated cutlass kernels. * add program parameter dialect_interface * fix op create bug * add conv2d * draft of paddle converter * fix CI * fix windows CI * fix program destructor * printer draft * fix bug * printer draft finish * fix windows CI * reserve inplace semantics * revert program::destroy since no need to do topology sort * revert * modify by reviews * polish * fix op definition * fix CI * refresh file changes --------- Co-authored-by: Numiswing <umiswing@foxmail.com> Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
-
- 22 5月, 2023 11 次提交
-
-
由 Meteor Liu 提交于
* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * fixed cyclic reference that caused patial import * fixed bad change * fix bad import * fix bad import * fix bad import * fix ut failed caused by change in_dynamic_mode * fix ut failed caused by change in_dynamic_mode * fixed usage of in_dynamic_mode() or in_dygraph_mode() * revert python3 to python in .pre-commit-config.yaml * fix merge conflicts
-
由 risemeup1 提交于
* update_c++14_to_c++17_on_windows * disable test_audio_logmel_feature and test_audio_mel_feature
-
由 niuliling123 提交于
-
由 xiongkun 提交于
* [Dy2static-Fallback] add set_eval_frame function in pybind. 1. add set_eval_frame function in pybind. * add unittest for eval frame hooker. * [support py38] * fix-GeneratorExit error in eval frame hooker * support python == 3.9 * support 3.10 * fix some comments * speed up eval frame for cache hitted code. * code format * fix unittest --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 zhangbo9674 提交于
* refine op dyn_cast * fix bug * refine code * refine code * refine code * refine code
-
由 cyber-pioneer 提交于
* recompute bn grad * fix test case --------- Co-authored-by: Nsunli <466530738@qq.com>
-
由 lijin23 提交于
* fix empty bugs for xpu * fix empty bugs for xpu
-
由 zhupengyang 提交于
-
由 Yuanle Liu 提交于
[Inference] add config.enable_low_precision_io api and remove rely on AnalysisConfig::Precison in trt (#52485)
-
由 zhangyikun02 提交于
-
由 wangshengxiang 提交于
* bind xpu op: 3D grid sample * fix edge cases in xpu op: reshape & slice
-
- 20 5月, 2023 1 次提交
-
-
由 zhangbo9674 提交于
* add types and attributes * remove some const_cast * refine code
-
- 19 5月, 2023 4 次提交
-
-
由 shentanyue 提交于
-
由 Frank Lin 提交于
* Improve Readability and Overall Clarity of Logging * Adds the set_input_type API for specifying input data types * Specifying input data types
-
由 王明冬 提交于
-
由 GGBond8488 提交于
* remove user define grad * fix errors * remove unused self.x_grad, self.out_grad
-
- 18 5月, 2023 9 次提交
-
-
由 Hulek 提交于
* Fused elementwises kernels and ops * change fuse pass name * adjust .pbtxt files * adjust quantization attributes * add missing arguments and fix others, review fixed * simplify fused kernel registration * fix elementwise unit tests * reuse one fused elementwise op * adjust proto * Add supported datatypes * Change 'Scale' to 'scale' in tests, change some tests to onednn * Revert breaking changes * Fix unit tests * Delete obsolete test cases * Delete commented out code * Fix codestyle * delete temporary condition * fix conflicts and delete duplicate fusing * Fix code after merge * Move tests to new directory * fix tests volatility * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py * Update CMakeLists.txt add mkldnn op test --------- Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 huangjiyi 提交于
-
由 co63oc 提交于
-
由 tianshuo78520a 提交于
* fix * fix
-
由 xiongkun 提交于
* [Dy2static-Fallback] add set_eval_frame function in pybind. 1. add set_eval_frame function in pybind. * add unittest for eval frame hooker. * [support py38] * fix-GeneratorExit error in eval frame hooker * support python == 3.9 * support 3.10 * fix some comments
-
由 HongyuJia 提交于
-
由 RedContritio 提交于
* simplify layer_norm_op.cc * support auto generate for op layer_norm * update unittest for composite_layer_norm * remove layer_norm_op.cc from scripts * replace layer_norm_op with generated_op * add get_expected_kernel for layer_norm * update cmake kernel register function for layer_norm_mkldnn_op
-
由 shaojie_wang 提交于
* add master gradients on static graph * add unit test for bf16 master grad static graph * use float16 as v100 test dtype * only skip GPU which do not support bf16 * use linear layer to test master grad * 1.push master grad creation before all optimizer ops; 2.remove useless unittest; 3.use a function to create master grad states
- 17 5月, 2023 2 次提交
-
-
由 zhangbo9674 提交于
* add program parameter dialect_interface * fix op create bug * add ir parameter convert pd variable methods * refine code * fix bug * refine by ut * refine ut * delete unused code * refine code * refine code by comment * reset WITH_NEW_IR * refine op attribute map * refine program and op create * refine program and op create
-
由 6clc 提交于
-
- 16 5月, 2023 6 次提交
-
-
由 xiaoguoguo626807 提交于
* add rules * modify no kernel yaml parse * success op generate * success test_silu_double * modify bug * modify static error * modify silu_grad input * modify kernel signature * modify kernel signature * code style * code style * review * delete opinfo modify * modify gradOpMaker * modify gradOpMaker * modify genarated-j2 * add approve rules * modify aytograd_functional_static_test
-
由 sprouteer 提交于
-
由 niuliling123 提交于
-
由 Wang Xin 提交于
* static graph autogen code support for softmax op * bug fixed * fix PR-CI-Windows error * fix CI error * bug fixed * fix conflicts
-
由 meteor135 提交于
* remove _in_eager_mode_ * remove _in_eager_mode_
-
由 Yiqun Liu 提交于
* Allow to switch whether to use promote strategy to choose kernel for O2 training. * Fix comparing error and add unittest.
-
- 15 5月, 2023 5 次提交
-
-
由 shaojie_wang 提交于
* fix embedding model weight type mismatch error * Update fp16_utils.py --------- Co-authored-by: NZhang Ting <zhangting_2017@163.com>
-
由 bukejiyu 提交于
[inference Zero-Dim][trt] Add Zero-Dim tensor support for clip, cast, flatten_contiguous_range (#53769) * [inference Zero-Dim][trt]clip,cast,flatten_contiguous_range trt op converter support zero dim
-
由 xiaoguoguo626807 提交于
* add rules * modify no kernel yaml parse * success op generate * success test_silu_double * modify bug * modify static error * modify silu_grad input * modify kernel signature * modify kernel signature * code style * code style * review * delete opinfo modify
-
由 Aurelius84 提交于
-
由 Aurelius84 提交于
-