- 28 Mar, 2023 3 commits
-
-
Committed by ronnywang
-
Committed by Infinity_lee
-
Committed by Young-Flash
* add bitwise_not trt converter
* run pre-commit
* modify neg_one_tensor_dims init way
* fix BOOL type support requires TensorRT 8.4
* fix int8 & uint8 type
* improve data type readability
* modify filter logic
* fix coverage CI
-
- 27 Mar, 2023 12 commits
-
-
Committed by YuanRisheng
* perfect structure kernel registry
* fix ci bugs
-
Committed by Aurelius84
-
Committed by zhouweiwei2014
-
Committed by HappyHeavyRain
* add fused_yaml fused_backward
* fix eager_funciton bug
* add some comment of fused yaml file
* add 'support_dygraph_mode' configuration in fused yaml
* delete some 'fused_api.h' in include file
* add fused flag in api_gen
-
Committed by Xinyu Chen
-
Committed by HongyuJia
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output
* delete dtype,shape func of multi_inplace op
* [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output
-
Committed by HappyHeavyRain
* support assign op
* support assign infer_var_type
* change code according to review
* change code according to review
* only save 'get_infer_var_type_func'
* rest file mode
-
Committed by Leo Chen
-
Committed by wanghuancoder
This reverts commit 9c238d2b.
-
Committed by engineer1109
* fix bug
* remove useless
* fix bug
* add pybind
* remove log
* fix style
* fix style
* change api
-
Committed by risemeup1
* fix_gcc12_error
* fix gcc12 error
* fix gcc12 error
-
Committed by Sławomir Siwek
* extract Op and OPMaker to .h
* extend pattern for fused_op
* set "with_residual" default to false
* adjust fuse passes
* remove fc+eltwise flag
* fused_output_scale
* activation attrs
* remove extra attrs
* fix int8/bf16 unit tests
* simplify RecomputeOutputDims
* remove unused method
* Add description for attributes
* add extra check
* adjust op compats
* update quantize test
* fix protobuf parsing error
* fix int8 performance
* fused elementwises
* merge develop
* remove activation
* restore activation for existing add/sub ops
-
- 25 Mar, 2023 1 commit
-
-
Committed by Infinity_lee
* codestyle up027
* add to pyproject.toml
-
- 24 Mar, 2023 4 commits
-
-
Committed by TaoTao Li
* add all_reduce, reduce kernel and api
* fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts
* fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove
* fix code format fix comments
* modify test_collective_reduce_api ut timeout
* fix PR-CI-Build fix comments: format phi operator
-
Committed by YuanRisheng
* decouple memory copy
* fix ci bugs
* fix ci compile bugs
* fix rocm compile
* fix ci bugs
* decouple memory
* deal with conflict
* fix xpu compile bugs
* fix xpu bugs
* deal with xpu bugs
* fix cmake bugs
* fix windows bugs
* fix ci bugs
* fix ci bugs
* delete redundant code
* add code for pybind
* fix py3 bugs
* fix ci bugs
-
Committed by YuhangLi
* [CUSTOM DEVICE]analysis predictor custom device support
* del debug log
-
Committed by Yuanle Liu
-
- 23 Mar, 2023 15 commits
-
-
Committed by HongyuJia
-
Committed by Wangzheee
-
Committed by xiaoguoguo626807
* delete prim flag for matmul_2_grad
* delete prim flag for matmul_2_grad
* add new setgradoutmeta for matmul_double_grad_node
* modify test and delete log
* deal with review
-
Committed by iSerendipity
-
Committed by ZhouMengLei1999
-
Committed by sneaxiy
* remove fluid deps in fused_linear_param_grad_add_kernel
* fix compile error
* fix ut error
* follow comments
-
Committed by Huang Jiyi
* unify add_position_encoding
* unify affine_channel
* unify alloc_float_status
* unify allreduce
* unify alltoall
* unify anchor_generator
* unify ascend_trigger
* fix bug
* fix test
-
Committed by Huang Jiyi
* update
* update
* update
* update
* update
* fix test
-
Committed by cxxly
-
Committed by zqw_1997
* to support cuda12, pybind need to upgrade to v2.10.0
* add DEPS of pybind in test_custom_plugin_creater.cc
* only change the tag
* please let CI pass
* try pybind v2.10/3
* modify the include header in test
* code check
-
Committed by Infinity_lee
-
Committed by Huang Zhengjie
* fix distributed graph engine
-
Committed by PuQing
* add nanmedian output defs
* remove the multiclass_nms3 momentum
-
Committed by PuQing
[CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and unnecessary <list/tuple> passed to <list/tuple>() (#51928)
* autofix
* add select config
* autofix C410
* add C410 select
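The rule codes in the commit message above match the naming used by the flake8-comprehensions plugin (an assumption; the commit does not name the tool). A minimal sketch of the kind of rewrite such an autofix might perform, using hypothetical functions `before` and `after` that are not taken from the repository:

```python
# Illustrative only: `before` and `after` are hypothetical, not Paddle code.

def before():
    opts = dict(axis=0, keepdim=True)  # C408: unnecessary dict call
    empty = list()                     # C408: unnecessary list call
    shape = tuple([2, 3])              # C409: list literal passed to tuple()
    dims = list((0, 1))                # C410: tuple literal passed to list()
    return opts, empty, shape, dims


def after():
    opts = {"axis": 0, "keepdim": True}  # dict literal
    empty = []                           # list literal
    shape = (2, 3)                       # tuple literal
    dims = [0, 1]                        # list literal
    return opts, empty, shape, dims
```

Both functions return equal values; the literal forms simply avoid a redundant constructor call.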
-
Committed by Jiabin Yang
* allow return none when stop_gradient=True
* remove useless code
* refine code
* refine code
* fix test cast
* change more test
* add more tests
* fix error msg in pylayer
-
- 22 Mar, 2023 5 commits
-
-
Committed by HappyHeavyRain
* test_get_kernel
* add invoke signature
* change reduce_max
* change frobenius_norm
* reset reduce_max according to composite and change reduce_all
* fix the bug when Scalar(*)
* fix 'scalar when support_tensor'
* change code according to review
* change 'keep_signature' to 'manual_signature' and add some error info
* support optimizers autogen
* change sgd yaml
* change generate signature
* fix test/cpp/new_executor/CM
* reset signature generated function
* change signature function
* change signature function
-
Committed by YangQun
* support 0-d tensor for element wise unary ops
* fix python code style check
* fix approval check
* support 0-d tensor for onednn softmax and logsoftmax kernels
* fix comments
* fix some unittests
-
Committed by joanna.wozna.intel
-
Committed by Ghost Screaming
* Add fused_feed_forward pass for semi-automatic static graph training.
* Add fused_feedforward property in parallel_executor.cc
* Polish code.
* Polish fused feed_forward pass code. Support use_dropout1 and use_dropout2 option.
* Support model parallel in fused_feedforward pass.
-
Committed by Sławomir Siwek
* extract common methods to reuse
* add header for transpose ops
* fused_transpose
* Split big function
* transpose2 tests
* fused_transpose
* Apply extra attributes
* add pbtxt file
* update pbtxt
* Merge develop
* add more strict op compats
* code style
* remove mkldnn_data_type
* unify SetOutMemDescWithReshape2FuseSupport
* adjust quantize-dequantize for transpose
* remove appendact
* transpose2 quantization
* fix int8 tests
* adjust transpose_op to current develop
* delete fusion code from transpose_kernel
* add fused transpose to NHWC unittest
* change order
-