- 03 8月, 2023 1 次提交
-
-
由 WangZhen 提交于
-
- 01 8月, 2023 4 次提交
-
-
由 Sonder 提交于
* move prune_gate_by_capacity to phi * fix * fix registe info * remove useless codes
-
由 gouzil 提交于
-
由 hong 提交于
* new ir support print op * fix gpu bug * fix bug * update * remove layout to string * remove usless header * polish code * fix bug * posolis code
-
由 LiYuRio 提交于
* use string as key for comm_context_manager * remove device_id from comm_context
-
- 31 7月, 2023 3 次提交
-
-
由 Sonder 提交于
-
由 wanghuancoder 提交于
support stride
-
由 zhangyuqin1998 提交于
* rename BatchNormGradFunctor * Update batch_norm_grad_kernel.cc * Update batch_norm_grad_kernel.cu * Update batch_norm_grad_kernel.cc * fix * Update batch_norm_grad_kernel.cc
-
- 28 7月, 2023 1 次提交
-
-
由 hong 提交于
* new ir support save combine * update * polish code * update * new ir support fluid op * remove depulicate op * fix ir exe test compile error * fix compile bug * update * code format * update * update * polish code
-
- 27 7月, 2023 1 次提交
-
-
由 ming1753 提交于
* Paddle-TRT support float64 in/out type, support fill_any_like_op in int64
-
- 25 7月, 2023 1 次提交
-
-
由 TaoTao Li 提交于
-
- 24 7月, 2023 2 次提交
-
-
由 chen 提交于
[Paddle-TRT] Convert 0D tensor to 1D tensor, increase the shape tensor's number count when collecting shape (#55503) * make 0-D tensor to 1-D tensor to support Grounding-SAM and add shape check * recover identity_op_clean_pass.cc
-
由 Xinyu Chen 提交于
* onednn: remove fc+eltwiseadd fusion pass * onednn: remove post-sum fusion in fc kernel * onednn: tests: make unfused add run into f32
-
- 21 7月, 2023 1 次提交
-
-
由 Ruibin Cheung 提交于
-
- 14 7月, 2023 1 次提交
-
-
由 RedContritio 提交于
-
- 13 7月, 2023 2 次提交
-
-
由 zyfncg 提交于
* add check of input tensors in Yaml * fix bug of code-gen for opmaker * fix bug
-
由 RichardWooSJTU 提交于
* add matmul int8
-
- 12 7月, 2023 2 次提交
-
-
由 YangQun 提交于
* squash pick the poc code * fix build after rebase * fix int8 conv and fc uts * Fix and clean-up Get_SRC_Scale_Memory * fix floating point fc uts * fix test_analyzer_int8_googlenet * test_analyzer_int8_mobilenetv1 * fix int8 mobilenet v2 and v3 * fix build error after rebase * [oneDNN] rename library version * fix conv bias datatype * try to fix import error * fix rebase error * [oneDNN] pack library into python wheel * add MKLDNN_SHARED_LIB_3 to env_dict * fix test_analyzer_bert * fix fill_constant op kernel * fix ernie and matmul op ut * fix softplus ut * fix conv+relu6 fusion ut * fix hardswish fusion * fix quant+transpose fusion ut * fixsgd ut * fix int8 matmul with flatten * fix fc+scale fusion * fix conv/matmul+gelu fusion uts * fix rebase error * Revert "fix conv/matmul+gelu fusion uts" This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1. * upgrade to onednn v3.1 * remove older version onednn * use densetensor::data() for achieving mean and var in layernorm impl * comments for atol of integer tests * fix clang-format * Revert "remove older version onednn" This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09. * improve binary handle * fix expand kernel * Revert "use densetensor::data() for achieving mean and var in layernorm impl" * always use forward_inference for conv * remove activation scales * rollback changes to mkldnn.cmake * address comments * port changes to dequantize kernel * fix merge error * fix fused_elementwise_kernel * upgrade onednn version to v3.1.1 * fix some approval error * fix error msg format * remove old onednn libs * try to fix symbolic link issue * fix cinn test case segfault * do not explicit link test with onednn * remove unnecessary changes * integrate CINN with onednn v3 * link with mkldnn project * fix cinn build file --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com> Co-authored-by: Ntianshuo78520a <707759223@qq.com>
-
由 Wang Xin 提交于
* [clang-tidy] enable readability-container-size-empty check * fix test_custom_kernel Failed * add clang-tid-10 in dockerfile * add clang-tidy in dockerfile * fix bug
-
- 05 7月, 2023 1 次提交
-
-
由 RedContritio 提交于
* configure elementwise_pow op_version * support auto generate for static op elementwise_pow * pre-commit run
-
- 04 7月, 2023 1 次提交
-
-
由 RedContritio 提交于
-
- 03 7月, 2023 5 次提交
-
-
由 lzydev 提交于
* support auto-gen concat * fix bug in legacy_backward.yaml * fix bug in get_expeceted_kernel_type
-
由 RedContritio 提交于
* configure elementwise_mod op_version * support auto generate for static op elementwise_mod
-
由 RedContritio 提交于
* configure elementwise_floordiv op_version * support auto generate for static op elementwise_floordiv * update unity_build_rule.cmake
-
由 LoneRanger 提交于
* fix the static op generation for group_norm * fix bug * fix bug * Update op_compat.yaml
-
由 FormlessUnit 提交于
* add linear_compress API
-
- 30 6月, 2023 4 次提交
-
-
由 lzydev 提交于
-
由 lzydev 提交于
-
由 lzydev 提交于
-
由 Zhenghai Zhang 提交于
* static graph autogen code for expand * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug
-
- 29 6月, 2023 2 次提交
-
-
由 Yiqun Liu 提交于
* Fix compiling on XPU related to MPTypeTrait. * Unify the use of MPTypeTrait. * Fix compiling error.
-
由 张经纬 提交于
* fix clang-format * 'fix_clang-format' * fix remaining errors * format * empty commit, re-trigger all ci * empty commit, re-trigger all ci --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 28 6月, 2023 1 次提交
-
-
由 RedContritio 提交于
-
- 27 6月, 2023 1 次提交
-
-
由 6clc 提交于
* feat(cmake): add cmake of cinn * feat(cmake): add cmake of cinn python test * feat(cmake): add jit * feat(cmake): test/CMakeList.txt * feat(cmake): rebase to develop * feat(cmake): remove some flags * fix(cmake): fix cinn's gflags depends * feat(cmake): add ci scripts of cinn * feat(cmake): copy code of cinn * fix(cmake): fix cinn third_party model path * gflags dynamic dependce * fix ci build_demo * tmp update to c++17 of cinn-only test * fix cinn only with c++17
-
- 26 6月, 2023 3 次提交
-
-
由 cyberslack_lee 提交于
-
由 Sanbu 提交于
-
由 Sonder 提交于
* remove ops from OpsWithFluidKernelNeedMoveToPhi set * open static build flag * OpsWithFluidKernelNeedMoveToPhi * open new_executor_static_build * add infermate for cudnn_lstm * fix * update * fix * update * update * update * fix pow2 decay * fix pow2 decay * recover analysis_predictor.cc * fix pow2 decay * fix cudnn lstm * add output register info for svd * fix pow2_decay_with_linear_warmup_kernel * recover test lstm cudnn * recover svg register codes * fix register info * fix reduce sum register info * add output info for adadelta * add output info for adadelta * add output info for adamax * fix complex abs register info * add register info for cudnn_lstm_grad * recover * fix lstm cudnn * fix * fix xpu output registe info * remove std::cout * add backend * remove output info in pow2_decay_with_linear_warmup_kernel * add judgment in TensorShouldBeFakeInitialized * recover power_ * close new_executor_static_build * fix set_value_xpu
-
- 25 6月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* auto parallel support pipeline scheduler with standalone executor * rm check_fetch * update cmakelist and flags env * rm set micro batch id * rm import * update utils func * raise error when merge tensor for return_numpy is False * fix _pipeline_opt * fix unittest
-
- 20 6月, 2023 2 次提交