- 13 1月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* fix fc kernel diff * disable fc_elementwise_layernorm_fuse_pass
-
- 19 12月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
* [Release2.4] Revert python link prs (#48573) * Revert "Fix mac link python (#48017)" This reverts commit 3fa7a736. * Revert "[Cherry-pick] Fix python link error (#47811)" This reverts commit ff642c68. * Update config.go * [Paddle Inference] Add float_to_half_pass to support inference with mixed precision (#47993) * [Inference] optimize some code and fix some bug (#48780) * clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass * fix unitest timeout * [Paddle Inference] clean unused code (#48392) * fix * update * update Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 10 11月, 2022 1 次提交
-
-
由 RichardWooSJTU 提交于
* add fuse_multi_transformer_layer_pass
-
- 09 11月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* suqeeze2 + transpose2 fuse onednn cherrypick 2.4 * format * fix merge
-
- 08 11月, 2022 2 次提交
-
-
由 Kaipeng Deng 提交于
-
由 jakpiase 提交于
* fc cherrypick * another files added * added transpose cherrypick * reverter somebodys fc changes * minor fix * minor fix * cherry-pick of fc+act changes * minor fix * fix
-
- 03 11月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
-
由 yeliang2258 提交于
* add constant_folding_pass pass for mkldnn int8 * update UpdateScaleOpInOutScales
-
- 20 10月, 2022 1 次提交
-
-
由 Kaipeng Deng 提交于
* add fused_attention_pass. test=develop * support fp16. test=develop * fix format. test=develop
-
- 14 10月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
-
- 28 9月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
-
- 07 9月, 2022 1 次提交
-
-
由 wenbin 提交于
* first commit * conver done * correct format * layernorm_shift_partition * correct convert * redefine plugin * runable * bug fix * modify ShiftPartitionPattern * correct * add UT * modify ut * compile * modify enforce * modify UT
-
- 02 9月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
-
- 30 8月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
add constant folding pass, for some model,it will get less latency;
-
- 22 8月, 2022 3 次提交
-
-
由 joanna.wozna.intel 提交于
* Add int8 support for matmul+elementwiae_add fuse * Corrections after review and ernie test fix
-
由 Sławomir Siwek 提交于
* merge conv_concat_relu to conv_act * fix typo * extend unit test * reuse existing gpd * codestyle * enforce mkldnn conv
-
由 Yuanle Liu 提交于
-
- 16 8月, 2022 1 次提交
-
-
由 feng_shuai 提交于
* convert multihead to oss * fix:bug * fix:delete const cast * fix:don't support bias_qk * add vit pass * fix:convert bug and add preln_residual_bias * support length=-1 * add UT for convert * add no_bias_qk support for gpu_multihead_op * delete infer_shape depends on bias_qk * oss just can be used in T4 and A* * fix:change api for ROCM CI
-
- 15 8月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 14 8月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
This reverts commit 84bf5c31.
-
- 10 8月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
* cuda_graph * cuda_graph_ * cuda_graph_ * cuda_graph_
-
- 05 8月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* remove v2_transpose_reshape * matmul_transpose_reshape * reshape_transpose_matmul * restore ut * adjust old ut * restore parallel UT ruels * feedback from review
-
- 04 8月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* Add unit tests * matmul_v2 + activation * matmuls + elementwise_add * matmul_v2 postops * transform matmul to v2 * opcompat * fix fusing matmul with multipe outs * add shape constraints * remove unused vars * change pass order * - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint * add alpha constraint * merge matmul refactor * trigger CI * - fix * - another fix * code style * add support for matmul+elementwise_add+activation * code style * fix bfloat16 bugs * change append_binary to append_sum Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
-
- 02 8月, 2022 1 次提交
-
-
由 Wilber 提交于
* multihead matmul add fp16 * fix windows error * fix rocm error * fix rocm error
-
- 29 7月, 2022 1 次提交
-
-
由 ming1753 提交于
* fused_fc_elementwise_layernorm support fp16 * fused_fc_elementwise_layernorm support double
-
- 21 7月, 2022 2 次提交
-
-
由 ming1753 提交于
* fc support fp16 * add a ‘,’ on paddle_pass_builder.cc * fc support fp16 on non-cuda.
-
由 xiaoxiaohehe001 提交于
* convfusionfp16 * convfusionfp16 * convfusionfp16
-
- 12 7月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* add method for post ops * format code * gpd * format style * add matmul+act test * implement matmul+activation * whitespaces * code style * python code format * Increase UT timeout * code format * update style * generalize activation fuse passes * change order * Unify activation GPD * Revert changes with op_act * remove softmax mkldnn attrs * set common name for act attributes * whitespace * append postops by helper function * ut style * revert changes related to quantization * Reduce redundancy * reduce number of parameters * trigger CI * validate attribute * trim unit test
-
- 08 7月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 29 6月, 2022 1 次提交
-
-
由 Wilber 提交于
* inference add convert to mixed model ability.
-
- 24 6月, 2022 1 次提交
-
-
由 Wilber 提交于
* revert 40531 * update
-
- 22 6月, 2022 1 次提交
-
-
由 WJJ1995 提交于
* fixed multihead matmul fuse pass * Add unittests * rm scale op * fixed code style * fixed code style * resolve testcase falied * add note
-
- 21 6月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* consolidate conv act passes * generalize conv_activation * integrate conv+act tests * code style format * whitespaces * remove timeout from old tests * implement comments from review * restore ut * whitespace * code style * transpose * fixes after review * method for gettin act * Change Paddle_enforce error type * code format * add missing opcompats
-
- 20 6月, 2022 1 次提交
-
-
由 whs 提交于
-
- 09 6月, 2022 1 次提交
-
-
由 minghaoBD 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 02 6月, 2022 2 次提交
-
-
由 Tomasz Socha 提交于
* Enable fc on bfloat16 * Add pass for residual connection * Dissable Residual connection pass for now * Ban ResidualData from DQ * style * WO for python tests
-
由 Wangzheee 提交于
* new general transformer inference support
-
- 20 5月, 2022 1 次提交
-
-
由 zhupengyang 提交于
-
- 19 5月, 2022 1 次提交
-
-
由 shentanyue 提交于
* support yolov5s static/int8 * fix eltwise_sub and div weight compute * fix delete_fill_constant_pass
-