- 12 May, 2023 1 commit
-
-
Committed by Wangzheee
-
- 08 May, 2023 1 commit
-
-
Committed by Yuanle Liu
* add embedding generic plugin, not enabled
-
- 25 April, 2023 1 commit
-
-
Committed by Yuanle Liu
-
- 13 April, 2023 1 commit
-
-
Committed by zhoutianzi666
* fix bilinear_interp_v2 && some other bugs in trt 7011
* add version check in test_trt_convert_bilinear_interp_v2.py
-
- 31 March, 2023 1 commit
-
-
Committed by Galaxy1458
* this is a test pr, test=develop
* solve the four [-Wterminate] warning, test=develop
* solve the four [-Wterminate] warning, test=develop
* new fix [-Wterminate], test=develop
* new fix [-Wterminate], test=develop
* new fix [-Wterminate], test=develop
* new , test = develop
* new , test = develop
* new , test = develop
* new , test = develop
* new , test = develop
* new , test = develop
-
- 28 March, 2023 1 commit
-
-
Committed by Feiyu Chan
Add basic functionalities to support Scalar & Scalars in operator attribute.
1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types;
2. Scalar enhancement, add formatting, equality;
3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc;
4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`;
5. add functionality to canonicalize attribute map according to OpProto (if the op the attribute map is used for has an OpProto);
6. add code to manipulate Scalar proto message via the protobuf python API;
Add unittests.
1. add test cases for formatting, equality for Scalars, and WrapAsScalars;
2. add test cases for 'casting' between different morphs of attributes;
3. add test cases for extracting scalar & scalars from attribute;
4. add test cases for CanonicalizeScalarAttrs (and fix a bug in type index offset);
5. fix gmock's library filename on windows platform;
6. clean code: use canonicalize_attrs instead of inlining the function;
7. add test cases for libpaddle.Scalar in python code;
8. add test cases for `make_scalar_proto`, which manipulates the proto message `Scalar` via the protobuf python API.
-
- 22 March, 2023 1 commit
-
-
Committed by Wangzheee
fix embd plugin: S = mask_id.d[1]
-
- 21 March, 2023 1 commit
-
-
Committed by zhoutianzi666
* fix GN when params.c % params.cPerBlock != 0
* fix GN when params.c is not divisible by params.cPerBlock
-
- 16 March, 2023 1 commit
-
-
Committed by xjmxyt
* add dynamic support
* add more test
* fix bug
* change test
* change test
-
- 24 February, 2023 1 commit
-
-
Committed by zhoutianzi666
* fix multihead
* fix multihead
-
- 20 February, 2023 1 commit
-
-
Committed by Wang Bojun
-
- 16 February, 2023 1 commit
-
-
Committed by Huang Jiyi
* move layer_norm_kernel.cu.h to phi
* fix bugs
* fix namespace
* fix bugs
* fix CI-Windows
* replace mutable_data
* fix bugs
* fix bugs
-
- 11 February, 2023 1 commit
-
-
Committed by Wang Bojun
* eleadd_trans first version log fix
* refine code for linear format, add pass check
* linear format refine and ut fix
* fix ut
* windows ut
* windows ut 2
* move tensorMeta and alloc to configure
-
- 09 February, 2023 2 commits
-
-
Committed by zhoutianzi666
* add fmha_flashattention oss plugin
* add fmhca
* add oss fmhca
* code reconstruct and add ut
* code style refine
* fix ut and enforce check
* refine trt version check refine compile fix compile
* fix cross ut
* code refine
* use runtime trt version check
* bug fix and code refine
* compile fix
* merge develop
* add GN QDQ kernel
* support GN int8 fake kernel
* add with_int8
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 UT
* add version > 8000 in GN int8 UT
* add some check in .cu
* add stdlib.h in UT
* little change in .cu
* remove rand_r use rand
* remove use rand
* setAxis(1)
* when int8 is on allow fall back to fp16
---------
Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
-
Committed by Wang Bojun
* trans_layernorm
-
- 31 January, 2023 1 commit
-
-
Committed by wenbin
* gn_silu
* add ut
* set TIMEOUT
* correct comments
* comments
* disable windows ut
* rename parameter
-
- 12 January, 2023 1 commit
-
-
Committed by wenbin
* compile fix
* fix compile
* compile fix
* add more preln
-
- 11 January, 2023 1 commit
-
-
Committed by wenbin
* compile fix
* fix compile
* compile fix
-
- 10 January, 2023 3 commits
-
-
Committed by xiaoxiaohehe001
-
Committed by wenbin
* gn bug fix
* bug fix
* gn bug fix
-
Committed by MarDino
-
- 09 January, 2023 1 commit
-
-
Committed by wenbin
* skip_groupnorm
* init
* preln
* add ut
* more assert
* set timeout
* fix windows ci issue
-
- 23 December, 2022 2 commits
- 21 December, 2022 1 commit
-
-
Committed by Wangzheee
* optimize token prune for no varlen
-
- 20 December, 2022 1 commit
-
-
Committed by wenbin
* gn nhwc8
* remove error
-
- 19 December, 2022 1 commit
-
-
Committed by Wangzheee
* General optimization for no_varlen embedding layernorm
-
- 15 December, 2022 1 commit
-
-
Committed by Wangzheee
-
- 13 December, 2022 2 commits
-
-
Committed by weishengying
-
Committed by Wangzheee
* fix some transformer unittest
-
- 08 December, 2022 1 commit
-
-
Committed by Wangzheee
* general optimization no_varlen embedding layernorm
-
- 05 December, 2022 1 commit
-
-
Committed by Wang Bojun
* pass
* pass
* draft version
* share mem opt
* remove sharemem
* add pattern for the case with circle_shift=0
* add UT
* pass opt
* test_fix
* code-commit
* code-style
* code style
* code-style
* ut-fix
* op teller refine
* resolve conflict
* adjust position op_teller list and pass order for swin
* ut code style update
* adjust paddle pass order
* refine pass order
* refine pass order
* refine pass order
-
- 01 December, 2022 3 commits
- 28 November, 2022 1 commit
-
-
Committed by Wang Bojun
* add trt support
-
- 25 November, 2022 3 commits
-
-
Committed by zhangxin81
* fix lookup_table plugin deserialize size error
-
Committed by Wangzheee
* fix
-
Committed by Wang Bojun
* group norm fp16 support
-
- 24 November, 2022 1 commit
-
-
Committed by Wangzheee
* optimize token prune
-