- 24 Aug, 2023 1 commit
-
-
Committed by zhangbo9674
* add code * fix bug * fix bug
-
- 23 Aug, 2023 2 commits
-
-
Committed by Wang Xin
-
Committed by wanghuancoder
* support ir fill constant
-
- 22 Aug, 2023 2 commits
-
-
Committed by jiangfan06
-
Committed by FormlessUnit
* Modify kernels to support quantized_matmul --------- Co-authored-by: superxf <1208713646@qq.com>
-
- 21 Aug, 2023 1 commit
-
-
Committed by RichardWooSJTU
-
- 18 Aug, 2023 1 commit
-
-
Committed by huangjiyi
-
- 16 Aug, 2023 4 commits
-
-
Committed by MarDino
* refine static op return val
-
Committed by jiangfan06
-
Committed by xiongkun
-
Committed by hong
* support new ir load combine * update * polish code * remove print * support c concat * update * polish code * fix bug * polish code * fix compile bug * polish code * remove useless code
-
- 15 Aug, 2023 1 commit
-
-
Committed by xiaoxiaohehe001
* support_mmha * add_python_api * add_api_doc * fix_doc_error * fix_infermeta * add_infermeta * add_bf16_cuda_check * add_bf16_check * fix_ci_windows * fix_ci_windows_kernel_register * fix_test_mmha * add_cumoffsets * remove_bias * delete_mmha_reshape_input_output * rename_delete_hfile * remove_fluid --------- Co-authored-by: yangjianfengo1 <yangjianfeng01@baidu.com>
-
- 14 Aug, 2023 1 commit
-
-
Committed by MarDino
* add rmsnorm residual bias add and quant * refine python interface * add rmsnorm unittest * Add layernorm * fix layernorm unittest * refine unittest * fix example code * fix review comment
-
- 10 Aug, 2023 1 commit
-
-
Committed by lzy
* add variable_length_memory_efficient_attention * update variable_length_memory_efficient_attention unittest * update variable_length_mem_eff_attn's docs and unittest * update variable_length_mem_eff_attn's docs * Update test_variable_length_memory_efficient_attention.py * Update variable_length_memory_efficient_attention.cu * fix codestyle * fix variable_length_fmha's docs and unittest * fix variable_length_fmha's docs
-
- 09 Aug, 2023 1 commit
-
-
Committed by Xinyu Chen
* onednn: rename macro to PADDLE_WITH_DNNL * onednn: rename macro to CINN_WITH_DNNL
-
- 08 Aug, 2023 2 commits
- 03 Aug, 2023 3 commits
- 02 Aug, 2023 3 commits
-
-
Committed by wz1qqx
-
Committed by wanghuancoder
* fix security bug
-
Committed by wanghuancoder
* fix security bug
-
- 01 Aug, 2023 1 commit
-
-
Committed by hong19860320
-
- 31 Jul, 2023 1 commit
-
-
Committed by wanghuancoder
support stride
-
- 28 Jul, 2023 1 commit
-
-
Committed by Yuang Liu
-
- 26 Jul, 2023 1 commit
-
-
Committed by tianhaodongbd
-
- 24 Jul, 2023 1 commit
-
-
Committed by JYChen
* add IndexPutGradInfermeta to fix backward error in static-mode * codestyle
-
- 20 Jul, 2023 1 commit
-
-
Committed by zhupengyang
-
- 19 Jul, 2023 2 commits
- 13 Jul, 2023 4 commits
-
-
Committed by freeliuzc
* add init value for CudaSwishFunctor * add new phi kernel fusedBiasActKernel
-
Committed by Charles-hit
* [prim]support fp16 for instance_norm and instance_norm_grad * support fp16 and bfp16 dtype for instance_norm prim rules * fix new ir test --------- Co-authored-by: cxxly <chenxx_id@163.com>
-
Committed by lil-Xing
* add phi operator c_concat and ut * update create_var use * update copyright
-
Committed by RichardWooSJTU
* add matmul int8
-
- 12 Jul, 2023 2 commits
-
-
Committed by hong
* refine program translator * fix warning: not override * fix bug * merge new modifications * modify by reviews * resolve conflicts * resolve conflicts * fix * fix * update * support selected rows * update * add selectrows * fix bug * add ut * refine code * refien code * update * update * support selected rows * support selected rows * support dense tensor * remove useless code * polish code * remote standalone executor test --------- Co-authored-by: kangguangli <kangguangli@hotmail.com> Co-authored-by: zhangbo9674 <zhangbo54@baidu.com>
-
Committed by Wang Xin
* [clang-tidy] enable readability-container-size-empty check * fix test_custom_kernel Failed * add clang-tid-10 in dockerfile * add clang-tidy in dockerfile * fix bug
-
- 11 Jul, 2023 3 commits
-
-
Committed by pangengzheng
* support sharding parallel * fix name * fix * update * test amp for sharding --------- Co-authored-by: pangengzheng <pangengzheng.baidu.com>
-
Committed by MarDino
* add rmsnorm kernel * add static graph test * fix round type * use alignas to avoid msvc compile error * remove redundant headerfile to avoid rocm compile error * fix rocm compile not found cub * Add document
-
Committed by FormlessUnit
* rename weight_only/llm.int8
-