- 28 Aug, 2023 1 commit
Committed by GGBond8488
【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509) * tmp commit * add atan2 * add inplace api * fix error * add inpalce divide * add inplace api * add more inplace * add more inpalce * fix logical_not error * support sinh and cosh in cpu * support asin, acos, atan, asinh, acosh, atanh in cpu * fix typro * fix typro * mv out atan2 ldexp * mv out atan2 ldexp * support sinh and cosh in gpu * support asin, acos, atan, asinh, acosh, atanh in gpu * fix ge error * fix dygraph commpare error * fix dygraph commpare error * check complex in python * fix cast inpalce error * open inplace test * fix ops.yaml error * mv cast inpalce to python * fix coverage ci * add last inplace * fix inplace error * fix cast error * fix error * add nan_to_num_ * fix typro * fix sparse cast error * remove gpu 4 * fix static cast error * tmp commit * add atan2 * add inplace api * fix error * add inpalce divide * add inplace api * add more inplace * add more inpalce * fix logical_not error * fix typro * fix typro * mv out atan2 ldexp * mv out atan2 ldexp * fix ge error * fix dygraph commpare error * fix dygraph commpare error * fix cast inpalce error * open inplace test * fix ops.yaml error * mv cast inpalce to python * fix coverage ci * add last inplace * fix inplace error * fix cast error * fix error * add nan_to_num_ * fix typro * fix sparse cast error * remove gpu 4 * fix static cast error * fix cast error * fix * Revert "check complex in python" This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05. 
* add renorm , fix error * add coverage * fix cumsum inpalce version error * add cast inpalce impl * rm test.log * fix multiply_dyfunction and add multiply_backward test * add and use is_same_tensor * fix typro * fix sone error * fix typro --------- Co-authored-by: Scotty <jmhgchn@gmail.com> Co-authored-by: Scotty <527407973@qq.com>
- 25 Aug, 2023 1 commit
Committed by xiaoxiaohehe001
* add_bias_and_simplify_mmha
- 24 Aug, 2023 1 commit
Committed by YuanRisheng
- 22 Aug, 2023 1 commit
Committed by FormlessUnit
* Modify kernels to support quantized_matmul --------- Co-authored-by: superxf <1208713646@qq.com>
- 16 Aug, 2023 1 commit
Committed by MarDino
* refine static op return val
- 15 Aug, 2023 1 commit
Committed by xiaoxiaohehe001
* support_mmha * add_python_api * add_api_doc * fix_doc_error * fix_infermeta * add_infermeta * add_bf16_cuda_check * add_bf16_check * fix_ci_windows * fix_ci_windows_kernel_register * fix_test_mmha * add_cumoffsets * remove_bias * delete_mmha_reshape_input_output * rename_delete_hfile * remove_fluid --------- Co-authored-by: yangjianfengo1 <yangjianfeng01@baidu.com>
- 14 Aug, 2023 1 commit
Committed by MarDino
* add rmsnorm residual bias add and quant * refine python interface * add rmsnorm unittest * Add layernorm * fix layernorm unittest * refine unittest * fix example code * fix review comment
- 10 Aug, 2023 1 commit
Committed by lzy
* add variable_length_memory_efficient_attention * update variable_length_memory_efficient_attention unittest * update variable_length_mem_eff_attn's docs and unittest * update variable_length_mem_eff_attn's docs * Update test_variable_length_memory_efficient_attention.py * Update variable_length_memory_efficient_attention.cu * fix codestyle * fix variable_length_fmha's docs and unittest * fix variable_length_fmha's docs
- 07 Aug, 2023 1 commit
Committed by yin wei
* add mask * add backword * add enforce info * update scale * integrate code * update enforce * add enforce eq * add error type * update enforce * add test_flash_attention * Polish codes and fix compiling errors. * Set num_splits to 0 for flash-attn with tensor mask. * Fix the compiling error for non flash-attn case. --------- Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
- 04 Aug, 2023 1 commit
Committed by kangguangli
* fix bug: feed_with_place should consider variable existence * fix * fix build scope * change method to set feed var name * remove feed_with_place to placeholder * fix * rename to data * fix * fix
- 31 Jul, 2023 2 commits
Committed by hong
* fix new ir shadow typo * update
-
Committed by wanghuancoder
support stride
- 27 Jul, 2023 1 commit
Committed by GGBond8488
【inplace api】batch add inplace api paddle.log_, paddle.i0_, paddle.nn.functional.leaky_relu_... (#55576) * batch add inplace api * add inplace test * add activation inplace * fix test * remove atan2 ge, gt, le, lt, nq * remove atan2 ge, gt, le, lt, nq * fix windows ci error * rerun ci * fix typro * fix bugs --------- Co-authored-by: zhangrui34 <v_zhangrui34@baidu.com>
- 25 Jul, 2023 1 commit
Committed by hong
* add kernel dialect * change DenseTensorTypeStorage to DenseTensorType * add test case` * add first pd_op to kernel dialect * lower pd op to kernel dialect * update * update * remove useless code * add attrite print test * fix bug * update * update * update * update * polish code * fix bug * polish code and add python test * add test * fix test error * relax constraint when inserting get_parameter * add env flag * fix bug * dygraph2static support new ir * fix bug * revert test env * change cc_test_old to cc_test * update * fix build_static bug * update test * fix type test error * udpate cmake * disable test in windows * fix inference compile * fix program translator error * only run on cpu, not support gpu yet * fix conflict * polish code * fix bug * add feed with place op * update * remove useless unitest * udpate mkldnn * update * update * align mkldnn version * new ir support builtin slice op * fix bug * fix phi kernel adaptor bug * add enable static * add enable_static * remove useless test case * change feed list to single variable * update * add feed with place and shaddow output op * fix bug * remove usless code * support gpu * fix bug * fix bug * remove template * add more data type * fix cimpile bug * udpate * remove useless code * revert dygraph2st test * remove usless code * revert op * fix bug * new ir dygraph2static support gpu * remove usless code * code polish * add const * revert code and remove useless code * revert code * revert legacy op yaml * remove useless code * delete std::move --------- Co-authored-by: kangguangli <kangguangli@hotmail.com>
- 20 Jul, 2023 1 commit
Committed by zyfncg
- 19 Jul, 2023 1 commit
Committed by zhangyuqin1998
* delete relu6_raw * fix codestyle * Update test_mkldnn_matmul_activation_fuse_pass.py * fix * Update backward.yaml * Update ops.yaml * Update backward.yaml
- 18 Jul, 2023 1 commit
Committed by GGBond8488
* batch add inpalce api * fix inplace fn generate * add test for new inpalce api * fix typro * fix typro * fix typro * fix test error * fix atan2 * remove atan2 * auto genereate inpalce api * fix inplace generate fn error * fix windows error * fix test error * fix test error * fix windows ci error * fix test error * fix test_error * fix test error * fix eigen aliasing error in inplace * remove elementwise_pow inplace * fix doc error * fix test error
- 14 Jul, 2023 1 commit
Committed by RedContritio
- 11 Jul, 2023 2 commits
Committed by MarDino
* add rmsnorm kernel * add static graph test * fix round type * use alignas to avoid msvc compile error * remove redundant headerfile to avoid rocm compile error * fix rocm compile not found cub * Add document
-
Committed by FormlessUnit
* rename weight_only/llm.int8
- 10 Jul, 2023 1 commit
Committed by hong
* fix affine grid bug * revert cummax
- 04 Jul, 2023 1 commit
Committed by RedContritio
- 03 Jul, 2023 2 commits
Committed by lzydev
* support auto-gen concat * fix bug in legacy_backward.yaml * fix bug in get_expeceted_kernel_type
-
Committed by FormlessUnit
* add linear_compress API
- 30 Jun, 2023 3 commits
Committed by lzydev
-
Committed by lzydev
-
Committed by Zhenghai Zhang
* static graph autogen code for expand * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug
- 28 Jun, 2023 2 commits
Committed by GGBond8488
* add clone for inpalce * fix name * add inplace pow * fix typro * add note * fix typro * fix typro * fix bug * fix test error * add type error test * adjust indentation
-
Committed by Siming Dai
* add pos_weight in kernel * fix unittest * fix xpu * fix bce unittest, change infermeta order
- 26 Jun, 2023 2 commits
Committed by cyberslack_lee
-
Committed by Sanbu
- 20 Jun, 2023 1 commit
Committed by zhangbo9674
* new_ir to shared * refine code * add ir lib path to env * refine type * refine code * fix bug * fix bug * refine code * refine code * close win * refine code * refine code * refine code * add win share * refine code * refie code * refine code * refine code * refien code * fix bug * fix bug * fix bug * solve conflict * solve conflict * fix bug * refine code * fix bug * fix bug * fix bug * fix bug * fix bug * refine code * fix interpretercore program bug * delete unuse code * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix cinn bug * fix cinn bug * debug * fix cinn bug * delete unused code * fix cinn bug * fix cinn bug * fix ug * test win openblas * test win openblas * fix win openblas bug * polish code * fix win open blas bug * close win dll * fix flag bug * test for windows * fix compile bug
- 16 Jun, 2023 1 commit
Committed by zhiboniu
* fix lamb always_adapt * fix optest * fix all optests
- 15 Jun, 2023 1 commit
Committed by Zhenghai Zhang
- 14 Jun, 2023 1 commit
Committed by huangjiyi
* update * update
- 13 Jun, 2023 1 commit
Committed by NetPunk
- 09 Jun, 2023 1 commit
Committed by lzydev
* auto generate code for elementwise_max * auto generate code for elementwise_max * fix composite ops * fix bug of fmax
- 08 Jun, 2023 1 commit
Committed by Yiqun Liu
* Add outputs to check_numerics_kernel. * Add check_numerics to yaml. * Add API and unittest. * Add check_nan_inf_level as argument of check_numerics_kernel. * Add more unittests. * Fix static API implementation and unittest. * Move the implementation of check_numerics to paddle.amp. * Fix import error.
- 05 Jun, 2023 2 commits
Committed by PommesPeter
* feat: added polygamma init code * feat: added polygamma unittest code * test: added more test cases * refactor: added forward impl * refactor: added backward impl * test: updated cases * refactor: updated test cases * refactor: added more case and fixed some bugs * test: updated ref func * refactor: updated code style * refactor: move the code * refactor: updated test * refactor: updated test * docs: updated en doc Co-authored-by: zachary sun <70642955+sunzhongkai588@users.noreply.github.com> * docs: updated math eq --------- Co-authored-by: zachary sun <70642955+sunzhongkai588@users.noreply.github.com>
-
Committed by huangjiyi
Support code generation for op conv2d_transpose, conv3d_transpose, depthwise_conv2d_transpose (#54242)