- 09 6月, 2023 1 次提交
-
-
由 xiaoguoguo626807 提交于
* modify multiply_grad_node create * add place choose * segment fault modify
-
- 05 6月, 2023 1 次提交
-
-
由 huangjiyi 提交于
-
- 01 6月, 2023 1 次提交
-
-
由 tianshuo78520a 提交于
* mv all unittests test * fix error * fix error * fix * fix * del unittests * fix paddle_build.sh * fix * fix test * fix add test * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * merge develop * fix test_async_read_write * fix test_async_read_write * merge develop * fix * fix import legacy_test * fix * fix * fix * fix * fix * fix * fix * fix * fix bug * fix * fix coverage test bug * fix * fix * fix * fix * fix * fix code sstyle * fix code * fix code * fix * fix * fix * del test_sequence_enumerate_op.py * fix
-
- 26 5月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by:
lzydev <1528794076@qq.com>
-
- 19 5月, 2023 1 次提交
-
-
由 limingshu 提交于
* Reorganize the forward codes of flash-attention. * Fix forward. * Remove some noused codes. * Simplify codes and fix backward. * Change all LOG(INFO) to VLOG and fix the backward. * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes * decrease the effect of debug print on performance * Unify the initialize of flashattn arguments. * Rewirte the reshape of temp_mask and temp_bias. * API support use_flash_attn. * Fix compiling error on CI. * Try to crop the flash-attention lib. * Correct the condition of whether can use flash-attn. * Remove the softmax_out argument. * Remove is_causal. * Polish codes. * Fix qkv_transpose_out's shape and scaling of Q * K. * Update commit of flash-attention. --------- Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
- 16 5月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
* Allow to switch whether to use promote strategy to choose kernel for O2 training. * Fix comparing error and add unittest.
-
- 25 4月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* add flags for phi * fix compile bugs * fix ci bugs * fix inference bugs * fix cinn' bugs * fix cinn bugs * perfect code according comment * fix ci bugs * fix ci bugs
-
- 23 4月, 2023 1 次提交
-
-
由 niuliling123 提交于
* Delete temp param in eager_gen
-
- 18 4月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 30 3月, 2023 1 次提交
-
-
由 wanghuancoder 提交于
* force sync batch norm grad sequential
-
- 07 3月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
-
- 31 1月, 2023 1 次提交
-
-
由 姜永久 提交于
* rm flags_retain grad in pybind * retain grads for xpu test * set retain grad for xpu * rm flag * lint --------- Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com>
-
- 09 1月, 2023 1 次提交
-
-
由 Jiabin Yang 提交于
* proto type of composite grad in paddle * proto type of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify igmnore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw expection when both GrapOpMaker and GradCompOpMaker not been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singlton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove addtional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com> Co-authored-by: Nwangruting <wangruting@baidu.com> Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Ncharles-hit <wanghao107@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
- 13 12月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* save fused_attention memory when dropout_rate = 0.0 * add ut * fix ut bug * fix fused_layernorm_residual_dropout_bias_test.cu
-
- 09 12月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* mv fused_bias_dropout_residual_ln to fluid manual dir * rm useless comments
-
- 08 12月, 2022 1 次提交
-
-
由 Kai Song 提交于
* fix 'BlasAXPBY unimplemented' error with custom device * fix utils CmakeLists bug
-
- 01 12月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* do not link python lib in tensor wrapper
-
- 07 11月, 2022 2 次提交
-
-
由 Wang Xin 提交于
-
由 wanghuancoder 提交于
* refine python lib link
-
- 01 11月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
- 26 10月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* fix fused_attention fused_feedforward * fix ci * fix ci * fix ci PADDLE_GET_CONST * fix ci ut
-
- 19 9月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 15 9月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* make eager log readable * fix compile error * recover test * invoke ci again
-
- 06 9月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 05 9月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 26 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
-
- 02 8月, 2022 1 次提交
-
-
由 xiaoguoguo626807 提交于
* manuel_fused_gemm_epilogue
-
- 01 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 29 7月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
-
- 19 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Rename BOOST_GET macros * Fix conflicts
-
- 14 7月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* Compilation optimization
-
- 13 7月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 09 7月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
-
- 07 7月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* fused_gate_attention manual code in eager
-
由 Jiabin Yang 提交于
* fix deriv with inplace * fix double grad bugs * remove additional file * fix compat dygraph mode * fix yaml remove additional yaml * fix slice double grad error and auto code gen logic error for higher order differentiate * fix fix_higher_order_deriv * remove additional include * fix fix_higher_order_deriv
-
- 06 7月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* fused_gate_attention manual code in eager * Menual fused_feedforward in eager * fix test case
-
- 04 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 30 6月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* fused_gate_attention manual code in eager * refine * refine * refine * refine * refine * refine
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 10 6月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* change tensor name * fix unittest bugs
-