- 27 10月, 2022 3 次提交
-
-
由 HongyuJia 提交于
-
由 Jacek Czaja 提交于
* - halfway transforming transpose grad - Fixes - buildable * - lint * rerunning the process
-
由 Bo Zhang 提交于
* fix reduce_any kernel data race on sharedMem * use bit operation instead of div & mod * unbranch * modified according to PR comments
-
- 26 10月, 2022 6 次提交
-
-
由 HongyuJia 提交于
* remove prior_box mkldnn hard code * add header file * simplify PD_VISIT_TYPE * decouple dependency between prior_box and density_prior_box * fix pragma omp parallel error * bypass #pragma omp_parallel_for error * polish code * remove visit_type headerfile * polish codestyle * polish codestyle * try fix CI error * add testcase, datatype=float64 * reset test_prior_box testcase * add datacheck to DenseTensor * update template name * call prior_box with macro expand
-
由 Lin Manhui 提交于
* Fix paddle.pow() bugs * Add unittest cases * Fix ut cases * Add ut cases on multiple devices
-
由 zhangkaihuo 提交于
-
由 Wang Xin 提交于
fix uninitialized, tautological-constant-out-of-range-compare and literal-conversion warning on macos (#47341)
-
由 zyfncg 提交于
* fix inference perfermence problem caused by selecting cudnn kernel for softmax * recover use_cudnn in opmaker of softmax
-
由 Chen Weihang 提交于
-
- 25 10月, 2022 4 次提交
-
-
由 jakpiase 提交于
* return proper state * fix for dims * fix
-
由 jakpiase 提交于
-
由 Wang Xin 提交于
-
由 zhouweiwei2014 提交于
-
- 24 10月, 2022 5 次提交
-
-
由 zyfncg 提交于
* support selected_rows kernel for multiply in dygraph * delete useless code of slice in fluid * fix complie bug * move slice_array from fluid to phi * fix strided_slice_op_npu
-
由 zyfncg 提交于
* gene static graph code for exp * refactor the doc of exp * fix bug * fix bug * update doc of exp * fix sparse op
-
由 Yiqun Liu 提交于
-
由 Zhang Ting 提交于
-
由 Yiqun Liu 提交于
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 21 10月, 2022 2 次提交
-
-
由 zhangyikun02 提交于
-
由 Leo Guo 提交于
Fix the bug where the device memory address appears in abs_grad kernel fallback to CPU. test=kunlun (#47186)
-
- 20 10月, 2022 4 次提交
-
-
由 zhangkaihuo 提交于
* fix indices
-
由 JingZhuangzhuang 提交于
* Add infer prune function * Update phi.cmake * Update operators.cmake * add fusion op
-
由 thunder95 提交于
-
由 zhangkaihuo 提交于
-
- 19 10月, 2022 6 次提交
-
-
由 Yuanle Liu 提交于
-
由 zyfncg 提交于
* rename op in yaml * fix test_layout_autotune * fix layout autotune of transpose
-
由 Chen Weihang 提交于
-
由 Yiqun Liu 提交于
Enable to record whether the conv algo is got by exhaustive search to fix autotune cache bug. (#47065)
-
由 will-jl944 提交于
-
由 xiongkun 提交于
* [Dy2Static] Remove GradTransformer 1. fix einsum infershape bugs. 2. remove grad_transformer and unify paddle.grad and paddle.static.gradient. 3. add dygraph_and_dy2static_only decorator for dy2static. * fix bugs * rename
-
- 18 10月, 2022 5 次提交
-
-
由 zhouweiwei2014 提交于
-
由 seemingwang 提交于
* add embedding range check * change head file * change head file * fix
-
由 liu zhengxi 提交于
-
由 houj04 提交于
* [XPU] update xpu cmake to 1016. test=kunlun * fix special case of transpose op. test=kunlun
-
由 zyfncg 提交于
* support generating code of opmaker for backward op invoke forward op * gsupport code-gen of opmaker for sparse op * refind logic of choose phi kernrel * fix complie budg * fix code_gen bug * fix bug * fix kernel signature code-gen * fix complie bug of VarType * fix complie bug of VarType * fix test_sparse_conv_op * fix test_sparse_norm_op
-
- 17 10月, 2022 4 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 OccupyMars2025 提交于
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
由 RedContritio 提交于
-