- 10 Mar, 2023 1 commit
Committed by HappyHeavyRain
* Add function node in phi_kernel for MKLDNN
* fix the bug in 'BuildInferVarKernelContext'
* add infer_varkernel_utils.cc
* fix the bug: the first two parameters of 'BuildInferVarKernelContext' can't be template variable
* change the code according to first review
* change the code according to first review
* change the mode of paddle_build.sh
* change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'
* add the error information
* fix NotFound information warning
* fix NotFound information warning
* fix NotFound information warning
- 09 Mar, 2023 1 commit
Committed by TaoTao Li
* add comm context for device context
* add broadcast phi operator kernel and api
* add broadcast support dtype, update ut
* fix broadcast bfloat16 type
* fix ut
* update test_collective_broadcast_api timeout to 300
- 01 Mar, 2023 1 commit
Committed by Chitsing KUI
* flash attn
* seed
* almost
* softmax
* fix workspace
* add unit test; linux only
* fix setup
* fix datatype include
* fix setup typo
* fix def scope
* new error api
* use paddle fork
* fix attr bug; complete ut
* update flash hash
* fix rng reset
* fix offset
* fix comments
- 16 Feb, 2023 1 commit
Committed by Huang Jiyi
* move variable_utils from phi_api_utils to fluid
* fix comment
* update include
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* update
* update
* fix CI-Windows-OpenBLAS
* fix bugs
* fix bugs
* fix bugs
* update include
* move variable_utils to phi_utils
* fix namespace
- 03 Jan, 2023 1 commit
Committed by zhoutianzi666
* Implement conv2d_fusion NHWC format using CUTLASS
* Add unit testing for CUTLASS Conv in inference
* Add experimental API for CUTLASS.
- 23 Dec, 2022 1 commit
Committed by Hui Zhang
* add warp transducer code
- 22 Dec, 2022 1 commit
Committed by xiaoxiaohehe001
- 19 Dec, 2022 1 commit
Committed by huangjiyi
* move gather_scatter_kernel from fluid to phi
* mv gather_scatter_kernel to gather_scatter_functor
- 17 Dec, 2022 1 commit
Committed by Wen Sun
- 16 Dec, 2022 1 commit
Committed by Wen Sun
- 06 Dec, 2022 1 commit
Committed by zyfncg
* delete Bias and ResidualData in OpMaker of conv2d
* delete extra input of conv3d
* refactor pass of conv_bias_fusion
* fix mkldnn dependency
* fix mkldnn compile
* fix test_conv_bias_mkldnn_fuse_pass
* polish some code
* remove useless log
* fix analyzer_vit_ocr_tester
* fix conv_activation_mkldnn_fuse_pass
* fix test_analyzer_ocr
* add fused_conv_sig
* fix performance regression
* fix performance regression
- 05 Dec, 2022 1 commit
Committed by huangjiyi
- 18 Nov, 2022 1 commit
Committed by Tian Zheng
* Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation
* Fix macro
* Add implementation for conv_kernel and conv_grad_kernel
* Modification after rebase onto latest develop
* Modify plan cache to comply with the API of phi::autotune
* Refactor to reduce duplicate code
* Review fix:
  - move functions in conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernelk.cu
  - add const specifier for input tensor
  - add logging when plans fail to execute
  - move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h
  - move plan building outside of cache
* Fix ROCM build
- 31 Oct, 2022 1 commit
Committed by ronnywang
* [CustomDevice] GetCCLComm add custom device support
* update
* update
* update
- 20 Oct, 2022 1 commit
Committed by JingZhuangzhuang
* Add infer prune function
* Update phi.cmake
* Update operators.cmake
* add fusion op
- 19 Sep, 2022 1 commit
Committed by zhangkaihuo
* sparse infer_meta
- 09 Sep, 2022 1 commit
Committed by Chen Weihang
* add fusion dir and fuse_softmax_mask kernel
* remove fusion kernel dir
* migrate infershape
* fix code error
- 06 Sep, 2022 1 commit
Committed by YuanRisheng
* add tensor array
* fix ci bugs
* fix ci bugs
* fix ci bugs
* fix ci bugs
* update by comment
* update code
- 02 Sep, 2022 1 commit
Committed by Aurelius84
* [XPU]Migrate Adam XPU kernel into Phi
* test=kunlun
- 30 Aug, 2022 1 commit
Committed by HongyuJia
* add coalesce_tensor kernel
* polish coalesce_tensor kernel
* add sig and InferMeta
* add testcase
* add legacy_api.yaml
* fix infermeta
* fix yaml
* fix kernel implementation
* add compile dependency of phi/kernels
* fix MetaConfig
* add python api
* add and fix testcase
* rnn.py add import
* change _C_ops.coalesce_tensor
* remove useless comments
* add SetBackend
* restore XPU kernel temporarily
* fix code according to PR comments
- 26 Aug, 2022 1 commit
Committed by kangguangli
* remove fluid kernel and activate phi kernel
* fix parameter error
* transfer mkldnn part
* modify header file path
* fix compile error
* transfer special case
* fix lod setting and special case for layout setting
* add testcase and refine code
- 12 Aug, 2022 1 commit
Committed by LiYuRio
- 05 Aug, 2022 2 commits
Committed by YuanRisheng
* move mkldnn activation kernel
* fix compile bugs
* fix compile bugs
* deal with conflict
* fix compile bugs
* fix windows compile bugs
* mkldnn unittest fix
* change mutable to alloc
* fix unittest bugs
* modify code according to comment
Committed by Feiyu Chan
* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
* make stft_op use fft from phi/kernels/funcs, clean code
- 03 Aug, 2022 1 commit
Committed by zhiboniu
* add cmake enforce
* transfer multiclass_nms3 to phi
- 01 Aug, 2022 1 commit
Committed by zhiboniu
- 29 Jul, 2022 1 commit
Committed by zhiboniu
- 19 Jul, 2022 1 commit
Committed by Leo Chen
* compile into one static library
* fix xpu compile
* fix xpu compile
* fix inference compile
* fix inference compile
* add custom test
* revert one file
- 16 Jul, 2022 1 commit
Committed by Weilong Wu
* draft version
* draft version
* draft version
* migrate solve kernel to phi
* polish
* polish
* re useless header file, fix a bug in grad_kernel_impl
* add header file in need
- 14 Jul, 2022 1 commit
Committed by YuanRisheng
* adapt mkldnn kernel in PHI
* fix ci compile bugs
* fix compile bugs
* fix compile bugs
* fix compile bugs
* fix compile bugs
* delete comment
* fix compile bugs in windows-inference
* delete code for coverage
* modify code by review
* modify code by review
* add todo
* fix compile bugs
* fix compile bugs
* fix compile bugs
* fix unittest bugs
- 29 Jun, 2022 1 commit
Committed by Leo Chen
- 24 Jun, 2022 1 commit
Committed by YuanRisheng
* perfect copy
* deal with conflict
* deal with conflict
* fix compile bugs
* fix unittest bugs
* change code format
* deal with conflict
* modify code by review
* fix ce bugs
* fix ce bugs
* add lo
* perfect code format
* deal with conflicts
- 23 Jun, 2022 1 commit
Committed by Leo Chen
- 16 Jun, 2022 1 commit
Committed by Leo Chen
* fix xpu kp compilation
* add depends
- 10 Jun, 2022 1 commit
Committed by Leo Chen
* make all phi kernels to 2(host/device) static libraries directly
* fix calling kernel_declare
* fix compile
* fix cpu compile
* fix rocm compile
* fix xpu compile
* fix xpu kp compile
* fix inference compile
- 04 Jun, 2022 1 commit
Committed by Sing_chan
- 25 Apr, 2022 1 commit
Committed by zhouweiwei2014
* merge all phi lib to several big static lib
* merge all phi lib to several big static lib
- 15 Apr, 2022 2 commits
Committed by chentianyu03
* split reduce_kernel
* rm reduce_kernel in cmake
* split reduce_grad kernels
* fix cmake build error
* format code
* fix standalone_executor_test error
Committed by limingshu
* change cudnn helper for auto-tune
* Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm.
* Fix the bug in calculating and printing current step cache hit rate.
* Improve the autotune cache and fix unittest.
* Change the key from AlgorithmType to int64_t.
* Fix unittest for cpu-only env.
* change ChooseAlgoByWorkspace for heuristic mode
Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
- 10 Apr, 2022 1 commit
Committed by Chen Weihang