- 20 6月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 06 4月, 2023 1 次提交
-
-
由 Sonder 提交于
* add kernel functions * update kernel functions * update func parameters' name * create codes for gpu device * 调整文件位置 * fix include error * remove dependent files to phi/ * restore fused_attention_op.cu * fix dependence errors * fix dependence errors * fix include error * fix all depandence errors[build success] * remove useless include * recover useless include * use phi::ToNCCLDataType * fix namespace * update new register code * fix error in fused_gemm_epilogue_utils * fix error in FusedAttentionKernel parm * finish fused_attention registe code[build success] * add paddle::optional * add sig file * fix build error * fix a include error * update CMkaeList * fix parameter sequence * add include file * update #if before include * fix grammly error * update codes for DropoutParam * remove const cast * trans some fluid api to phi api * add #if * update test code * update test codes * recover test codes * trans fused_attention to fluid * move #endif to end * move #endif * delete useless files * use fused attention utils and recover random seed * remove fluid include in phi
-
- 06 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* move DeviceContextPool to phi * add EmplaceExternalContextFunc * update namespace * update cmake * fix bugs and create context_pool_impl.h * replace platform::is_xxx_place * fix bugs * update generator * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix enforce usage * Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27. * fix bugs * rm XPUDeviceContext and CustomDeviceContext * fix bugs * fix fix context init bug * fix bugs after merge * fix bugs * fix name * fix mutable_data * update and fix bugs * fix bugs * update * fix bugs * fix name * fix bugs * merge * fix bugs * create context_pool in phi/backends * create context_pool in phi/backends * fix bugs * fix xpu bugs * fix rocm bugs * fix bugs * fix bugs * fix bugs * fix xpu bugs * update * update * fix bugs * fix bugs
-
- 02 3月, 2023 1 次提交
-
-
由 Leo Chen 提交于
* register fp16 and bf16 kernel for uniform_random * fix compile * support selected_rows * add ut * revert cpu * fp16 test skip cpu
-
- 03 2月, 2023 1 次提交
-
-
由 RedContritio 提交于
-
- 02 12月, 2022 1 次提交
-
-
由 Bo Zhang 提交于
* profile reduce kernel for fp16 and reduceHigherdim * use reinterpret_cast * fix for CI on ROCm * add Macro for ROCm * ROCm CI config * ROCm CI config * unit test repair * pull * add common_funcs.h * reduceType * Update reduce_function.h * not higher * rename
-
- 21 11月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* refine reduce_all
-
- 18 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* move "paddle/phi/backends/gpu/gpu_device_function.h" to phi * update copyright years * rm "fluid/platform/device/gpu/gpu_device_function.h" in phi * fix rocm-complie bugs
-
- 10 11月, 2022 1 次提交
-
-
由 PuQing 提交于
* rm fluid gpu_launch_config * fix type
-
- 31 10月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 20 9月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 23 8月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 07 6月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 09 5月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 18 4月, 2022 1 次提交
-
-
由 Lijunhui 提交于
-
- 14 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* chnage dispatch to visit * resolve conflict
-
- 12 4月, 2022 1 次提交
-
-
由 Lijunhui 提交于
* init commit no push * collect comile errors * bitwise UT * fix compile problem * cancel comments * restore miss deletion * fix compilation * fix UT * NO stash in multiple branch at the same times * fix error * combine .cu from gpu and kps * replace gpu by kps * fix by Chen-weihang * Revert "Fix kps compile error in Junhui logic compare bitwise" * fix backend test * rm comments Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 03 4月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
* limit grid dim for index select * mv LimitGridDim into gpu_launch_config.h * fix conflicts * fix conflicts * fix code style * set block to 256 * fix grid setting * set dtype of block_dim to unsigned int
-
- 02 4月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 25 3月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
* add maximum limit for grid of reduce, elementwise and gather * add {} after if
-
- 24 3月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 17 3月, 2022 1 次提交
-
-
由 niuliling123 提交于
* Replace PADDLE_WITH_XPU2 with PADDLE_WITH_KP
-
- 08 3月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 07 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove storage deps of empty * remove invalid empty method * remove error empty using * fix test_sparse_utils_dev_api * revert some sparse change * add memset for conv grad * resolve conflict * resolve conflict * resolve conflict
-
- 04 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move reduce gpu impl funcs into pten/kernels/funcs * change reduce header name and namespace * fix spell word error * change mutable_data to dev_ctx.Alloc * modify place to devcontex * format code style * fix build error * fix build error * fix conflict
-
- 03 3月, 2022 1 次提交
-
-
由 niuliling123 提交于
1. set xpu2 block_size = 64 2. fix a bug when reduce_num is too large
-
- 20 2月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
由 Yiqun Liu 提交于
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 17 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* clean useless header in pten core * fix compiled failed * fix cmake target * fix typo * resolve conflict
-
- 11 2月, 2022 1 次提交
-
-
由 Zhang Ting 提交于
* improve backward performance * support different dtypes for elementwise ops
-
- 09 2月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 08 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* fix pten reduce dispatch bug * add cast beforce reduce * fix test failed
-
- 06 2月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 29 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* open header for custom kernel * add core utils * tidy core code * tify header * tidy include * tidy namespace * resolve conflit * fix unittest and coverage * remove platform using * resolve conflict * resolve conflict * fix digamma namespace error * fix xpu full kernel error * fix xpu full kernel error * polish details * add place for lib storage
-
- 26 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move kernel_primitives * use pten's errors
-
- 25 1月, 2022 3 次提交
-
-
由 niuliling123 提交于
This reverts commit 9059ef69.
-
由 niuliling123 提交于
-
由 xiongkun 提交于
* transfer: string tinyformat errors and part of enforce into pten * remove comment * fix by code review * assert is not compile in -DNDEBUG * add string as dependences of paddle_inference
-