提交 · a2020d0cc369d7d2cf5c4d7eae41f007afb8ab89 · PaddlePaddle / Paddle

05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
09 5月, 2022 1 次提交
- N
  
  Modified reduce for xpu2 (#42439) · ae4d1ec1
  由 niuliling123 提交于 5月 09, 2022
  
  ae4d1ec1
18 4月, 2022 1 次提交
- L
  
  [KP] Add Reduce op registry & UT for xpu_kp compilation (#41869) · b3959fe4
  由 Lijunhui 提交于 4月 18, 2022
  
  b3959fe4
14 4月, 2022 1 次提交
- C
  [Phi] Unify dispatch macros to visit (#41653) · 2ab986ae
  由 Chen Weihang 提交于 4月 14, 2022
```
* chnage dispatch to visit

* resolve conflict
```
  2ab986ae
12 4月, 2022 1 次提交

[KP] Add Logical/compare/bitwise registry & UT (#40802) · 3749198e

由 Lijunhui 提交于 4月 12, 2022

* init commit no push

* collect comile errors

* bitwise UT

* fix compile problem

* cancel comments

* restore miss deletion

* fix compilation

* fix UT

* NO stash in multiple branch at the same times

* fix error

* combine .cu from gpu and kps

* replace gpu by kps

* fix by Chen-weihang

* Revert "Fix kps compile error in Junhui logic compare bitwise"

* fix backend test

* rm comments
Co-authored-by: NChen Weihang <chenweihang@baidu.com>

3749198e

03 4月, 2022 1 次提交

add maximum limit for grid of index_select (#41127) · af8d2482

由 FlyingQianMM 提交于 4月 03, 2022

* limit grid dim for index select

* mv LimitGridDim into gpu_launch_config.h

* fix conflicts

* fix conflicts

* fix code style

* set block to 256

* fix grid setting

* set dtype of block_dim to unsigned int

af8d2482

02 4月, 2022 1 次提交
- N
  
  Fix a bug when reduceHigherDim in HIP (#41273) · 7dd4a9fe
  由 niuliling123 提交于 4月 02, 2022
  
  7dd4a9fe
25 3月, 2022 1 次提交
- F
  add maximum limit for grid of reduce, elementwise, gather and scatter (#40813) · 608a5f55
  由 FlyingQianMM 提交于 3月 25, 2022
```
* add maximum limit for grid of reduce, elementwise and gather

* add {} after if
```
  608a5f55
24 3月, 2022 1 次提交
- N
  
  Add is_mean param for mean op (#40757) · 7e1155ed
  由 niuliling123 提交于 3月 24, 2022
  
  7e1155ed
17 3月, 2022 1 次提交
- N
  Replace PADDLE_WITH_XPU2 with PADDLE_WITH_KP (#40560) · c142e37d
  由 niuliling123 提交于 3月 17, 2022
```
* Replace PADDLE_WITH_XPU2 with PADDLE_WITH_KP
```
  c142e37d
08 3月, 2022 1 次提交
- Y
  
  Rename phi::func::TensorReduceImpl to phi::func::ReduceKernel. (#40183) · 688743bf
  由 Yiqun Liu 提交于 3月 08, 2022
  
  688743bf
07 3月, 2022 1 次提交

[Phi] Remove storage deps of empty (#40136) · b46e49de

由 Chen Weihang 提交于 3月 07, 2022

* remove storage deps of empty

* remove invalid empty method

* remove error empty using

* fix test_sparse_utils_dev_api

* revert some sparse change

* add memset for conv grad

* resolve conflict

* resolve conflict

* resolve conflict

b46e49de

04 3月, 2022 1 次提交

[phi]move reduce gpu impl funcs into pten/kernels/funcs (#39990) · e2e2d531

由 chentianyu03 提交于 3月 04, 2022

* move reduce gpu impl funcs into pten/kernels/funcs

* change reduce header name and namespace

* fix spell word error

* change mutable_data to dev_ctx.Alloc

* modify place to devcontex

* format code style

* fix build error

* fix build error

* fix conflict

e2e2d531

03 3月, 2022 1 次提交
- N
  Modified Reduce for XPU2 (#38918) · 909d1e61
  由 niuliling123 提交于 3月 03, 2022
```
1. set xpu2 block_size = 64
2. fix a bug when reduce_num is too large
```
  909d1e61
20 2月, 2022 2 次提交
- C
  [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986
  由 Chen Weihang 提交于 2月 20, 2022
```
* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed
```
  dcfe1986
- Y
  
  Rename the general elementwise and broadcast functions. (#39623) · 553afc07
  由 Yiqun Liu 提交于 2月 20, 2022
  
  553afc07
19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

17 2月, 2022 1 次提交

[PTen] Clean useless header in pten core (#39560) · c05cd7ed

由 Chen Weihang 提交于 2月 17, 2022

* clean useless header in pten core

* fix compiled failed

* fix cmake target

* fix typo

* resolve conflict

c05cd7ed

11 2月, 2022 1 次提交
- Z
  Support different dtypes of inputs for elementwise ops (#38859) · bf305033
  由 Zhang Ting 提交于 2月 11, 2022
```
* improve backward performance

* support different dtypes for elementwise ops
```
  bf305033
09 2月, 2022 1 次提交
- Y
  
  Rename partial function name TensorReduceFunctorImpl to TensorReduceImpl. (#39387) · 6354f81c
  由 Yiqun Liu 提交于 2月 09, 2022
  
  6354f81c
08 2月, 2022 1 次提交
- C
  Fix reduce_sum dtype dispatch bug on gpu (#39349) · 4d7ad277
  由 Chen Weihang 提交于 2月 08, 2022
```
* fix pten reduce dispatch bug

* add cast beforce reduce

* fix test failed
```
  4d7ad277
06 2月, 2022 1 次提交
- W
  
  [PTEN] Add Gpu context (#39305) · a821c4a9
  由 Wilber 提交于 2月 06, 2022
  
  a821c4a9
29 1月, 2022 1 次提交

[PTen] Tidy pten core headers (#39188) · dd990981

由 Chen Weihang 提交于 1月 29, 2022

* open header for custom kernel

* add core utils

* tidy core code

* tify header

* tidy include

* tidy namespace

* resolve conflit

* fix unittest and coverage

* remove platform using

* resolve conflict

* resolve conflict

* fix digamma namespace error

* fix xpu full kernel error

* fix xpu full kernel error

* polish details

* add place for lib storage

dd990981

26 1月, 2022 1 次提交
- Y
  [Pten]Move kernel_primitives lib to Pten directory (#39169) · 452bcbe2
  由 YuanRisheng 提交于 1月 26, 2022
```
* move kernel_primitives

* use pten's errors
```
  452bcbe2
25 1月, 2022 3 次提交
- N
  Revert "Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#38959)" (#39205) · 978558be
  由 niuliling123 提交于 1月 25, 2022
```
This reverts commit 9059ef69.
```
  978558be
- N
  
  Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#38959) · 9059ef69
  由 niuliling123 提交于 1月 25, 2022
  
  9059ef69
- X
  [PTen] Migrate string tinyformat errors and part of enforce into pten (#39051) · 6ca49164
  由 xiongkun 提交于 1月 25, 2022
```
* transfer: string tinyformat errors and part of enforce into pten

* remove comment

* fix by code review

* assert is not compile in -DNDEBUG

* add string as dependences of paddle_inference
```
  6ca49164
24 1月, 2022 1 次提交

石

[Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7

由石晓伟提交于 1月 24, 2022

* updates callers, test=develop

* updates tensor, test=develop

* fixes errors, test=develop

* remove some dtypes, test=develop

* fix errors in the base storage modification, test=develop

* fixes a bug, test=develop

* fixes the bugs in push the whole, test=develop

* updates, test=develop

* update

* update, test=develop

* fixes the mac-py3 CI, test=develop

* remove the storage impl, test=develop

* updates some codes, test=develop

* update, test=develop

* updates pten allocation, test=develop

a56e16a7

21 1月, 2022 1 次提交
- A
  [PTen]Migrate Dim and DDim from paddle::framework into pten namespace (#39053) · 4e23ba32
  由 Aurelius84 提交于 1月 21, 2022
```
* Migrate Dim and DDim from paddle::framework into pten namespace

* fix paddle::framework::Array

* fix framework::Array
```
  4e23ba32
18 1月, 2022 2 次提交

[Unify Tensors PR ] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

由 Zhanlue Yang 提交于 1月 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

2052f1e3

Y

break the circular dependency between reduce and elementwise (#38951) · a1980d9c
由 YuanRisheng 提交于 1月 18, 2022

a1980d9c

07 1月, 2022 1 次提交
- N
  
  Fix a bug when reduce_num = 1 in Reduce Op (#38771) · f634c0b1
  由 niuliling123 提交于 1月 07, 2022
  
  f634c0b1
05 1月, 2022 1 次提交

[pten]Move reduce code new (#38648) · 7a4a512d

由 chentianyu03 提交于 1月 05, 2022

* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs

* fix compile bugs

* move reduce files by new rule

* add set header

* format code style

* merge develop and fix conflict

* merge develop and fix conflict
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>

7a4a512d

04 1月, 2022 1 次提交
- C
  [PTen] Move inner empty and cast api to kernel.h (#38587) · 64538c8d
  由 Chen Weihang 提交于 1月 04, 2022
```
* move inner cast api to cast_kernel.h

* resolve conflit
```
  64538c8d
28 12月, 2021 1 次提交

[pten] remove in_type arg in cast kernel (#38486) · 0637b9a6

由 chentianyu03 提交于 12月 28, 2021

* remove intype arg in cast kernel

* modify conj config in api.yaml by dictionary order

* rm unused code in cast_kernel.cu

0637b9a6

27 12月, 2021 1 次提交
- C
  [PTen] Move cast kernel impl (#38382) · 1fb734d7
  由 Chen Weihang 提交于 12月 26, 2021
```
* rename to api to copy_to

* revert needless change

* polish format
```
  1fb734d7
26 12月, 2021 1 次提交

[PTen] Move copy kernel impl (#38421) · 73819658

由 Chen Weihang 提交于 12月 26, 2021

* add register general kernel marco

* move copy kernel impl

* revert needless change

* polish details

* fix xpu compil faild

* fix xpu compile failed

* polish format

73819658

24 12月, 2021 1 次提交

[pten] combine reduce_cuda codes (#38328) · 08941eda

由 chentianyu03 提交于 12月 24, 2021

* combine reduce_cuda codes

* support float16 in pten redcue_mean

* replace ReduceCudaKernel impl with pten reduce impl

* mv reduce funcs into reduce_cuda_impl

* rm unsed codes and headers

* mv GetReduceDim into reduce_cuda_impl

* recover GetReduceDim in reduce_op.h

* add new dispatch macro

* fix pool op output not inited and cause transform to pten::denseTensor error

* fix output tensor not initialized error

* rename new dispatch macro and format code style

* rm reduce_functor_op.h file

08941eda

21 12月, 2021 1 次提交
- C
  [PTen] Rename cuda dir and context to gpu (#38296) · dc7597e3
  由 Chen Weihang 提交于 12月 21, 2021
```
* rename cuda to gpu

* revert CMake change

* resolve conflit

* rename other cuda to gpu

* poish details
```
  dc7597e3
15 12月, 2021 1 次提交
- C
  
  replace moves_storage and alloc_construct (#38134) · e78eb3f4
  由 Chen Weihang 提交于 12月 14, 2021
  
  e78eb3f4

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功