提交 · 88966b283952096f81aab4918b7d83b303aabad2 · Crayon鑫 / Paddle

15 1月, 2022 1 次提交

[Unify Tensors PR #7] Merged LoDTensor with Tensor, test=allcases (#38880) · 88966b28

由 Zhanlue Yang 提交于 1月 15, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Fixed example code failure

* Polished function names, removed duplicated forward declarations

88966b28

14 1月, 2022 3 次提交

[XPU]add stack_grad op for kunlun2,*test=kunlun (#38674) · 87ee3e4f

由 Zhangjingyu06 提交于 1月 14, 2022

* [XPU]add split op for kunlun2,*test=kunlun

* [XPU]add split op for kunlun2,*test=kunlun

* [XPU]add split op for kunlun,*test=kunlun

* [XPU]add stack_grad op for kunlun2,*test=kunlun
Co-authored-by: NQingshuChen <chenqingshu@baidu.com>

87ee3e4f

Y

refactor impl of elementwise op part2 (#38898) · 556d5097
由 YuanRisheng 提交于 1月 14, 2022

556d5097

[MLU]Add mean and reduce_mean op (#38872) · 7f8d5bc8

由 qipengh 提交于 1月 14, 2022

* [MLU]: add mean and reduce mean op

* [MLU]add mlu pytest dir in CMakeLists.txt

* [MLU]fix tensor data

* [MLU]fix TensorToPyArray and license

7f8d5bc8

13 1月, 2022 7 次提交

S

[bug fix] fix unfold bug in compile time (#38907) · 7f123456
由 shangliang Xu 提交于 1月 13, 2022

7f123456
F
[NPU] fix tril_triu (#38864) · eaccdc71
由 furnace 提交于 1月 13, 2022
```
[NPU] fix tril_triu
```
eaccdc71
F
[NPU] fix expand op (#38526) · 7a5af630
由 furnace 提交于 1月 13, 2022
```
* [NPU] fix expand op

* [NPU] optimize codes

* [NPU] optimize codes
```
7a5af630

[pten]Remove pten/include dir files (#38878) · 7e0292ea

由 chentianyu03 提交于 1月 13, 2022

* move dot_dev api into dot_kernel.h

* add infermate header

* modify to dotkerel in dot_op.h

* mvoe conj dev api into complex_kernel.h

* move sign dev api into  sign_kernel.h

* move scale dev api into kernel.h and remove infermete.h

* rm paddle/pten/include/math.h

* rm paddle/pten/include/math.h

* rm include dir

* rm paddle/pten/include/math.h

* fix conflict with develop branch

* rm devContext in conj_op.h

* add the missing complex_kernel header

7e0292ea

Added mul BF16/FP32 FWD/BWD oneDNN kernel (#38552) · fc6eed5b

由 jakpiase 提交于 1月 13, 2022

* base changes for mul reimplementation

* empty commit

* tmp save

* full implementation of mul bf16/fp32 fwd bwd

* CI fix

* CI rerun

* changed unity build cmake to avoid gpu issues

* removed mul mkldnn from unity build

* added skipping tests if not cpu_bf16

* CI fix

* CI fix

* CI fix

fc6eed5b

C
Fix mkldnn invalid infershape impl (#38837) · 281644cd
由 Chen Weihang 提交于 1月 13, 2022
```
* fix mkldnn invalid infershape

* add unittest for mkldnn in new executor

* add import os
```
281644cd
石

splits allocation for pten, test=develop (#38853) · 277cf900
由石晓伟提交于 1月 13, 2022

277cf900

12 1月, 2022 13 次提交
- Z
  [part 3]change type of function args (#38887) · 0efcae86
  由 Zhang Ting 提交于 1月 12, 2022
```
* code clean

* [part 3]change type of function args
```
  0efcae86
- S
  Fix conv act int8 scale (#38331) · 4825addd
  由 Sylwester Fraczek 提交于 1月 12, 2022
```
* fix conv act int8 scale

* add unit test for conv+hard_swish
```
  4825addd
- X
  support 5d for nearest interp (#38868) · d296456c
  由 xiaoting 提交于 1月 12, 2022
```
* support 5d for nearest

* update nearest3d unittest, test=develop

* fix approve ci, test=develop

* fix approve ci, test=develop
```
  d296456c
- L
  optimize elementwise_max_grad using new interfaces (#37906) · 4a64ca1e
  由 Lijunhui 提交于 1月 12, 2022
```
* init elem_max_grad op

* optimize code and reply review comments

* ternary functors

* apply new reduce func

* move functor to .h

* multi-outputs init

* rearrange code

* modifed functors

* optimizer code

* pass nullptr

* revert the last change as seg fault occurs

* optimize code

* remove inplace

* remove comments
```
  4a64ca1e
- C
  [PTen] Remove hybird dir (#38863) · 5f5f626b
  由 Chen Weihang 提交于 1月 12, 2022
```
* remove hybird dir

* resolve conflit
```
  5f5f626b
- L
  optimize elementwise_min_grad using new reduce interface (#38236) · c2f825d7
  由 Lijunhui 提交于 1月 12, 2022
```
* ini commit

* multi-outputs init commit

* optimize code

* remove inplace
```
  c2f825d7
- Z
  
  [part 6]change type of function args (#38891) · 12c5b1fe
  由 Zhang Ting 提交于 1月 12, 2022
  
  12c5b1fe
- C
  [pten]Move dot, conj, sign dev_api into kernel.h (#38862) · 5fc8bbf7
  由 chentianyu03 提交于 1月 12, 2022
```
* move dot_dev api into dot_kernel.h

* add infermate header

* modify to dotkerel in dot_op.h

* mvoe conj dev api into complex_kernel.h

* move sign dev api into  sign_kernel.h
```
  5fc8bbf7
- Y
  [PTen]Refactor impl of elementwise op grad_kernel (Part1) (#38873) · 676903d5
  由 YuanRisheng 提交于 1月 12, 2022
```
* refactor the impl of elementwise grad kernel

* refactor impl of elementwise grad kernel(cuda)

* fix compile bugs
```
  676903d5
- Z
  
  [part 4]change type of function args (#38888) · a250c56c
  由 Zhang Ting 提交于 1月 12, 2022
  
  a250c56c
- Z
  
  [part 2]change type of function args (#38886) · 86434818
  由 Zhang Ting 提交于 1月 12, 2022
  
  86434818
- Z
  
  [part 1]change type of function args (#38885) · df5d55bb
  由 Zhang Ting 提交于 1月 12, 2022
  
  df5d55bb
- L
  Adjust warpper of gpu_lanuch_config (#38654) · f5166284
  由 limingshu 提交于 1月 12, 2022
```
* first commit

* fix wrong filename

* fix the wrong spell name

* fix gpu config warper

* modify according to pr advices

* fix GpuLauchConfig1D api bugs

* change the config for dropout grad

* fix bugs

* modification according to pr advices

* modification according to pr advices
```
  f5166284
11 1月, 2022 6 次提交
- Y
  
  refactor reshape grad kernel (#38833) · 8cc09552
  由 YuanRisheng 提交于 1月 11, 2022
  
  8cc09552
- Z
  【PTen】Add dot and matmul grad kernel in pten (#38713) · be817719
  由 zyfncg 提交于 1月 11, 2022
```
* refactor matmul directory in pten

* fix merge conflict

* add dot_grad kernel

* add dot_grad kernel in pten

* add matmul_grad kernel

* update the code

* delete useless code in fluid

* fix some bug of running matmul grad kernel

* fix merge conflict

* refactor some code

* refactor code
```
  be817719
- Z
  Fix bug in elementwise_mul/div_grad when inplace strategy (#38840) · 7915d180
  由 Zhang Zheng 提交于 1月 11, 2022
```
* fix bug when inplace strategy

* fix

* fix

* fix

* fix

* fix
```
  7915d180
- N
  
  Modified Kernel Primitive API and elementwise for xpu2 #38688 · 3eaf8d2c
  由 niuliling123 提交于 1月 11, 2022
  
  3eaf8d2c
- L
  Remove useless headers for some grad ops (#38823) · 9f34a070
  由 limingshu 提交于 1月 11, 2022
```
* fix the wrong filename

* first commit

* first commit

* remove rest useless headers

* for ci approval
```
  9f34a070
- S
  support vs2019 compilation in windows (#38719) · 0ad363b1
  由 Sing_chan 提交于 1月 11, 2022
```
* support vs2019 compilation in windows

* not modify pow_op's original compute logic
```
  0ad363b1
10 1月, 2022 10 次提交

Add gpu kernel for new api : linalg.lstsq (#38621) · 405103d8

由 Haohongxiang 提交于 1月 10, 2022

* add lstsq gpu kernel

* update

* add docs_en

* modify ut

* fix bugs

* modify example in docs_en

* remove lstsq_op.cu from ROCM cmake

* modify docs_en

* modify docs_en

* modify docs_en

* remove unneccessary TensorCopy

405103d8

B
refactor the forward implementation of reshape npu op (#38748) · 31b1f707
由 baoachun 提交于 1月 10, 2022
```
* refactor the forward implementation of reshape npu op

* update reshape npu op

* update reshape npu op
```
31b1f707
C

move get expected kernel args into pten (#38825) · 3a23c1a2
由 Chen Weihang 提交于 1月 10, 2022

3a23c1a2
Y
Add the backward support for QR (#38824) · 657b6742
由 Yulong Ao 提交于 1月 10, 2022
```
* Add the backward support for QR

* Remove unnecessary comments
```
657b6742
S

[bug fix] fix unfold runtime bug (#38819) · 5c357504
由 shangliang Xu 提交于 1月 10, 2022

5c357504
T

1.fix elementwise_add_grad bug. 2. add dropout kernel in kl2 (#38726) · 7b860a23
由 taixiurong 提交于 1月 10, 2022

7b860a23
W

fix attr missing in conv cudnn kernel (#38827) · 066a8063
由 wangxinxin08 提交于 1月 10, 2022

066a8063

[Unify Tensors PR ] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea

由 Zhanlue Yang 提交于 1月 10, 2022

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

* [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor

* Fixed issues with place

* Added comments

* Moved mutable_data with stream argument to DenseTensor

* Added set_offset interface

* Fixed CI issues,test=allcases

* [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor

* Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor

* Modified framework::Tensor to inherit from DenseTensor

* Reverted changes too pten_layout() interface

* Removed friend classes

* Rearranged cfunction calls from tensor.data<void>() to tensor.data()

* Fixed CI issues

* Fixed lite issues

* Fixed data() interface issues,test=allcases

* Resolved IsInitialized() issues

* Fixed ResetHolder() issues

* Fixed MKLDNN & Storage issues

* Resolved ShareBufferWith() issues

* Fixed LoD issues

5c73a6ea

Add MaxUnPool3D op and MaxUnPool1D op (#38716) · 7e31542c

由 andyjpaddle 提交于 1月 10, 2022

* add maxunpool3d op

* update doc for maxunpool3d op

* update doc for maxunpool3d op

* update doc for maxunpool3d op

* update sample code for maxunpool3d

* add maxunpool1d op

* update some code for maxunpool1d

7e31542c

G

remove fp32 tmp tensor and cast op for initializer.Normal and initializer.Constant (#38818) · 2238a535
由 Guoxia Wang 提交于 1月 10, 2022

2238a535

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致