Commits · 538b57211ad12eaf53bec322608f712e39dc5a12 · PaddlePaddle / Paddle

Dec 31, 2021 · 8 commits
- [new API] add paddle.kthvalue and paddle.Tensor.kthvalue (#38386) · 538b5721
  Committed by JYChen on Dec 31, 2021
```
* add new api/op kthvalue

* kthvalue cuda kernel to cub sorting

* fix example code error

* throw errors instead of LOG in cuda sort

* throw errors by PADDLE_ENFORCE
```
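For readers unfamiliar with the operation named in #38386: a k-th value query returns the k-th smallest element along an axis together with its position. The pure-Python sketch below illustrates that idea only. The helper names `kth_smallest` and `kth_smallest_rows` are hypothetical, `k` is assumed to be 1-based, and nothing here reflects the CUDA/cub kernel the commit adds.

```python
def kth_smallest(row, k):
    """Return (value, index) of the k-th smallest element of a 1-D list.

    Assumption: k is 1-based, so k=1 selects the minimum.
    """
    if not 1 <= k <= len(row):
        raise ValueError("k must be between 1 and len(row)")
    # Sort positions by their values; sorted() is stable, so ties keep
    # their original left-to-right order (a choice made for this sketch).
    order = sorted(range(len(row)), key=lambda i: row[i])
    idx = order[k - 1]
    return row[idx], idx


def kth_smallest_rows(matrix, k):
    """Apply kth_smallest to every row of a 2-D list (i.e. along the last axis)."""
    pairs = [kth_smallest(row, k) for row in matrix]
    values = [value for value, _ in pairs]
    indices = [index for _, index in pairs]
    return values, indices


values, indices = kth_smallest_rows([[3, 1, 2], [9, 7, 8]], k=2)
print(values, indices)
```

A full sort is the simplest way to express the semantics; a real kernel would more likely use a selection or segmented-sort primitive.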
- [Pten]Fix bugs of compilation when use pten::add/subtract (#38631) · 31efec53
  Committed by YuanRisheng on Dec 31, 2021
```
* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs

* fix compile bugs
```
- add new API paddle.linalg.lu/lu_unpack (#38617) · 2ce91c33
  Committed by zhiboniu on Dec 31, 2021
- Add fold operators (#38613) · 8898dce1
  Committed by xiaoting on Dec 31, 2021
```
* add fold operators, test=develop

* add fold operators, test=develop

* add fold operators, test=develop

* update fold op error test, test=develop

* fix unittest, test=develop

* fix unittest, test=develop
```
- Put_along_axis (based on PR #37921 by Xu Huang) (#38608) · f147fc99
  Committed by Huihuang Zheng on Dec 31, 2021
```
Paddle new APIs: put_along_axis.

Xu Huang is on holiday so we created this PR to work on it. It is based on his PR: https://github.com/PaddlePaddle/Paddle/pull/37921
```
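As a rough illustration of what a "put along axis" operation does, the sketch below writes values into a 2-D list at per-row column positions. It is a pure-Python sketch under stated assumptions: the helper `put_along_last_axis` is hypothetical, only plain assignment along the last axis is shown, and it is not the implementation from this PR.

```python
import copy


def put_along_last_axis(arr, indices, values):
    """Return a copy of 2-D list `arr` with `values` written along axis 1.

    For every row i and position p: out[i][indices[i][p]] = values[i][p].
    """
    out = copy.deepcopy(arr)  # leave the caller's data untouched
    for i, (idx_row, val_row) in enumerate(zip(indices, values)):
        for j, v in zip(idx_row, val_row):
            out[i][j] = v
    return out


result = put_along_last_axis(
    [[10, 30, 20], [60, 40, 50]],
    indices=[[0], [1]],
    values=[[99], [99]],
)
print(result)
```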
- add lu_op backward (#38616) · a1275c8b
  Committed by zhiboniu on Dec 31, 2021
- [PTen] Unify data layout of pten and fluid (#38583) · 8d32cef8
  Committed by Chen Weihang on Dec 31, 2021
```
* unify data layout

* fix test_transfer_layout error
```
- [Pten]Move math to new directory and change 'math' to 'math_kernel' (#38604) · e76087ad
  Committed by YuanRisheng on Dec 31, 2021
```
* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs
```
Dec 30, 2021 · 13 commits

- add OP lu forward (#38559) · 4e21457d
  Committed by zhiboniu on Dec 30, 2021
```
LGTM
```

- add sigmoid_cross_entropy_with_logits to kl1 (#38586) · 790cadd1
  Committed by houj04 on Dec 30, 2021

  * add sigmoid cross entropy with logits to kl1. test=kunlun

  * add sigmoid cross entropy with logits to kl1. test=kunlun

- Add exp, abs_grad, reciprocal, reciprocal_grad operator for XPU and update... · ceec1e21
  Committed by zhangyk0314 on Dec 30, 2021
```
Add exp, abs_grad, reciprocal, reciprocal_grad operator for XPU and update xpu2_op_list.h,test=kunlun (#38570)
```
- [New API] add new api paddle.mode and paddle.Tensor.mode (#38446) · 3777779b
  Committed by JYChen on Dec 30, 2021
```
* add new OP mode

* rename trans-variable name and fix UT
```

- Add cpu kernel of new api : lstsq (#38585) · ccf99b66
  Committed by Haohongxiang on Dec 30, 2021

  * add cpu kernel of lstsq

  * update

  * modify code style

  * modify unittest

  * remove support for complex

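- Add cusparse and unittest (#38431) · 667dc9f0
  Committed by zhangkaihuo on Dec 30, 2021

  Bind the cuSparse handle to DeviceContext so that ops do not have to create and destroy it.
  Add wrappers for the cuSparse APIs that convert between dense and sparse formats.
  Add unit tests for the wrapped APIs.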

- dynamic shape clone (#38520) · 339c34e6
  Committed by wenbin on Dec 30, 2021
```
* dynamic shape clone supported
```
- first commit (#38590) · ebc72ac2
  Committed by limingshu on Dec 30, 2021

- refine run_program_op_grad output var name (#38470) · 1c094d3e
  Committed by xiongkun on Dec 30, 2021

  * refine run_program_op_grad output var name

  * add default for global_block. for pass the eagle_generator_cmd

  * fix

  * ;

  * fix

  * const cast

  * mutable block

- Added Conv2D BF16 BWD oneDNN kernel (#38507) · ed8ba011
  Committed by jakpiase on Dec 30, 2021

  * working test for padding only

  * added full conv2d grad kernel

  * removed some trash

  * minor change

  * CI fix

  * format fix

- [PTen] Remove offset in storage (#38472) · a504ff3f
  Committed by Chen Weihang on Dec 29, 2021

  * remove offset in storage

  * revert api change

  * fix custom op slice bug

  * fix mutable_data error

- add dirichlet random sample op in cpu and gpu kernel (#38244) · c5bf09bb
  Committed by Xiaoxu Chen on Dec 30, 2021

  * add dirichlet sample op and cpu backend kernel

  * add Dirichlet op cuda kernel (#6)

  * add dirichlet op hip kernel
  Co-authored-by: Feiyu Chan <chenfeiyu@baidu.com>

- Fix the bug of batch_norm and batch_norm_grad op. (#38288) · cc83c95f
  Committed by Leo Guo on Dec 30, 2021

  * Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list.

  * Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list. test=kunlun
  Co-authored-by: Zibin <guozibin@baidu.com>

Dec 29, 2021 · 5 commits
- add top k v2 operator, test=kunlun (#38434) · d22f92ad
  Committed by ykkk2333 on Dec 29, 2021
- add argsort/scatter for kunlun (#38345) · 4643baa7
  Committed by TTerror on Dec 29, 2021
```
* add argsort/scatter for kunlun

* update test_scatter

* update xpu.cmake

* update xpu.cmake

* fix scatter
```
- fix lamb beta1pow beta2pow update (#38518) · 3672480b
  Committed by sneaxiy on Dec 29, 2021
- reduce compile time of amax and amin (#38534) · 72a41e50
  Committed by Tao Luo on Dec 29, 2021
- code clean (#38550) · 206a8f6c
  Committed by limingshu on Dec 29, 2021
Dec 28, 2021 · 8 commits
- Support multi-output feature for elementwise (#38410) · 48f061fb
  Committed by limingshu on Dec 28, 2021
```
* first commit

* pass ctest of elementwise_div_grad
```
- refactor matmul directory in pten (#38227) · 982bf444
  Committed by zyfncg on Dec 28, 2021
```
* refactor matmul directory in pten

* fix merge conflict
```
- Add API and op for take_along_axis (#38396) · 3310f519
  Committed by huangxu96 on Dec 28, 2021
```
* add API and op for take_along_axis

* fix compile dependency problem and add example code and doc

* add unittest

* delete some code for CI coverage

* fix code style problem

* fix as review
```
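For context, a "take along axis" operation gathers elements using an index array that has one entry per output element. The following is a minimal pure-Python sketch for 2-D lists; the helper name `take_along_axis_2d` is hypothetical and the code only illustrates the indexing rule, not the op added in #38396.

```python
def take_along_axis_2d(arr, indices, axis):
    """Gather from 2-D list `arr` using per-element `indices`.

    axis=1: out[r][c] = arr[r][indices[r][c]]
    axis=0: out[r][c] = arr[indices[r][c]][c]
    """
    if axis == 1:
        return [[row[j] for j in idx_row] for row, idx_row in zip(arr, indices)]
    if axis == 0:
        return [[arr[i][col] for col, i in enumerate(idx_row)] for idx_row in indices]
    raise ValueError("axis must be 0 or 1 for a 2-D input")


data = [[1, 2, 3], [4, 5, 6]]
print(take_along_axis_2d(data, [[0, 2], [1, 0]], axis=1))
print(take_along_axis_2d(data, [[1, 0, 1]], axis=0))
```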
- fix adamw epsilon in cuda kernel (#37746) · 6f1bb3d6
  Committed by Guoxia Wang on Dec 28, 2021
- Add Amax and Amin API (#38417) · 340dfb26
  Committed by Tao Luo on Dec 28, 2021
```
* add amax/amin

* support axis is list
```
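The second bullet says the reduction axis may be given as a list. A minimal pure-Python sketch of a maximum reduction over one or several axes of a 2-D list is shown below. The helper `amax_2d` is hypothetical, covers only the 2-D case, and is not taken from the commit; a minimum reduction would follow the same pattern with `min`.

```python
def amax_2d(matrix, axis=None):
    """Maximum of a 2-D list over `axis` (an int, a list of ints, or None for all)."""
    if axis is None:
        axes = {0, 1}
    elif isinstance(axis, int):
        axes = {axis}
    else:
        axes = set(axis)

    if axes == {0, 1}:
        # Reduce over both axes: a single scalar.
        return max(max(row) for row in matrix)
    if axes == {1}:
        # Reduce across columns: one value per row.
        return [max(row) for row in matrix]
    if axes == {0}:
        # Reduce across rows: one value per column.
        return [max(col) for col in zip(*matrix)]
    raise ValueError("axis entries must be 0 and/or 1 for a 2-D input")


m = [[1, 5], [7, 3]]
print(amax_2d(m, axis=0), amax_2d(m, axis=1), amax_2d(m, axis=[0, 1]))
```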
- [pten] remove in_type arg in cast kernel (#38486) · 0637b9a6
  Committed by chentianyu03 on Dec 28, 2021
```
* remove intype arg in cast kernel

* modify conj config in api.yaml by dictionary order

* rm unused code in cast_kernel.cu
```
- add reduce_prod_xpu. fix reduce_mean_xpu bug. (#38481) · 78836bb7
  Committed by houj04 on Dec 28, 2021
```
* add reduce_prod_xpu. fix reduce_mean_xpu bug.

* add reduce_prod_xpu. fix reduce_mean_xpu bug. test=kunlun
```
- Add constructor for fused dropout param to ease use. (#38475) · f9e8a775
  Committed by Li Min on Dec 28, 2021
Dec 27, 2021 · 6 commits
- update mkldnn matmul_transpose_reshape fuse pass ut (#38467) · 9cfdae91
  Committed by baoachun on Dec 27, 2021
- add matmulv2_transpose_reshape_pass ut (#37416) · f664a533
  Committed by baoachun on Dec 27, 2021
```
* update mkldnn matmul_v2_transpose_reshape_fuse_pass ut

* update mkldnn matmul_v2_transpose_reshape_fuse_pass ut

* update ut

* update ut
```
- add device-agnostic stream class (#38391) · 6b5e33b4
  Committed by Leo Chen on Dec 27, 2021
```
* add device-agnostic stream class

* add stream.h

* fix ut

* fix cpu compile
```
- refine float16 implementation (#38439) · 78375990
  Committed by sneaxiy on Dec 27, 2021
- Support multi-outputs feature for broadcast ops (#38329) · 89d38f55
  Committed by limingshu on Dec 27, 2021
```
* No harm to KP

* Pass the compile stage

* change the WriteData function

* fix template bugs and pass ctest of current elementwise

* for passing partial template specialization of template function in CI-ROCm

* To make 'WriteData' function flexible.

* a less harmful way to support multi-output

* a less harmful way to support multi-output
```
- gelu using normcdf for cudnn (#38450) · 37022482
  Committed by Guoxia Wang on Dec 27, 2021

PaddlePaddle / Paddle · synced successfully about 1 year ago
