提交 · 20dc1ac20bcae2d6b5f06b22daee6726ea1d5a8a · PaddlePaddle / Paddle

31 12月, 2021 11 次提交
- J
  [new api] add new api paddle.quantile and paddle.Tensor.quantile (#38567) · 20dc1ac2
  由 JYChen 提交于 12月 31, 2021
```
* add new api paddle.quantile and paddle.Tensor.quantile

* add take_todo and fix UT
```
  20dc1ac2
- Z
  
  add new API paddle.linalg.lu/lu_unpack (#38617) · 2ce91c33
  由 zhiboniu 提交于 12月 31, 2021
  
  2ce91c33
- X
  [Auto Parallel] Add general gradient merge pass to support auto parallel (#38259) · 89ce6db8
  由 xiayanming 提交于 12月 31, 2021
```
* [Auto Parallel] add gradient merge pass

* fix ci issue

* fix ci issue

* fix ci issue

* fix ci issue

* fix ci issue

* fix ci issue

* fix ci issue

* fix ci issue

* fix ci issue

* fix pr review

* fix pr review

* fix pr review

* fix pr review

* fix pr review

* fix pr review
```
  89ce6db8
- X
  Add fold opereators (#38613) · 8898dce1
  由 xiaoting 提交于 12月 31, 2021
```
* add fold opereators, test=develop

* add fold opereators, test=develop

* add fold opereators, test=develop

* update fold op error test, test=develop

* fix unitext, test=develop

* fix unitext, test=develop
```
  8898dce1
- Z
  
  Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor (#38607) · 5a6a2d27
  由 Zhanlue Yang 提交于 12月 31, 2021
  
  5a6a2d27
- D
  
  fix timeout (#38612) · 02c17c0b
  由 Double_V 提交于 12月 31, 2021
  
  02c17c0b
- H
  Put_along_axis (based on PR #37921 by Xu Huang) (#38608) · f147fc99
  由 Huihuang Zheng 提交于 12月 31, 2021
```
Paddle new APIs: put_along_axis.

Xu Huang is on holiday so we created this PR to work on it. It is based on his PR: https://github.com/PaddlePaddle/Paddle/pull/37921
```
  f147fc99
- C
  
  replace contextt to context (#38619) · f1366d58
  由 Chen Weihang 提交于 12月 31, 2021
  
  f1366d58
- Z
  
  add lu_op backward (#38616) · a1275c8b
  由 zhiboniu 提交于 12月 31, 2021
  
  a1275c8b
- C
  [PTen] Unify data layout of pten and fluid (#38583) · 8d32cef8
  由 Chen Weihang 提交于 12月 31, 2021
```
* unify data layout

* fix test_transfer_layout error
```
  8d32cef8
- Y
  [Pten]Move math to new directory and change 「math」 to 「math_kernel」 (#38604) · e76087ad
  由 YuanRisheng 提交于 12月 31, 2021
```
* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs
```
  e76087ad
30 12月, 2021 26 次提交

Z
add OP lu forward (#38559) · 4e21457d
由 zhiboniu 提交于 12月 30, 2021
```
LGTM
```
4e21457d

add sigmoid_cross_entropy_with_logits to kl1 (#38586) · 790cadd1

由 houj04 提交于 12月 30, 2021

* add sigmoid cross entropy with logits to kl1. test=kunlun

* add sigmoid cross entropy with logits to kl1. test=kunlun

790cadd1

Z
Add exp, abs_grad, reciprocal, reciprocal_grad operator for XPU and update... · ceec1e21
由 zhangyk0314 提交于 12月 30, 2021
```
Add exp, abs_grad, reciprocal, reciprocal_grad operator for XPU and update xpu2_op_list.h,test=kunlun (#38570)
```
ceec1e21
J

Refactor cpu_quantize_pass (#38019) · 1fa6900e
由 joanna.wozna.intel 提交于 12月 30, 2021

1fa6900e

flags to choose kp kernel (#38455) · ed2cfecf

由 Feng Xing 提交于 12月 30, 2021

This PR adds runtime flags run_kp_kernel, which choose which op to run for xpu2. There are two: dynamic linked and built from kp.

ed2cfecf

J
[New API] add new api paddle.mode and paddle.Tensor.mode (#38446) · 3777779b
由 JYChen 提交于 12月 30, 2021
```
* add new OP mode

* rename trans-variable name and fix UT
```
3777779b
Y
[Auto parallel] Make sure the id semantics of every var and op unique (#38132) · 5620214e
由 Yulong Ao 提交于 12月 30, 2021
```
* [Auto parallel] Make the id of var and op unique

* [Auto Parallel] Rename back dist_context to distop_context
```
5620214e

Add cpu kernel of new api : lstsq (#38585) · ccf99b66

由 Haohongxiang 提交于 12月 30, 2021

* add cpu kernel of lstsq

* update

* modify code style

* modify unittest

* remove support for complex

ccf99b66

Add cusparse and unittest (#38431) · 667dc9f0

由 zhangkaihuo 提交于 12月 30, 2021

将cuSparse的handle与DeviceContext进行绑定，避免op中进行创建和销毁
添加对cuSparse中dense和sparse转换的API进行封装
添加对封装的API的单测

667dc9f0

L

[Fleet Executor] Support multi carrier (#38535) · 3658405c
由 LiYuRio 提交于 12月 30, 2021

3658405c

Support test imperative basic with fixed retain grad interface (#38548) · 2421a25a

由 Jiabin Yang 提交于 12月 30, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* support more eager tensor api

* fix merge compile error

* fix compile error and fit develop code

* support pure CPU

* fix some logic error in eager_mode

* support _varbase_creator in eager mode

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

* for eager mode

* refine

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* eager logic

* refine test in pure cpu

* eager logic

* eager logic

* eager logic, test=develop

* skip core.eager when in inference, test=develop

* refine, test=develop

* refine, test=develop

* call RetainGrad after run forward kernel, test=develop

* refine, test=develop

* support dygraph util, meta, guard test

* support inference test

* refine test and fix initializer failed

* support create varbase and fix retain grad error

* fix windows error

* support test_imperative_basic test in eager mode

* remove additional log in variable.h

* remove additional log in variable.h

* remove additional code create in merge
Co-authored-by: Njim19930609 <jim19930609@gmail.com>
Co-authored-by: NWang Huan <wanghuan29@baidu.com>

2421a25a

W
dynamic shape clone (#38520) · 339c34e6
由 wenbin 提交于 12月 30, 2021
```
* dynamic shape clone supported
```
339c34e6
L

first commit (#38590) · ebc72ac2
由 limingshu 提交于 12月 30, 2021

ebc72ac2
X
[New-Exe]Fix word2vec hang proble using InterpreterCore (#38584) · e683ab50
由 xiongkun 提交于 12月 30, 2021
```
* fix wait for tiexing

* fix work2vec model. new_exe support EOF Exception in ReadOp now
```
e683ab50

refine run_program_op_grad output var name (#38470) · 1c094d3e

由 xiongkun 提交于 12月 30, 2021

* refine run_program_op_grad output var name

* add default for global_block. for pass the eagle_generator_cmd

* fix

* ;

* fix

* const cast

* mutable block

1c094d3e

Added Conv2D BF16 BWD oneDNN kernel (#38507) · ed8ba011

由 jakpiase 提交于 12月 30, 2021

* working test for padding only

* added full conv2d grad kernel

* removed some trash

* minor change

* Ci fix

* format fix

ed8ba011

Z

[PSCore]Fix test fleet base 2 (#38588) · 04496d89
由 zmxdream 提交于 12月 30, 2021

04496d89
S

try to expose cast with ptr function (#38598) · 15cbf81b
由 sneaxiy 提交于 12月 30, 2021

15cbf81b
J

params file will not be a nessary file (#38579) · de26b88b
由 JingZhuangzhuang 提交于 12月 30, 2021

de26b88b

[PTen] Remove offset in storage (#38472) · a504ff3f

由 Chen Weihang 提交于 12月 29, 2021

* remove offset in storage

* revert api change

* fix custom op slice bug

* fix mutable_data error

a504ff3f

F

Replace shared_ptr with unique_ptr in base_ptr_test (#38530) · 3f6229c6
由 From00 提交于 12月 30, 2021

3f6229c6
C

refine tensor change checking cond, test=document_fix (#38592) · b1e73347
由 Chen Weihang 提交于 12月 29, 2021

b1e73347

add ExponentialFamily and Dirichlet probability distribution (#38445) · 00cddf07

由 Xiaoxu Chen 提交于 12月 30, 2021

* extend Distribution baseclass for supporting multivariant distribution and prob method

* add ExponentialFamily base class and entropy using Bregman divergence

* add dirichlet probability distribution

00cddf07

add dirichlet random sample op in cpu and gpu kernel (#38244) · c5bf09bb

由 Xiaoxu Chen 提交于 12月 30, 2021

* add dirichlet sample op and cpu backend kernel

* add Dirichlet op cuda kernel  (#6)

* add dirichlet op hip kernel
Co-authored-by: NFeiyu Chan <chenfeiyu@baidu.com>

c5bf09bb

Fix the bug of batch_norm and batch_norm_grad op. (#38288) · cc83c95f

由 Leo Guo 提交于 12月 30, 2021

* Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list.

* Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list. test=kunlun
Co-authored-by: NZibin <guozibin@baidu.com>

cc83c95f

T

Add CUDA_ARCH_BIN (#38569) · 9e0a03ee
由 tianshuo78520a 提交于 12月 30, 2021

9e0a03ee

29 12月, 2021 3 次提交
- L
  
  add _nvprof_range interface (#38572) · ea01e790
  由 Leo Chen 提交于 12月 29, 2021
  
  ea01e790
- C
  
  unify infermeta target (#38580) · 458365cf
  由 Chen Weihang 提交于 12月 29, 2021
  
  458365cf
- Z
  
  Added copy_if_different for eager code generator (#38562) · ad78a21e
  由 Zhanlue Yang 提交于 12月 29, 2021
  
  ad78a21e

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功