Commits · fdeec8c37e6a4d53557eb9715e39b6ff04ced5bc · PaddlePaddle / Paddle

12 Apr, 2022 · 9 commits
- [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw (#41641) · fdeec8c3
  Committed by Aurelius84 on Apr 12, 2022
```
* [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw

* fix xpu unittest failed
```
- Add possibility to test native config in mkldnn tests (#41562) · b68bb428
  Committed by joanna.wozna.intel on Apr 12, 2022
- fix_paddle_numel_check (#41607) · 51cae7f7
  Committed by JingZhuangzhuang on Apr 12, 2022
```
* fix_paddle_numel_check

* fix_paddle_numel_check
```
- add a inner loop for index_select_grad_init() in index_select op when dealing... · bc01242b
  Committed by FlyingQianMM on Apr 12, 2022
```
add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563)

* replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op

* use CUDA_KERNEL_LOOP_TYPE

* fix code style

* replace index_select_grad_init with SetConstant
```
- [CustomOp]Add new method for custom double grad (#41538) · 362c7c80
  Committed by Chen Weihang on Apr 12, 2022
```
* add new method for custom double grad

* add tanh double grad unittest

* change year

* revert tensor init method
```
- [Phi] Support setting size of vector<Tensor> for out in yaml (#41576) · dead24dd
  Committed by zyfncg on Apr 12, 2022
```
* support setting vector out size in yaml

* support setting size of vector<tensor> for out in yaml
```
- Update Profiler (#41638) · c3e1d257
  Committed by liutiexing on Apr 12, 2022
- fix data transform problem for cudnn backend (#41622) · c055b50c
  Committed by zyfncg on Apr 12, 2022
- [Infrt] fix ci bug. test=document_fix (#41663) · d6e15914
  Committed by 王明冬 on Apr 12, 2022
11 Apr, 2022 · 15 commits
- fix, test=document_fix (#41655) · b45f80dd
  Committed by 石晓伟 on Apr 11, 2022
- tensor fluid code transfer part3 (#40034) · 5cb61417
  Committed by zhiboniu on Apr 11, 2022
- fix dynamic flag bug on mac (#41571) · b026840a
  Committed by zhouweiwei2014 on Apr 11, 2022
- add backend for heter training (#41526) · c64d9a44
  Committed by lilong12 on Apr 11, 2022
- support more ops (#41421) · fc621dfe
  Committed by Allen Guo on Apr 11, 2022
- update lite compile cmake (#41512) · 535810ba
  Committed by shentanyue on Apr 11, 2022
- fix for gaussian random (#41572) · 8fc9c412
  Committed by jakpiase on Apr 11, 2022
- fix arg_max for int type, *test=kunlun (#41522) · 368f1dda
  Committed by ykkk2333 on Apr 11, 2022
- Add no need buffer config (#41605) · 9287d5a1
  Committed by hong on Apr 11, 2022
```
* add no need buffer

* add no need buffer

* remove determinant no need buffer
```
- [Phi]Add multi_dot/maxout/multiplex op yaml (#41550) · 36d76840
  Committed by YuanRisheng on Apr 11, 2022
```
* add multi_dot,maxout,multiplex yaml

* add code converage
```
- Modify op-benchamrk script (#41470) · 89bfa964
  Committed by Zhang Zheng on Apr 11, 2022
```
* Modify op-benchamrk script

* fix
```
- [Yaml] Add assign yaml (#41428) · 437bebda
  Committed by chentianyu03 on Apr 11, 2022
```
* add assign yaml

* add assign api

* add assign backward api

* add assign

* add assign yaml

* add assign

* assign yaml

* add assign raw kernel and use assign_raw in yaml

* merge develop branch

* add missing python_api
```
- [Yaml] add yaml for Uniform random and add unit test. (#41517) · cd2a4cdf
  Committed by xiongkun on Apr 11, 2022
```
* gather op

* add mod

* [Yaml] final state for uniform and uniform_random
```
- Switch test_transformer to eager mode and fix roll error (#41548) · 9107dc67
  Committed by 0x45f on Apr 11, 2022
- fix some ops (#41577) · 795d7121
  Committed by sneaxiy on Apr 11, 2022
10 Apr, 2022 · 6 commits
- [KP]fix bug when TruncatedNormal cannot fall back in cpu (#41565) · c1394c6a
  Committed by Liu-xiandong on Apr 10, 2022
```
* [KP]fix bug when TruncatedNormal cannot fall back in cpu

* delete useless comment

* delete useless comment
```
- fix warpctc grad kernel dep eror (#41598) · 91d6f47a
  Committed by Chen Weihang on Apr 10, 2022
- [Yaml] Modify api and add unittests for full api final state. (#41437) · 81c40722
  Committed by xiongkun on Apr 10, 2022
```
* full api fix

* when out is None, go old dygraph mode

* fix

* add name for buffer

* fix by code review

* fix

* by static check
```
- add mkldnn compute_propagate_scales int8 pass (#41592) · c00d869b
  Committed by baoachun on Apr 10, 2022
- predictor support trt (#41556) · a78ca1cf
  Committed by Wilber on Apr 10, 2022
- add mkldnn int8 pass [step1] (#41579) · e68da187
  Committed by baoachun on Apr 10, 2022
```
* add mkldnn int8 pass

* add mkldnn int8 pass

* update pass
```
09 Apr, 2022 · 10 commits

Committed by zhaocaibei123 on Apr 09, 2022

* update name

* update name

* fix test

* fix fleet bind

* update name

* update name

* fix test

* fix gpups wrapper

* remove Push/Pull/Load/Save with context in client and wrapper base class

* fix

* fix

* remove some interface

* fix

* remove

* code style

* recover

* fix

* remove code unused

* remove some unused table & accessor & CommonDenseTable => MemoryDenseTable

* fix

* fix

* fix

* recover

* remove unused code

* recover unittest

* fix

* remove

* fix

* remove code unuseful

* remove

* fix

* recover

* remove
Co-authored-by: esythan <esythan@126.com>

7a07c4a5

- modify the block size of the group_norm backward (#41570) · ff2fba39
  Committed by crystal on Apr 09, 2022
- [Eager] Support allclose and linalg_cond to eager mode (#41545) · 9872da00
  Committed by Weilong Wu on Apr 09, 2022
- add depthwise conv hip support (#41537) · b3b8d345
  Committed by hong on Apr 09, 2022
- [infrt] opt support input valid places by commondline. (#41544) · 96ced1a1
  Committed by 王明冬 on Apr 09, 2022

- Autotune the workspace_size_limit in conv. (#40338) · b937cdc5
  Committed by limingshu on Apr 09, 2022

* Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode.

* Use the system cudaMalloc and cudaFree to allocate workspace during searching.

* Enable switch of two kind of workspace setting methods.
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>


- Add get profiler from config (#41532) · e1792a31
  Committed by chenjian on Apr 09, 2022

* no

* maintain old profiler

* add get profiler from serialization config

* add unit test

* improve coverage

* fix

* Revert "improve coverage"

This reverts commit 4a980bfda48adadee551d0e1c5740bc5b7389200.

* fix unit

* fix

* fix


- fix_ci_problem3 (#41484) · 9cb2287c
  Committed by Jiabin Yang on Apr 09, 2022
```
* fix_ci_problem3

* support windows no default error
```
- fix pylayer mem leak, test=develop (#41559) · be11648a
  Committed by wanghuancoder on Apr 09, 2022
- fix cross entropy (#41541) · 0e048fc6
  Committed by sneaxiy on Apr 09, 2022
