Commits · 0b4c3c20710f8276dec18711001b3b600b91b456 · BaiXuePrincess / Paddle

12 Apr, 2022 11 commits
- [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad (#41451) · 0b4c3c20
  Committed by Zhanlue Yang on Apr 12, 2022
```
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad

* Fixed elementwise issue

* Addressed CI failures
```
- [CustomOp] Add context pool unittests (#41085) · 59ec9599
  Committed by Chen Weihang on Apr 12, 2022
```
* add context pool unittests

* fix timeout

* polish details

* change option pos

* add dll decl for windows

* fix pre-commit error

* move dll_decl and export DeviceContext

* replace lost dll_decl.h
```
- [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw (#41641) · fdeec8c3
  Committed by Aurelius84 on Apr 12, 2022
```
* [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw

* fix xpu unittest failed
```
- Add possibility to test native config in mkldnn tests (#41562) · b68bb428
  Committed by joanna.wozna.intel on Apr 12, 2022
- fix_paddle_numel_check (#41607) · 51cae7f7
  Committed by JingZhuangzhuang on Apr 12, 2022
```
* fix_paddle_numel_check

* fix_paddle_numel_check
```
- add a inner loop for index_select_grad_init() in index_select op when dealing... · bc01242b
  Committed by FlyingQianMM on Apr 12, 2022
```
add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563)

* replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op

* use CUDA_KERNEL_LOOP_TYPE

* fix code style

* replace index_select_grad_init with SetConstant
```
- [CustomOp]Add new method for custom double grad (#41538) · 362c7c80
  Committed by Chen Weihang on Apr 12, 2022
```
* add new method for custom double grad

* add tanh double grad unittest

* change year

* revert tensor init method
```
- [Phi] Support setting size of vector<Tensor> for out in yaml (#41576) · dead24dd
  Committed by zyfncg on Apr 12, 2022
```
* support setting vector out size in yaml

* support setting size of vector<tensor> for out in yaml
```
- Update Profiler (#41638) · c3e1d257
  Committed by liutiexing on Apr 12, 2022
- fix data transform problem for cudnn backend (#41622) · c055b50c
  Committed by zyfncg on Apr 12, 2022
- [Infrt] fix ci bug. test=document_fix (#41663) · d6e15914
  Committed by 王明冬 on Apr 12, 2022
11 Apr, 2022 9 commits
- fix, test=document_fix (#41655) · b45f80dd
  Committed by 石晓伟 on Apr 11, 2022
- fix dynamic flag bug on mac (#41571) · b026840a
  Committed by zhouweiwei2014 on Apr 11, 2022
- support more ops (#41421) · fc621dfe
  Committed by Allen Guo on Apr 11, 2022
- fix for gaussian random (#41572) · 8fc9c412
  Committed by jakpiase on Apr 11, 2022
- fix arg_max for int type, *test=kunlun (#41522) · 368f1dda
  Committed by ykkk2333 on Apr 11, 2022
- [Phi]Add multi_dot/maxout/multiplex op yaml (#41550) · 36d76840
  Committed by YuanRisheng on Apr 11, 2022
```
* add multi_dot,maxout,multiplex yaml

* add code coverage
```
- [Yaml] Add assign yaml (#41428) · 437bebda
  Committed by chentianyu03 on Apr 11, 2022
```
* add assign yaml

* add assign api

* add assign backward api

* add assign

* add assign yaml

* add assign

* assign yaml

* add assign raw kernel and use assign_raw in yaml

* merge develop branch

* add missing python_api
```
- [Yaml] add yaml for Uniform random and add unit test. (#41517) · cd2a4cdf
  Committed by xiongkun on Apr 11, 2022
```
* gather op

* add mod

* [Yaml] final state for uniform and uniform_random
```
- fix some ops (#41577) · 795d7121
  Committed by sneaxiy on Apr 11, 2022
10 Apr, 2022 5 commits
- [KP]fix bug when TruncatedNormal cannot fall back in cpu (#41565) · c1394c6a
  Committed by Liu-xiandong on Apr 10, 2022
```
* [KP]fix bug when TruncatedNormal cannot fall back in cpu

* delete useless comment

* delete useless comment
```
- fix warpctc grad kernel dep error (#41598) · 91d6f47a
  Committed by Chen Weihang on Apr 10, 2022
- add mkldnn compute_propagate_scales int8 pass (#41592) · c00d869b
  Committed by baoachun on Apr 10, 2022
- predictor support trt (#41556) · a78ca1cf
  Committed by Wilber on Apr 10, 2022
- add mkldnn int8 pass [step1] (#41579) · e68da187
  Committed by baoachun on Apr 10, 2022
```
* add mkldnn int8 pass

* add mkldnn int8 pass

* update pass
```
09 Apr, 2022 9 commits

- 7a07c4a5
  Committed by zhaocaibei123 on Apr 09, 2022
```
* update name

* update name

* fix test

* fix fleet bind

* update name

* update name

* fix test

* fix gpups wrapper

* remove Push/Pull/Load/Save with context in client and wrapper base class

* fix

* fix

* remove some interface

* fix

* remove

* code style

* recover

* fix

* remove code unused

* remove some unused table & accessor & CommonDenseTable => MemoryDenseTable

* fix

* fix

* fix

* recover

* remove unused code

* recover unittest

* fix

* remove

* fix

* remove code unuseful

* remove

* fix

* recover

* remove

Co-authored-by: esythan <esythan@126.com>
```

- modify the block size of the group_norm backward (#41570) · ff2fba39
  Committed by crystal on Apr 09, 2022
- add depthwise conv hip support (#41537) · b3b8d345
  Committed by hong on Apr 09, 2022
- [infrt] opt support input valid places by commandline. (#41544) · 96ced1a1
  Committed by 王明冬 on Apr 09, 2022

- Autotune the workspace_size_limit in conv. (#40338) · b937cdc5
  Committed by limingshu on Apr 09, 2022
```
* Using the maximum workspace_size of all algorithms to limit the workspace size in exhaustive search mode.

* Use the system cudaMalloc and cudaFree to allocate workspace during searching.

* Enable switch of two kinds of workspace setting methods.

Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
```

- fix_ci_problem3 (#41484) · 9cb2287c
  Committed by Jiabin Yang on Apr 09, 2022
```
* fix_ci_problem3

* support windows no default error
```
- fix pylayer mem leak, test=develop (#41559) · be11648a
  Committed by wanghuancoder on Apr 09, 2022
- [new-exec] fix bug that no thread is waked up when adding task to threadpool (#41567) · f581f5bf
  Committed by Leo Chen on Apr 09, 2022
```
* fix bug that no thread is waked up when adding task to threadpool

* fix typo
```
- [fleet executor] Add sink interceptor and test (#41497) · b3e79731
  Committed by LiYuRio on Apr 09, 2022

08 Apr, 2022 6 commits
- Fix fake quant cuda kernel (#41305) · 330582e2
  Committed by whs on Apr 08, 2022
- fix group_norm (#41531) · 04a4bdf8
  Committed by crystal on Apr 08, 2022
```
fix group_norm vectorized address misalignment
```
- fix running error for ipu (#41481) · c2e12949
  Committed by Allen Guo on Apr 08, 2022
- Fix RNN OP multi-threads predict bug (#41529) · 09203e46
  Committed by Jack Zhou on Apr 08, 2022
- modify unittest of lstm forward, *test=kunlun (#41534) · d4710dfe
  Committed by z8hanghuan on Apr 08, 2022
```
* modify unittest of lstm forward, *test=kunlun

* modify unittest of lstm forward, *test=kunlun
```
- [Eager]Fix segment_pool/allclose/isclose/scale API bug (#41506) · 0a6fe699
  Committed by Aurelius84 on Apr 08, 2022
```
* [Eager]Fix segment_pool/allclose/isclose/scale API bug

* fix kernel register problem
```

BaiXuePrincess / Paddle is in sync with its fork source project
