Commits · b861022aa55a397976329041378193caf35987c3 · Crayon鑫 / Paddle

12 Apr, 2022 21 commits
- strided_slice (#41573) · b861022a
  Committed by feng_shuai on Apr 12, 2022
```
* strided_slice

* fix: compiler error because of size()

* fix: warning

* fix : warning

* init input_shape

* fix:forget punctuation
```
  b861022a
- Add layer norm yaml (#41589) · 43d5cca6
  Committed by hong on Apr 12, 2022
```
* add layer norm infermeta

* add layer norm yaml

* polish layer norm infer meta

* add layer norm to black list
```
  43d5cca6
- add python share_data interface (#41626) · be4a2077
  Committed by JingZhuangzhuang on Apr 12, 2022
```
* add python share_data interface

* Update inference_api.cc

* Update inference_api.cc

* add python share_data interface
```
  be4a2077
- exchange assign and assign_raw kernel name (#41625) · de49a4b7
  Committed by chentianyu03 on Apr 12, 2022
```
* exchange assign and assign_raw kernel name

* fix register error
```
  de49a4b7
- 【heterps】datafeed puttofeedvec performance (#40168) · c202a613
  Committed by danleifeng on Apr 12, 2022
```
* perform SlotRecordInMemoryDataFeed feedvec;test=develop
```
  c202a613
- fix depthwise dnn bug (#41666) · 7b627dd8
  Committed by hong on Apr 12, 2022
  
  7b627dd8
- [KP] Add Logical/compare/bitwise registry & UT (#40802) · 3749198e
  Committed by Lijunhui on Apr 12, 2022
```
* init commit no push

* collect comile errors

* bitwise UT

* fix compile problem

* cancel comments

* restore miss deletion

* fix compilation

* fix UT

* NO stash in multiple branch at the same times

* fix error

* combine .cu from gpu and kps

* replace gpu by kps

* fix by Chen-weihang

* Revert "Fix kps compile error in Junhui logic compare bitwise"

* fix backend test

* rm comments
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
```
  3749198e
- add dependency for send/recv to support pp parallel (#41652) · a058b474
  Committed by Leo Chen on Apr 12, 2022
  
  a058b474
- add trt supoort for slice op (#41467) · f403fb69
  Committed by feng_shuai on Apr 12, 2022
```
* add trt supoort for slice op

* fix:output dims bug

* fix: test

* fix:for c++ coverage

* fix:c++ coverage

* fix: fix test bug

* fix: CI test
```
  f403fb69
- add fp16 kernel to clip_grad (#41661) · 137dc3e3
  Committed by wuyefeilin on Apr 12, 2022
  
  137dc3e3
- [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad (#41451) · 0b4c3c20
  Committed by Zhanlue Yang on Apr 12, 2022
```
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad

* Fixed elementwise issue

* Addressed CI failures
```
  0b4c3c20
- [CustomOp] Add context pool unittests (#41085) · 59ec9599
  Committed by Chen Weihang on Apr 12, 2022
```
* add context pool unittests

* fix timeout

* polish details

* change option pos

* add dll decl for wndows

* fix pre-commit error

* move dll_decl and export DeviceContext

* replace lost dll_decl.h
```
  59ec9599
- [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw (#41641) · fdeec8c3
  Committed by Aurelius84 on Apr 12, 2022
```
* [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw

* fix xpu unittest failed
```
  fdeec8c3
- Add possibility to test native config in mkldnn tests (#41562) · b68bb428
  Committed by joanna.wozna.intel on Apr 12, 2022
  
  b68bb428
- fix_paddle_numel_check (#41607) · 51cae7f7
  Committed by JingZhuangzhuang on Apr 12, 2022
```
* fix_paddle_numel_check

* fix_paddle_numel_check
```
  51cae7f7
- add a inner loop for index_select_grad_init() in index_select op when dealing... · bc01242b
  Committed by FlyingQianMM on Apr 12, 2022
```
add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563)

* replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op

* use CUDA_KERNEL_LOOP_TYPE

* fix code style

* replace index_select_grad_init with SetConstant
```
  bc01242b
- [CustomOp]Add new method for custom double grad (#41538) · 362c7c80
  Committed by Chen Weihang on Apr 12, 2022
```
* add new method for custom double grad

* add tanh double grad unittest

* change year

* revert tensor init method
```
  362c7c80
- [Phi] Support setting size of vector<Tensor> for out in yaml (#41576) · dead24dd
  Committed by zyfncg on Apr 12, 2022
```
* support setting vector out size in yaml

* support setting size of vector<tensor> for out in yaml
```
  dead24dd
- Update Profiler (#41638) · c3e1d257
  Committed by liutiexing on Apr 12, 2022
  
  c3e1d257
- fix data transform problem for cudnn backend (#41622) · c055b50c
  Committed by zyfncg on Apr 12, 2022
  
  c055b50c
- [Infrt] fix ci bug. test=document_fix (#41663) · d6e15914
  Committed by 王明冬 on Apr 12, 2022
  
  d6e15914
11 Apr, 2022 9 commits
- fix, test=document_fix (#41655) · b45f80dd
  Committed by 石晓伟 on Apr 11, 2022
  
  b45f80dd
- fix dynamic flag bug on mac (#41571) · b026840a
  Committed by zhouweiwei2014 on Apr 11, 2022
  
  b026840a
- support more ops (#41421) · fc621dfe
  Committed by Allen Guo on Apr 11, 2022
  
  fc621dfe
- fix for gaussian random (#41572) · 8fc9c412
  Committed by jakpiase on Apr 11, 2022
  
  8fc9c412
- fix arg_max for int type, *test=kunlun (#41522) · 368f1dda
  Committed by ykkk2333 on Apr 11, 2022
  
  368f1dda
- [Phi]Add multi_dot/maxout/multiplex op yaml (#41550) · 36d76840
  Committed by YuanRisheng on Apr 11, 2022
```
* add multi_dot,maxout,multiplex yaml

* add code converage
```
  36d76840
- [Yaml] Add assign yaml (#41428) · 437bebda
  Committed by chentianyu03 on Apr 11, 2022
```
* add assign yaml

* add assign api

* add assign backward api

* add assign

* add assign yaml

* add assign

* assign yaml

* add assign raw kernel and use assign_raw in yaml

* merge develop branch

* add missing python_api
```
  437bebda
- [Yaml] add yaml for Uniform random and add unit test. (#41517) · cd2a4cdf
  Committed by xiongkun on Apr 11, 2022
```
* gather op

* add mod

* [Yaml] final state for uniform and uniform_random
```
  cd2a4cdf
- fix some ops (#41577) · 795d7121
  Committed by sneaxiy on Apr 11, 2022
  
  795d7121
10 Apr, 2022 5 commits
- [KP]fix bug when TruncatedNormal cannot fall back in cpu (#41565) · c1394c6a
  Committed by Liu-xiandong on Apr 10, 2022
```
* [KP]fix bug when TruncatedNormal cannot fall back in cpu

* delete useless comment

* delete useless comment
```
  c1394c6a
- fix warpctc grad kernel dep eror (#41598) · 91d6f47a
  Committed by Chen Weihang on Apr 10, 2022
  
  91d6f47a
- add mkldnn compute_propagate_scales int8 pass (#41592) · c00d869b
  Committed by baoachun on Apr 10, 2022
  
  c00d869b
- predictor support trt (#41556) · a78ca1cf
  Committed by Wilber on Apr 10, 2022
  
  a78ca1cf
- add mkldnn int8 pass [step1] (#41579) · e68da187
  Committed by baoachun on Apr 10, 2022
```
* add mkldnn int8 pass

* add mkldnn int8 pass

* update pass
```
  e68da187
09 Apr, 2022 5 commits

Committed by zhaocaibei123 on Apr 09, 2022

* update name

* update name

* fix test

* fix fleet bind

* update name

* update name

* fix test

* fix gpups wrapper

* remove Push/Pull/Load/Save with context in client and wrapper base class

* fix

* fix

* remove some interface

* fix

* remove

* code style

* recover

* fix

* remove code unused

* remove some unused table & accessor & CommonDenseTable => MemoryDenseTable

* fix

* fix

* fix

* recover

* remove unused code

* recover unittest

* fix

* remove

* fix

* remove code unuseful

* remove

* fix

* recover

* remove
Co-authored-by: esythan <esythan@126.com>

7a07c4a5

- modify the block size of the group_norm backward (#41570) · ff2fba39
  Committed by crystal on Apr 09, 2022

ff2fba39
- add depthwise conv hip support (#41537) · b3b8d345
  Committed by hong on Apr 09, 2022

b3b8d345
- [infrt] opt support input valid places by commondline. (#41544) · 96ced1a1
  Committed by 王明冬 on Apr 09, 2022

96ced1a1

- Autotune the workspace_size_limit in conv. (#40338) · b937cdc5
  Committed by limingshu on Apr 09, 2022

* Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode.

* Use the system cudaMalloc and cudaFree to allocate workspace during searching.

* Enable switch of two kind of workspace setting methods.
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>

b937cdc5

Crayon鑫 / Paddle is in sync with its fork source project