提交 · 6d205516e1b8cb0aca0e5382f28d82041c122d7d · Crayon鑫 / Paddle

16 3月, 2022 11 次提交
- X
  
  tranfer cumprod and kldiv_loss infershape to phi (#40575) · 6d205516
  由 xiongkun 提交于 3月 16, 2022
  
  6d205516
- C
  
  move isclose infershape (#40595) · c7637700
  由 Chen Weihang 提交于 3月 16, 2022
  
  c7637700
- C
  [Phi] Move grid sample op kernel into phi (#40585) · 8fd20b5b
  由 Chen Weihang 提交于 3月 16, 2022
```
* add grid sample phi kernel

* add grid sample phi kernel and remove original kernel

* replace mutable_data by alloc
```
  8fd20b5b
- Q
  
  [MLU] support amp O1 of mlu (#40461) · ad81f22c
  由 qipengh 提交于 3月 16, 2022
  
  ad81f22c
- Z
  
  Fixed issue with default-valued attributes (#40368) · f748b433
  由 Zhanlue Yang 提交于 3月 16, 2022
  
  f748b433
- A
  
  Polish reshape error message under @to_static (#40599) · 80194bde
  由 Aurelius84 提交于 3月 16, 2022
  
  80194bde
- C
  
  move gather infershape (#40594) · 59e5c49f
  由 Chen Weihang 提交于 3月 16, 2022
  
  59e5c49f
- Y
  [Auto Parallel] Add the support for the auto completion of while_op (#39939) · ec6b8fbd
  由 Yulong Ao 提交于 3月 16, 2022
```
* [Auto Parallel] Support the auto completion of while_op

* [Auto Parallel] Improve the completion algorithms

* [Auto Parallel] Fix bugs for ernie inference

* [Auto Parallel] Remove attrs which cannot be pickled

* [Auto Parallel] make the dims_mappings of LodTensorArray vars empty

* [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel

* [Auto Parallel] Remove unncessary comments

* [Auto Parallel] Fix a bug of the CMakeLists

* [Auto Parallel] Use the newest APIs to write the unit test

* [Auto Parallel] Remove unnecessary statements
```
  ec6b8fbd
- X
  
  lgamma tranfer make xpu ci failed. fix compile error in xpu CI (#40581) · 31858263
  由 xiongkun 提交于 3月 16, 2022
  
  31858263
- K
  
  fix IterableDataset may block model when num_workers > 0. test=develop (#40541) · a991b6a0
  由 Kaipeng Deng 提交于 3月 16, 2022
  
  a991b6a0
- 王
  [infrt]Refine phi dialect (#40505) · 927767ca
  由王明冬提交于 3月 16, 2022
```
* change some symbol names

* add test

* add phi to opt.cc

* clean code

* up

* update

* up

* up

* Update pten_pass.mlir

* Update convolution_grad_kernel.cc

* update

* restore init_infrt_dialects

* restore

* up

* up

* up
Co-authored-by: NSuperjomn <yanchunwei@outlook.com>
```
  927767ca
15 3月, 2022 29 次提交

[Phi] Move determinant op kernel into phi (#40539) · a04a6bd5

由 Chen Weihang 提交于 3月 15, 2022

* add determinant phi kernel

* remove original determinant op kernel

* add determinant grad [hi kernel

* fix determinant test failed

* remove original determinant grad op kernel

a04a6bd5

C

remove cmake kernel print info (#40550) · 0c0acbd7
由 Chen Weihang 提交于 3月 15, 2022

0c0acbd7
G
Support some ops for full quantization (#40083) · 7ced3017
由 Guanghua Yu 提交于 3月 15, 2022
```
* add some op for full_quantization
```
7ced3017

[phi] modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot (#40506) · 31729a62

由 Liu-xiandong 提交于 3月 15, 2022

* [phi] move matrix_power op

* MatrixInverse fluid -> phi

* modify the CMake to fix compile bug

* delete useless comment

* mutable memory -> phi Alloc

* modify the include file

* modify the include file

* fix bug in CI compiler

* [phi]modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot

* delete useless comment

* fix bug in CI

* modify after review

31729a62

add number count op (#39224) · 9bdee437

由 Roc 提交于 3月 15, 2022

* add expert count op

add ut for expert_count

* update UT only for cuda

* fix for rocm

* update ut

* add moe module

* add expert count op

add ut for expert_count

* update UT only for cuda

* update ut

* add moe module

* make expert count private

* rename expert count op
Co-authored-by: Nhlygit66666 <2570058140@qq.com>

9bdee437

X
run python api in eager model and filter the out in argument list (#40523) · 4d886f75
由 xiongkun 提交于 3月 15, 2022
```
* run python api in eager model and filter the out in argument list

* fix code
```
4d886f75
Z
Fixed issues with generated scale operator (#40482) · 30417999
由 Zhanlue Yang 提交于 3月 15, 2022
```
* Fixed issues with generated scale operator

* Fixed minor issues
```
30417999
T
[einsum] refactored and supporting unknown shapes in static mode (#40360) · 187fcfa3
由 Tongxin Bai 提交于 3月 15, 2022
```
* formatted.

* Remove dead code.

* Fix error message in the unit test.

* polish formats.

* [Einsum] fix bugs.
```
187fcfa3
F
[NPU] add AMP O1 support (#40362) · 69dd43d1
由 furnace 提交于 3月 15, 2022
```
* [NPU] add AMP O1 support

* [NPU] fix NOTE and warnings
```
69dd43d1
Y
[Auto Parallel] Add the recorder and trial class for the tuner (#40555) · 2c5edb4f
由 Yulong Ao 提交于 3月 15, 2022
```
Add the recorder
```
2c5edb4f

[Phi] Move gather op kernel into phi (#40500) · 0c703fe7

由 Chen Weihang 提交于 3月 15, 2022

* add phi gather kernel

* update year

* remove original gather opkernel

* add gather grad phi kernels

* remove origin gather grad kernel

* fix failed npu and xpu

* fix xpu compile failed

0c703fe7

oneDNN NHWC fixes (#40049) · dde9cec0

由 Jacek Czaja 提交于 3月 15, 2022

* - Prototype of third solution

- fix

- compilation fixes

- fix

- fixe

- fix

- fix

- compilation fix

- comment fix

- lint

update mkldnn conv_elementwise_add_fuse_pass ut

- NHWC changes to prelu

- alhpa dims

- UT fix

- fix to UT

- lint

- Some fixes

- added to BWD of prelu NHWC support

- reverted removal of resetting cu_layout in clearing of caching

* - Small changes

* - compilation fix

* - fix

* - fix

* lint

* - fixes after internal review

* - compilation fix

* - lint

dde9cec0

change CUDA implementation of randperm OP (#40464) · 813f61d2
由 zhouweiwei2014 提交于 3月 15, 2022

813f61d2
T
add shard_id (#40261) · 6b7d4845
由 Thunderbrook 提交于 3月 15, 2022
```
* shard_id

* format
```
6b7d4845

[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass... · 64223620

由 xiongkun 提交于 3月 15, 2022

[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)

* tranfer and pass the lgamma unittest

* merge and pass the test

* transfer kldiv_loss and kldiv_loss_grad; pass the unitest

* trafer the isclose and cumprod kernel

* change PT_REGISTER -> PD_REGISTER

* fix by code review

* fix by code review

* fix

* remove enforce include dependence from scalar

* fix

* fix by code review

* fix by code review

64223620

C

add softmax yaml and add_raw infermeta (#40534) · 7039f61e
由 Chen Weihang 提交于 3月 15, 2022

7039f61e

[Phi]move reduce_min/any/all kernel (#40374) · c46e661d

由 chentianyu03 提交于 3月 15, 2022

* add reduce_min kernel

* remove raw reduce_min kernel

* add reduce min

* add reduce any all impl

* add bool reduce Kernel

* remove raw any/all kernel

* add any all kernel

* rm comment

c46e661d

Added more profile signposts to dygraph (#40201) · 36db75b4

由 Zhanlue Yang 提交于 3月 15, 2022

* Added more signposts to dygraph profiling

* Fixed minor issues

* Refactored signpost names

* Fixed typo

* Removed debug codes

* Fixed typo

* Adjusted signpost names

* Fixed issues from branch merge

36db75b4

Move one hot to phi (#39876) · 7701db37

由 hong 提交于 3月 15, 2022

* move one hot to phi; test=develop

* fix bugs; test=develop

* fix bugs; test=develop

* add infer meta; test=develop

* fix bugs; test=develop

* resolve confilct

* resolve confilct

* fix bug;

* fix error; test=develop

* update; test=develop

* polish code; test=develop

* add one api in eager mode; test=develop

* add one hot test; test=develop

* remove use less code; test=develop

* fix bug; test=develop

* polish code; test=develop

* polish code; test=develop

7701db37

Skip infrt when checking log fatal (#40529) · c9f3ad03

由 Chen Weihang 提交于 3月 15, 2022

* skip infrt when checking log fatal, test=document_fix

* remove test=document_fix

* update commit

c9f3ad03

K

New design for launch/run (#40086) · 67c6ddff
由 kuizhiqing 提交于 3月 15, 2022

67c6ddff
R

add CHECK_VERSION macro (#40512) · 464f65b1
由 ronnywang 提交于 3月 15, 2022

464f65b1
Y
[Auto parallel] Redesign the tuner for auto parallel (#40121) · f84b54eb
由 Yulong Ao 提交于 3月 15, 2022
```
* [Auto Parallel] Redesign the tunner for Auto Parallel
```
f84b54eb
C

Fix truncated norm operator (#40287) · 0c333543
由 Chang Xu 提交于 3月 15, 2022

0c333543

[Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180

由 YuanRisheng 提交于 3月 15, 2022

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

* move activation kernel

* revert relu6

* reduce add code

* perfect use_phi_functor

* completing func name

* fix bugs when run ci

* fix bugs when run infr

* modifpy infrt get kernel signature

d7112180

Q

[MLU] add check_finite_and_unscale op for amp (#40458) · 42c7bb47
由 qipengh 提交于 3月 15, 2022

42c7bb47
Y

add yaml (#40533) · 5cb506b0
由 YuanRisheng 提交于 3月 15, 2022

5cb506b0
A
[IPU] add IPU related CI configures (#40354) · 8852591f
由 Allen Guo 提交于 3月 15, 2022
```
* add ci

* rm retry tests

* format

* restore retry tests

* update timeout for ipu uts
```
8852591f
Z

[Phi]Move searchsorted kernel to phi (#40520) · 85f8fd9b
由 Zhang Zheng 提交于 3月 15, 2022

85f8fd9b

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致