提交 · f027b2ad964ea84051c39703e60738b4e10c811e · PaddlePaddle / Paddle

25 3月, 2022 27 次提交
- Z
  
  [Refactor] refactored eager_gen.py PR #2 (#40907) · f027b2ad
  由 Zhanlue Yang 提交于 3月 25, 2022
  
  f027b2ad
- W
  infrt update phi gpu register. (#40866) · 5f6038ff
  由 Wilber 提交于 3月 25, 2022
```
* update register every make.

* fix

* update
```
  5f6038ff
- Y
  
  move activation (#40913) · be5918e0
  由 YuanRisheng 提交于 3月 25, 2022
  
  be5918e0
- A
  [Phi] Migrate strided_slice into Phi (#40708) · c33b4f95
  由 Aurelius84 提交于 3月 25, 2022
```
* [Phi] Migrate strided_slice into Phi

* [Phi] Migrate strided_slice into Phi

* fix compilation problem
```
  c33b4f95
- T
  
  Add Coverage build size check (#40749) · fd0c0e3c
  由 tianshuo78520a 提交于 3月 25, 2022
  
  fd0c0e3c
- J
  
  test=document_fix (#40919) · 961ef4de
  由 Jiaqi Liu 提交于 3月 25, 2022
  
  961ef4de
- Z
  add cast_grad phi kernel (#40798) · b79c6a9b
  由 zhangbo9674 提交于 3月 25, 2022
```
* add cast_grad phi kernel

* refie unittest

* refien unittest

* refine unittest

* refine include header path

* refien xpu cast unittest

* refine code
```
  b79c6a9b
- A
  [Phi] Migrate Adam and AdamW into Phi (#40351) · 56cd3407
  由 Aurelius84 提交于 3月 25, 2022
```
* [Phi] Migrate Adam and Adamw into Phi

* fix compile error and unittest ok

* fix compile error and unittest ok

* fix undefined reference to fLI::FLAGS

* test depend on operator

* fix cmake

* fix xpu compile

* fix infrt

* fix amp_type_traits

* fix amp_type_traits

* modify according reviewer

* modify according reviewer

* fix dtype float16

* fix typo

* fix Cmake

* fix code style
```
  56cd3407
- L
  Thread data registry (#40912) · aeae81a7
  由 liutiexing 提交于 3月 25, 2022
```
* add align for WorkQueue

* add spinlock

* merge develop

* merge

* Add EventsWaiter

* Revert "Add EventsWaiter"

This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.

* Update ThreadDataRegistry
Co-authored-by: Nliutiexing <liutiexing@google.com>
```
  aeae81a7
- support multi_dims for tril_triu, *test=kunlun (#40712) · 9ffedcfd
  由 z8hanghuan 提交于 3月 25, 2022
```
* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* update xpu.cmake date, support multi_dims for tril_triu, *test=kunlun
```
  9ffedcfd
- F
  add maximum limit for grid of reduce, elementwise, gather and scatter (#40813) · 608a5f55
  由 FlyingQianMM 提交于 3月 25, 2022
```
* add maximum limit for grid of reduce, elementwise and gather

* add {} after if
```
  608a5f55
- C
  
  move mul op infershape (#40917) · 609077e9
  由 Chen Weihang 提交于 3月 25, 2022
  
  609077e9
- C
  [Phi] Move part sum op kernel (#40873) · 4ab8255a
  由 Chen Weihang 提交于 3月 25, 2022
```
* move part sum op kernel

* remove deprecated names
```
  4ab8255a
- Q
  
  [ROCm] fix compile error on DTK21.10, test=develop (#40893) · 41f813e9
  由 Qi Li 提交于 3月 25, 2022
  
  41f813e9
- change CUDA implementation of dropout OP (#40874) · 1c01d1cc
  由 zhouweiwei2014 提交于 3月 25, 2022
  
  1c01d1cc
- L
  fix paddle.vision.transforms.Resize en docs (#40719) · 236a3bc5
  由 Liyulingyue 提交于 3月 25, 2022
```
* Update transforms.py

* Update transforms.py

* Update transforms.py

* Update functional.py
```
  236a3bc5
- J
  Refactor Dygraph Flags (#40786) · 3085d5e4
  由 Jiabin Yang 提交于 3月 25, 2022
```
* refactor eager flags

* fix flags error when we switch from eager to dygraph

* fix ci problem

* fix ci

* fix ci

* merge develop and fix code style

* merge develop and fix code style

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* merge develop
```
  3085d5e4
- T
  
  fix xpu op test, *test=kunlun (#40862) · 1db9cd46
  由 TTerror 提交于 3月 25, 2022
  
  1db9cd46
- X
  [OpTest] Polish optest (#40879) · d43e8433
  由 xiongkun 提交于 3月 25, 2022
```
* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

* refine the logic of prepara_parameter logic

* fix Tensor(gpu) 2 Scalar segment fault.

* add multi-attribute. (test_unsqueeze_op); add python_sig_out for customizing op sig out

* fix some bugs, support python_out_sig
```
  d43e8433
- 王
  
  [infrt] add phi_dt.create_inited_dense_tensor.cpu.f32 kernel. (#40902) · 65478332
  由王明冬提交于 3月 25, 2022
  
  65478332
- F
  
  move elementwise_max/min/mod into phi (#40590) · cfadf61b
  由 FlyingQianMM 提交于 3月 25, 2022
  
  cfadf61b
- 0
  Fix loop index for FillZeroForEmptyGradInputs (#40909) · 3228fc34
  由 0x45f 提交于 3月 25, 2022
```
* Fix loop index for FillZeroForEmptyGradInputs

* Call fill zero in run_program_grad
```
  3228fc34
- S
  
  fix dependency (#40901) · c7b69fd2
  由 seemingwang 提交于 3月 25, 2022
  
  c7b69fd2
- A
  [NPU] add merged_momentum (#40875) · 2b74b739
  由 Aganlengzi 提交于 3月 25, 2022
```
* [NPU] add merged_momentum

* fix

* fix device
```
  2b74b739
- Z
  
  modify unit test in bn, stack and split. *test=kunlun (#40880) · 139a30ec
  由 Zhangjingyu06 提交于 3月 25, 2022
  
  139a30ec
- Z
  Scalar support marking data_type in yaml (#40867) · 04087012
  由 zyfncg 提交于 3月 25, 2022
```
* Scalar support marking data_type in yaml

* fix code-gene bug
```
  04087012
- F
  support get_item where the index is a bool scalar tensor (#40829) · 0f5e90a2
  由 FlyingQianMM 提交于 3月 25, 2022
```
* support get_item where the index is a bool scalar tensor

* add unittests for supporting get_item where the index is a bool scalar tensor
```
  0f5e90a2
24 3月, 2022 13 次提交

C
[Phi] Move mean op kernel into phi (#40872) · 8df91763
由 Chen Weihang 提交于 3月 24, 2022
```
* add mean phi kernel

* remove original mean kernel

* add alias name
```
8df91763

[Phi] Move batch size like infershape into phi (#40847) · 6d3db9c7

由 Chen Weihang 提交于 3月 24, 2022

* move batch size like infershape

* revert other op change

* call infermeta in infershape

* adjust batchsize like pos

6d3db9c7

Z

p_norm transfer to phi kernels (#40819) · 92afe146
由 zhiboniu 提交于 3月 24, 2022

92afe146
L

[new-exec] enable standalone_executor_test in coverage (#40846) · 22a5035e
由 Leo Chen 提交于 3月 24, 2022

22a5035e
J
fix build_cinn_pass internal var may be control var problem (#40812) · 310b7dba
由 jiangcheng 提交于 3月 24, 2022
```
* fix build_cinn_pass internal var may be control var problem

* add annotation and vlog by review advice
```
310b7dba

Support intermediate for Sparse API (#40840) · 98244a9a

由 zyfncg 提交于 3月 24, 2022

* support intermediate for saprse api

* close intermediate in yaml

* fix dygraph_api dep for eager

98244a9a

[AMP] Support amp for Intermediate_dygraph (#40623) · c12f7d48

由 zhangbo9674 提交于 3月 24, 2022

* approve amp for intermediate_dygraph

* add amp_utils for intermediate_dygraph

* add amp needcast check for mlu & npu

* test unittest

* add SetGradNode for set_stop_gradient && add checktensor for GradientHooks

* refine code

* refien unittest of imperative_amp for new dygraph

* inplace api skip amp

* add test_imperative_qat_amp for intermediate amp

* refine code

* refine test_amp ci strategy

* refine unittest code

* refine amp_utils code

* refine amp getpromotetype for some special op

* refine unittest code

c12f7d48

A

[phi] Remove usless cmake message (#40884) · 38d1fe34
由 Aurelius84 提交于 3月 24, 2022

38d1fe34
J
Correct MultipleQuantizeSquash (#40717) · 753964a2
由 joanna.wozna.intel 提交于 3月 24, 2022
```
* Correct MultipleQuantizeSquash

* Correct logging
```
753964a2
R

the `defaults` in FullArgSpec may be `None` (#40882) · 99541895
由 Ren Wei (任卫) 提交于 3月 24, 2022

99541895

[MoE]Assign pos op (#40580) · 305f32d1

由 Roc 提交于 3月 24, 2022

* # This is a combination of 10 commits.
# The first commit's message is:
add expert count op

add ut for expert_count

# This is the 2nd commit message:

update UT only for cuda

# This is the 3rd commit message:

fix for rocm

# This is the 4th commit message:

update ut

# This is the 5th commit message:

add moe module

# This is the 6th commit message:

add expert count op

add ut for expert_count

# This is the 7th commit message:

update UT only for cuda

# This is the 8th commit message:

update ut

# This is the 9th commit message:

add moe module

# This is the 10th commit message:

make expert count private

* add assign pos op

* fix upper num name

* add api _assign pos

* add ut for assign pos op

* update date

* fix for win

* update for test (timeout)

* fix ut

* update

* fix ut for number count
Co-authored-by: Nhlygit66666 <2570058140@qq.com>

305f32d1

L

Wrap dist api for dygraph mode (#40408) · 9d8cfc1b
由 lilong12 提交于 3月 24, 2022

9d8cfc1b
G

support dp for class_center_sample and margin_cross_entropy (#39852) · bff9e28e
由 Guoxia Wang 提交于 3月 24, 2022

bff9e28e

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功