提交 · c72e2a15ac92d5dc4103e7d9d282fb825b1e488e · PaddlePaddle / Paddle

16 2月, 2023 9 次提交

A

[API]Support is_tensor() static branch (#50520) · c72e2a15
由 Aurelius84 提交于 2月 16, 2023

c72e2a15
H
[XPU] update xccl to 1.0.8 and xdnn to 20230215 (#50247) · b8008580
由 houj04 提交于 2月 16, 2023
```
* [XPU] update xccl to 1.0.8

* update xdnn. add uint8 for concat and split.

* update xdnn to 20230215.
```
b8008580
R
[XPU] add group_norm, sin, cos, linspace, randint kernels (#50465) · c86a5140
由 ronnywang 提交于 2月 16, 2023
```
* [XPU] add group_norm kernel

* update

* add xpu sin, cos, randint, linspace kernels

* update

* update
```
c86a5140

[Phi decouple] move layer_norm_kernel.cu.h to phi (#50506) · 8910bb4a

由 Huang Jiyi 提交于 2月 16, 2023

* move layer_norm_kernel.cu.h to phi

* fix bugs

* fix namespace

* fix bugs

* fix CI-Windwos

* replace mutable_data

* fix bugs

* fix bugs

8910bb4a

Z

[XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405
由 zhupengyang 提交于 2月 16, 2023

c8aa6405

Use StandaloneExecutor in FleetExecutor (#50239) · df207283

由 Ruibiao Chen 提交于 2月 16, 2023

* Use StandaloneExecutor in FleetExecutor

* Update FLAGS

* Fix CI errors

* Update code

* Add force_root_scope_vars config

* Update code

* Fix CI errors

* Fix test_layer_new errors

df207283

[phi decoupling] remove variable.h in phi (#50407) · 905cefd4

由 Huang Jiyi 提交于 2月 16, 2023

* move variable_utils from phi_api_utils to fluid

* fix coment

* update include

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* update

* update

* fix CI-Windows-OpenBLAS

* fix bugs

* fix bugs

* fix bugs

* update include

* move variable_utils to phi_utils

* fix namespace

905cefd4

姜
disable deprecated ops dygraph tests (#50521) · df0ed4d6
由姜永久提交于 2月 16, 2023
```
* disable unewanted dygraph tests

* mine_hard_exa
```
df0ed4d6

Add mean composite rule (#50298) · f7f67b72

由 zqw_1997 提交于 2月 16, 2023

* beta

* small commit

* add batch_norm composite rule

move composite test case

remove unuseful var

add composite op blacklist

* small change v2

* finish the test_composite_mean and test_composite_mean_grad

* add ops assertion to the tests

* add cinn test

* fix the error and inappropriate usage in func: mean_composite

* remove the ref of outer lib in primtives.py

* modify sample code of reduce_sum

* fix composite mean op map

* modify testcases to test more float type

* remove cpu float16 test

* cinn test fix

* remove reduce_max

* change the name sum to sum_x

* change the use of reduce_sum to sum

---------
Co-authored-by: Ncyber-pioneer <chenzhuo@tju.edu.cn>

f7f67b72

15 2月, 2023 21 次提交
- D
  
  fix npu save_combine (#50496) · 3c14b38e
  由 duanyanhui 提交于 2月 15, 2023
  
  3c14b38e
- N
  
  Add Cpu tensor cast when amp_type isn't float32 (#50401) · 3d5faa88
  由 niuliling123 提交于 2月 15, 2023
  
  3d5faa88
- L
  make cinn_launch_op run interpretercore in tracing mode to reduce number of threads (#50472) · bf38175e
  由 Leo Chen 提交于 2月 15, 2023
```
* make cinn_launch_op run interpretercore in tracing mode to reduce number of threads

* skip getWorkqueue in tracing mode
```
  bf38175e
- H
  Rewrite conv activation mkldnn fuse pass tester (#49278) · 84beef80
  由 Hulek 提交于 2月 15, 2023
```
* Done

* Deleted old python test, fixed new python test, changed names in parallel_UT

* Revert parallel UT changes

* Revert parallel UT changes v2

* Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass

* delete sqrt from possible activations from conv_concat_relu test

* review refactor

* merge main

* delete sqrt from list of compatible activations

* Test with no outdated inputs
```
  84beef80
- X
  align tool (#49865) · 4632ca13
  由 xu98bin 提交于 2月 15, 2023
```
* auto parallel align tool

* modify function get_var's return

* add save and load in align_tool

* modify load function and save function

* add finding different ops in align tool

* full auto parallel align tool

add test file for auto parallel align tool

set timeout for test

modify get_backward_tmp_var function

add annotation for align tool

modify test file

modify code to restart CI

remove timeout

* set timeout
```
  4632ca13
- W
  
  [gpups update] add gpups ci log print (#50522) · 41902dda
  由 wangzhen38 提交于 2月 15, 2023
  
  41902dda
- C
  fix composite op map (#50397) · ff86aeab
  由 cyber-pioneer 提交于 2月 15, 2023
```
* map output from composite rule to origin op

add mean layer_norm dropout op map

add input map check

composite softmax support input shape []

* composite softmax support shape []

* polish log

* solve conflict

* polish code

* polish op map output

* add check dtype
```
  ff86aeab
- Z
  
  delete onednn kernel of feed (#50503) · 8decfb78
  由 zyfncg 提交于 2月 15, 2023
  
  8decfb78
- Y
  [PHI Decoupling]Remove Profiler header (Part2) (#50183) · 8fabca11
  由 YuanRisheng 提交于 2月 15, 2023
```
* move profiler

* add file

* fix mac compile bugs

* fix ci bugs

* fix mac bugs

* fix ci bugs

* fix compile bugs

* perfect code according comment
```
  8fabca11
- R
  
  fix ninja problem (#50431) · 96006f77
  由 risemeup1 提交于 2月 15, 2023
  
  96006f77
- Z
  
  add gather_nd_grad op and where_grad support zero_dim for xpu (#50454) · 055d0c2d
  由 zhangyikun02 提交于 2月 15, 2023
  
  055d0c2d
- Q
  
  remove duplicated op in xpu2_op_list (#50450) · 47c23ccb
  由 QingshuChen 提交于 2月 15, 2023
  
  47c23ccb
- L
  make FusedMultiTransformer supports variable-lengths. (#49560) · 53df50c7
  由 lzy 提交于 2月 15, 2023
```
* make FusedMultiTransformer supports variable-lengths.

* modify ffn2 when cuda_version >= 11.6 because of #49392.

* code style

* delete remove_padding
```
  53df50c7
- Z
  remove incubate.data_generator (#50325) · a3989b5e
  由 zqw_1997 提交于 2月 15, 2023
```
* remove incubate.data_generator

* modify the setup.py

* modifyt the setup.py.in
```
  a3989b5e
- W
  [fluid clean]clean fluid.transpiler API (#50375) · b08c91ab
  由 wangxiaoning 提交于 2月 15, 2023
```
* move ascend_transpiler

* move transpiler.collective

* remver checkport

* fix

* fix import

* fix import

* add init

* fix

* fix

* fix
```
  b08c91ab
- Z
  
  fix the issue: 50470 (#50479) · fe698fd4
  由 zqw_1997 提交于 2月 15, 2023
  
  fe698fd4
- W
  
  [mv fluid] ps related (#50376) · 81113b53
  由 wangzhen38 提交于 2月 15, 2023
  
  81113b53
- W
  
  Fix is_tensor_array in getitem (#50502) · cd54cfab
  由 WangZhen 提交于 2月 15, 2023
  
  cd54cfab
- R
  fix some protobuf update problems (#49875) · d84b918b
  由 risemeup1 提交于 2月 15, 2023
```
* Improved prootbuf upgrades

* Improved prootbuf upgrades

* Improved prootbuf upgrades

* limit protobuf version>=3.20.0
```
  d84b918b
- Y
  [CUSTOM]custom device add black_list (#50409) · 66d3c56e
  由 YuhangLi 提交于 2月 15, 2023
```
* [CUSTOM]custom device add black_list

* change log level

* fix some issues
```
  66d3c56e
- Z
  Add api approval (#50459) · 86fa306a
  由 zachary sun 提交于 2月 15, 2023
```
* add new approvals

* modify github id
```
  86fa306a
14 2月, 2023 10 次提交
- E
  decouple tensor_utils (#50264) · 057cdb95
  由 engineer1109 提交于 2月 14, 2023
```
fix X

remove TensorCopy

codestyle

add fluid memory header

fix symbol

fix cmake

fix cmake

fix context

fix header

fix place

fix context

fix context

fix context

fix code

fix custom context

fix custom context

fix copy

fix data_transform

fix style

remove changes of custom

fix scalar
```
  057cdb95
- D
  Expand mixed_precision to custom device (#50378) · fcb746cb
  由 duanyanhui 提交于 2月 14, 2023
```
* expand mix_precision to custom_device

* fix bug

* fix bug

* fix comment

* fix DEFINE bug
```
  fcb746cb
- H
  
  fix operants_manager.cc compile error (#50492) · 4a7d9cd8
  由 HongyuJia 提交于 2月 14, 2023
  
  4a7d9cd8
- W
  
  [BUG FIX] ctr metric bundle import error (#50466) · 60538f66
  由 wangzhen38 提交于 2月 14, 2023
  
  60538f66
- K
  [with_data_parallel][part1] remove with_data_parallel in unit test (#50351) · e305771e
  由 kangguangli 提交于 2月 14, 2023
```
* process unit test matched test_p*

* fix ci bug

* fix codestyle

* remove all tests about pe and restore some irrelated tests

* delete test_parallel_executor_test_while_train.py
```
  e305771e
- A
  [Dy2St]Enhance @not_to_static API (#50453) · 842050f2
  由 Aurelius84 提交于 2月 14, 2023
```
* [Dy2St]Enhance @not_to_static API

* del breakpoint()
```
  842050f2
- N
  [CodeStyle] update flake8 config (#50458) · c5087da8
  由 Nyakku Shigure 提交于 2月 14, 2023
```
* update flake8 config

* remove _pb2 from linter ignore list

* refine config

* empty commit, test=document_fix
```
  c5087da8
- M
  
  remove layers.tensor.argmin/argmax/assign/cast/concat/sums (#49944) · b85af464
  由 mhy-666 提交于 2月 14, 2023
  
  b85af464
- H
  [Polish Namespace] Polish operants namespace (#50420) · 61a933ac
  由 HongyuJia 提交于 2月 14, 2023
```
* polish namespace

* change static_tensor_operants

* polish namespace
```
  61a933ac
- S
  
  support int8 for embedding (#50413) · 78eb2d87
  由 seemingwang 提交于 2月 14, 2023
  
  78eb2d87

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功