提交 · 5ffc22da8ebbfa32473482f930a04a7b5a887e49 · PaddlePaddle / Paddle

18 1月, 2023 5 次提交
- H
  [Zero-Dim]Paddle.t support 0d tensor (#49880) · 5ffc22da
  由 heliqi 提交于 1月 18, 2023
```
* support paddle.t 0d tensor

* fix paddle.t test case

* merge from develop
```
  5ffc22da
- J
  
  kunlun support p2p send/recv (#49896) · 7242f40b
  由 jameszhang 提交于 1月 18, 2023
  
  7242f40b
- Q
  
  Zero-dim support of histogram kernel, test=develop (#49884) · 6cd7fcaf
  由 Qi Li 提交于 1月 18, 2023
  
  6cd7fcaf
- W
  [0 Tensor support] support the 0d tensor for the cumsum (#49518) · 5fca45ea
  由 wawltor 提交于 1月 18, 2023
```
* Add the cumsum 0d tensor

* xpu and cpu judge the 0d  tensor

* change to 2022 to 2023 in new commit

* fix the reverse logic
```
  5fca45ea
- H
  Enrich 0d Tensor Dygraph and Shape Unit Test for `case` and `switch_case` (#49889) · 77376727
  由 Huihuang Zheng 提交于 1月 18, 2023
```
Followed PR https://github.com/PaddlePaddle/Paddle/pull/49842 , added Digraph and Shape unit test for `case` and `switch_case`. This PR only contained test changes because `case` and `switch_case` call `cond`. The PR https://github.com/PaddlePaddle/Paddle/pull/49842 has already solved the 0d tensor support.
```
  77376727
17 1月, 2023 17 次提交

Z
[Zero-Dim] Support input 0D Tensor for masked_select (#49803) · ce045890
由 Zhang Zheng 提交于 1月 17, 2023
```
* [Zero-Dim] Support input 0D Tensor for masked_select
```
ce045890

Refine munmap freq for RefcountedMemoryMapAllocation (#49691) · 3fdc105f

由 zhangbo9674 提交于 1月 17, 2023

* refine munmap freq for ref_cnt_mmap_allocator

* add shm reuse logic

* fix compile bug

* fix compile bug

* fix bug of file refcount

* fix compile bug

* fix compile bug

* refine code for delete shm case

* polish code

* refine shm cache pool size setting logic

* set buffer is 2

* refine shm cache size logic

* refine max shm cache

* refine shm cache size

3fdc105f

X
[Dy2Static] fix switch static graph affects dataloader (#49821) · 18745e6f
由 xiongkun 提交于 1月 17, 2023
```
* rebase merge

* code fix

* fix bugs
```
18745e6f
C

Update PostQuantTraining zero size (#49868) · 611da7fc
由 Chang Xu 提交于 1月 17, 2023

611da7fc

Rewrite mat reshape transpose testers (#49580) · d9d47dc6

由 Paulina Gacek 提交于 1月 17, 2023

* reshape_transpose_matmul_pass_tester rewritten

* matmul_transpose_reshape_pass_tester rewritten

* mkldnn to onednn

d9d47dc6

Y
[Zero-Dim] support input 0D Tensor for equal_all (#49845) · f287b1e9
由 yeliang2258 提交于 1月 17, 2023
```
* add zero dims test

* update code

* fix zero dims

* update code
```
f287b1e9

support CUDA Graph for new executor (#49708) · 8e5ed04d

由 pangyoki 提交于 1月 17, 2023

* new exe supports CUDA Graph

* fix

* fix

* fix

* fix FLAGS_use_stream_safe_cuda_allocator in unittest

* insert output of coalesce_tensor op to skip_gc_var

* fix

8e5ed04d

Merge ops composite into to_static (#49836) · b2a10916

由 cyber-pioneer 提交于 1月 17, 2023

* support @to_static+to_prime+cinn

* fix code logic

* debug4

* debug5

* debug6

* debug7

* debug 8

* debug 9

* debug10

* debug11

* debug11

* debug 12
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>

b2a10916

W
Fix translated layer fine-tune error (#49870) · 412573f0
由 WangZhen 提交于 1月 17, 2023
```
* Fix translated layer fine-tune
```
412573f0
D

fix ps ut error;test=develop (#49867) · 56cacae9
由 danleifeng 提交于 1月 17, 2023

56cacae9
J

add test for composite with dy2st (#49873) · b927ce81
由 Jiabin Yang 提交于 1月 17, 2023

b927ce81

Support 0d Tensor in ConditionalBlockOp (#49842) · 791637cf

由 Huihuang Zheng 提交于 1月 17, 2023

Support 0d Tensor in ConditionalBlockOp

1. Add dygraph 0d tensor support for ConditionalBlockOp
2. Set scalar loss shape when `append_backward`

791637cf

姜

rm flag retain grad (#49835) · 73f97de0

由姜永久提交于 1月 17, 2023

* rm retain grad

* fix zero_dim

* fix zero_dim for xpu

* reset zero dim for xpu

* reset xpu

* reset custom_relu

* Reset flip

* fix zero dim

73f97de0

Z

Fix the paddle/staitc/amp/__init__.py (#49791) · fcc90531
由 zhangkaihuo 提交于 1月 17, 2023

fcc90531
disable scatter zero_dim test (#49853) · 86fa1715
由 zhouweiwei2014 提交于 1月 17, 2023

86fa1715
W
[Dy2St]Support call backward() without params in dy2st (#49812) · 2f24b2d8
由 WangZhen 提交于 1月 17, 2023
```
* Support call backward() without params in dy2st
```
2f24b2d8

【Prim】Add multiply,expand,div vjp rules (#49831) · 39c6765a

由 Xiaoxu Chen 提交于 1月 17, 2023

* support elementwise base func

* fix compiling error and add test

* support vjp for div using comp

* remove additional change

* fix dy2st error with magic num

* fix dy magic num

* another magic

* another magic

* another magic

* add skip rename strategy

* support add vjp

* support add with new axis cal

* support sub vjp

* [prim] add multiply vjp rules

* [prim] add multiply vjp rules

* [prim] fix no infershape with composite in _append_backward_ops

* [prim] add expand vjp rule

* [prim] add exp vjp rule

* uncomment infer shape for reshape/sum static prim api

* [prim] fix tanh nullptr error

* remove some print message

* fix magic number in run_program relative tests @JiaBinYang

* [prim] add expand,multiply,exp vjp rules

* fix only support single direction reduce error

* infer reduce dims using out dims
Co-authored-by: NJiabinYang <360788950@qq.com>

39c6765a

16 1月, 2023 10 次提交

W

[PHI] channel_shuffle add yaml (#49808) · 56dbe426
由 Weilong Wu 提交于 1月 16, 2023

56dbe426
W

add add_n for the 0d tensor (#49854) · 65b0181e
由 wawltor 提交于 1月 16, 2023

65b0181e
Y
[Paddle-TRT] support nhwc (#49633) · e43f7102
由 Yuanle Liu 提交于 1月 16, 2023
```
* add trt_support_nhwc_pass
```
e43f7102
W

[Fluid clean]clean distributed fluid API (#49795) · 7de9420a
由 wangxiaoning 提交于 1月 16, 2023

7de9420a

Fix paddle save for multi-processing (#49657) · 504db4f5

由 Ghost Screaming 提交于 1月 16, 2023

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Remove climits.

* Fix bug of paddle.save. It may cause bug for saving sharded optimizer
state_dict() in parallel.

504db4f5

Q

add prod for kunlun (#49816) · bd03652f
由 QingshuChen 提交于 1月 16, 2023

bd03652f
W

[Eager] polish bmm api (#49823) · d3d69d8c
由 Weilong Wu 提交于 1月 16, 2023

d3d69d8c
Z

add sqrt_comp_grad composite rule (#49769) · 70378584
由 zqw_1997 提交于 1月 16, 2023

70378584
X

【prim】vjp for reduce sum (#49736) · 292f3f77
由 xiaoguoguo626807 提交于 1月 16, 2023

292f3f77

[Auto Parallel] Clear some fluid APIs (#49793) · e70af91d

由 Yulong Ao 提交于 1月 16, 2023

* [Auto Parallel] Rename methods of ProcessMesh

* [Auto Parallel] Impl the python process_mesh by the c++ one

* [Auto Parallel] Add some minor modifications

* [Auto Parallel] Rename some methods

* [Auto Parallel] Remove unnecessary codes

* [Auto Parallel] Add back some removed files

* [Auto Parallel] Fix bugs

* [Auto Parallel] Fix a bug

* Update process_mesh.cc

* [Auto Parallel] Merge dist attrs of Python into C++

* [Auto Parallel] Add back deleted importing

* [Auto Parallel] Add back removed unittest

* [Auto Parallel] Remove type qualifiers of return types

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix a bug of the quant pass

* [Auto Parallel] Fix the code style

* [Auto Parallel] Clear some fluid APIs

e70af91d

15 1月, 2023 1 次提交

【Prim】Enhance tests (#49814) · 090aa45d

由 Jiabin Yang 提交于 1月 15, 2023

* support elementwise base func

* fix compiling error and add test

* remove additional param

* support vjp for div using comp

* remove additional change

* fix dy2st error with magic num

* fix dy magic num

* another magic

* another magic

* add more test

* fix windows problem

* another magic

* fix windows compile

* invoke ci

* add skip rename strategy

* support add vjp

* fix test_tanh

* support add with new axis cal

* fix resnet and some test

* add composite log

* support sub vjp

* enhance_tests

* support more dtype for full

090aa45d

13 1月, 2023 7 次提交
- W
  
  [Phi] heaviside add yaml (#49807) · 4b7aeba4
  由 Weilong Wu 提交于 1月 13, 2023
  
  4b7aeba4
- C
  
  New feature: add register composite rule of ops (#49605) · 6ed8221a
  由 cyber-pioneer 提交于 1月 13, 2023
  
  6ed8221a
- W
  add oss flash fmha and fmhca support (#49438) · a48b8e2c
  由 Wang Bojun 提交于 1月 13, 2023
```
* add fmha_flashattention oss plugin
```
  a48b8e2c
- [Zero-Dim]simplify static unittest (#49805) · 650a0836
  由 zhouweiwei2014 提交于 1月 13, 2023
  
  650a0836
- W
  
  refine _grad_ivar (#49787) · 93cee48e
  由 wanghuancoder 提交于 1月 13, 2023
  
  93cee48e
- R
  [Zero-Dim] add where, atan2, median 0-Dim ut (#49692) · 1508cae7
  由 ronnywang 提交于 1月 13, 2023
```
* add where, atan2, median 0d ut

* add where, atan2, median 0d ut

* update

* update

* update
```
  1508cae7
- Z
  [inference][trt]set output data type of trt network (#49712) · 690d7a69
  由 Zhang Jun 提交于 1月 13, 2023
```
* update trt engine to set in/out data type

* update

* Update engine.cc

* Update engine.cc

* update

* set engine output type before freeze the network

* update

* update trt autoscan ut

* update

* update ut

* fix equal bug, update ut

* fix cast and equal ut

* update cast ut using TRT < 8.4

* set datatype from scope

* check output var is nullptr

* Update op_converter.h

* update tensorrt_engine_op_test ut

* update
```
  690d7a69

PaddlePaddle / Paddle 2 年多 前同步成功

PaddlePaddle / Paddle
2 年多前同步成功