提交 · 4c8c5b4b6e897be3297fbff8aa29ebe2f32bd005 · PaddlePaddle / Paddle

18 1月, 2023 10 次提交
- H
  Modify English Doc of ControlFlow APIs for Scalar Tensor (#49890) · 4c8c5b4b
  由 Huihuang Zheng 提交于 1月 18, 2023
```
Modify English Doc of ControlFlow APIs for Scalar Tensor

Corresponding Chinese Doc PR: https://github.com/PaddlePaddle/docs/pull/5588
```
  4c8c5b4b
- H
  [Zero-Dim]Paddle.t support 0d tensor (#49880) · 5ffc22da
  由 heliqi 提交于 1月 18, 2023
```
* support paddle.t 0d tensor

* fix paddle.t test case

* merge from develop
```
  5ffc22da
- J
  
  kunlun support p2p send/recv (#49896) · 7242f40b
  由 jameszhang 提交于 1月 18, 2023
  
  7242f40b
- Q
  
  Zero-dim support of histogram kernel, test=develop (#49884) · 6cd7fcaf
  由 Qi Li 提交于 1月 18, 2023
  
  6cd7fcaf
- W
  [0 Tensor support] support the 0d tensor for the cumsum (#49518) · 5fca45ea
  由 wawltor 提交于 1月 18, 2023
```
* Add the cumsum 0d tensor

* xpu and cpu judge the 0d  tensor

* change to 2022 to 2023 in new commit

* fix the reverse logic
```
  5fca45ea
- Z
  
  [Zero-Dim] Fix bug in masked_select for XPU (#49904) · 1a8be158
  由 Zhang Zheng 提交于 1月 18, 2023
  
  1a8be158
- L
  
  fix both with_rpc and with_distributed on (#49878) · 3bf74127
  由 LiYuRio 提交于 1月 18, 2023
  
  3bf74127
- L
  
  fix cinn compilation with py38 (#49883) · bc93452d
  由 Leo Chen 提交于 1月 18, 2023
  
  bc93452d
- J
  use default XPU stream for computing (#49806) · f6b23d6d
  由 jameszhang 提交于 1月 18, 2023
```
* revert to use default XPU stream for computing

XPUContext now has a null stream by default. If you want to use a separate stream
 (e.g. in async collective communication), you should create a dedicated XPUContext
and invoke its XPUContext::CreateStream()

* minor
```
  f6b23d6d
- H
  Enrich 0d Tensor Dygraph and Shape Unit Test for `case` and `switch_case` (#49889) · 77376727
  由 Huihuang Zheng 提交于 1月 18, 2023
```
Followed PR https://github.com/PaddlePaddle/Paddle/pull/49842 , added Digraph and Shape unit test for `case` and `switch_case`. This PR only contained test changes because `case` and `switch_case` call `cond`. The PR https://github.com/PaddlePaddle/Paddle/pull/49842 has already solved the 0d tensor support.
```
  77376727
17 1月, 2023 24 次提交

Z
[Zero-Dim] Support input 0D Tensor for masked_select (#49803) · ce045890
由 Zhang Zheng 提交于 1月 17, 2023
```
* [Zero-Dim] Support input 0D Tensor for masked_select
```
ce045890
J
Add more dy2st ut2 (#49881) · 2242136a
由 Jiabin Yang 提交于 1月 17, 2023
```
* add test for composite with dy2st

* add more log
```
2242136a

Refine munmap freq for RefcountedMemoryMapAllocation (#49691) · 3fdc105f

由 zhangbo9674 提交于 1月 17, 2023

* refine munmap freq for ref_cnt_mmap_allocator

* add shm reuse logic

* fix compile bug

* fix compile bug

* fix bug of file refcount

* fix compile bug

* fix compile bug

* refine code for delete shm case

* polish code

* refine shm cache pool size setting logic

* set buffer is 2

* refine shm cache size logic

* refine max shm cache

* refine shm cache size

3fdc105f

X
[Dy2Static] fix switch static graph affects dataloader (#49821) · 18745e6f
由 xiongkun 提交于 1月 17, 2023
```
* rebase merge

* code fix

* fix bugs
```
18745e6f
C

Update PostQuantTraining zero size (#49868) · 611da7fc
由 Chang Xu 提交于 1月 17, 2023

611da7fc

Rewrite mat reshape transpose testers (#49580) · d9d47dc6

由 Paulina Gacek 提交于 1月 17, 2023

* reshape_transpose_matmul_pass_tester rewritten

* matmul_transpose_reshape_pass_tester rewritten

* mkldnn to onednn

d9d47dc6

Y
[Zero-Dim] support input 0D Tensor for equal_all (#49845) · f287b1e9
由 yeliang2258 提交于 1月 17, 2023
```
* add zero dims test

* update code

* fix zero dims

* update code
```
f287b1e9

support CUDA Graph for new executor (#49708) · 8e5ed04d

由 pangyoki 提交于 1月 17, 2023

* new exe supports CUDA Graph

* fix

* fix

* fix

* fix FLAGS_use_stream_safe_cuda_allocator in unittest

* insert output of coalesce_tensor op to skip_gc_var

* fix

8e5ed04d

P

add https://twitter.com/PaddlePaddle_ to README.md (#48227) · 76302bdc
由 PPGitub 提交于 1月 17, 2023

76302bdc

Prim api gen (#49654) · 813e27c9

由 xiaoguoguo626807 提交于 1月 17, 2023

* proto type of composite grad in paddle

* proto type of composite grad in paddle

* refactor composite api with phi

* fix compile error

* support static graph code-gen for squeeze op

* generate static graph code of unsqueeze

* refine op name

* fix compile error

* add extra output in op_compat

* remove debug log

* fix clang compile error

* support prim switch flag

* support prim switch flag

* fix dygraph error

* merge develop

* add code_gen

* add necessary files without codegen

* fix code_gen bug

* add deps

* modify igmnore

* add ignore

* delete std cout

* add composite logic for backward.py

* add tanh first order grad composite

* support enable_prim flag for static graph

* throw expection when both GrapOpMaker and GradCompOpMaker not been registered

* reorganize the directory of prim api tests

* fix windows error

* add eager_utils

* add eager_utils

* modify code gen

* add composite parse

* add unittest for get_grad_op_desc

* code optimize

* fix static test on windows

* support generate static graph code for imag and real op

* fix windows compile error in test_static_prim

* merge develop

* disable test eager in inference

* prim code gen

* disable eager compile in inference

* origin_yaml codegen success

* rm other file

* rm gitignore file

* code_style

* add eager test

* code_style

* clear #

* merge develop

* clear #

* remove useless files

* modify static test

* support bool flag from singlton

* merge develop

* recover git ignore

* fix conflict

* clear prim_gen

* recover git ignore for generated op

* parse_yaml success

* fix test compile error

* remove some tests

* add python test

* code_style

* revert parse_utils+ clear prim_gen

* fix some name issue

* add composite code gen

* modify backward yaml

* fix static composite grad maker code gen

* remove addtional files

* add some static funcs unit test

* fix some bugs

* fix composite grad maker register code gen

* optimize some functions

* modify gen cmake

* add more api gen

* add header

* modify static

* add static expand unsqueeze

* comments

* modify compopmaker

* revert

* modify gen name
Co-authored-by: NJiabinYang <360788950@qq.com>
Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>
Co-authored-by: Ncharles-hit <wanghao107@baidu.com>

813e27c9

[PHI]Change feed_op to phi kernel (#49116) · f7f1dc03

由 YuanRisheng 提交于 1月 17, 2023

* change feed_op to phi kernel

* fix ci bugs

* fix build bugs

* fix ci bugs

* fix compile bugs

* fix ci bugs

* perfect code

* perfect comment code

* fix install bugs

* modify code according comment

* remove visitor in feed_op

* modify according comment

* perfect code according comment

* add infershape

* fix py3 bugs

* fix getexpected kernel type

* fix getexpected kernel type

* fix ci bugs

* add registry for custom device

* fix py3 bugs

* fix floating point error

* fix py3 test bugs

f7f1dc03

Merge ops composite into to_static (#49836) · b2a10916

由 cyber-pioneer 提交于 1月 17, 2023

* support @to_static+to_prime+cinn

* fix code logic

* debug4

* debug5

* debug6

* debug7

* debug 8

* debug 9

* debug10

* debug11

* debug11

* debug 12
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>

b2a10916

W
Fix translated layer fine-tune error (#49870) · 412573f0
由 WangZhen 提交于 1月 17, 2023
```
* Fix translated layer fine-tune
```
412573f0
D

fix ps ut error;test=develop (#49867) · 56cacae9
由 danleifeng 提交于 1月 17, 2023

56cacae9
J

add test for composite with dy2st (#49873) · b927ce81
由 Jiabin Yang 提交于 1月 17, 2023

b927ce81

Support 0d Tensor in ConditionalBlockOp (#49842) · 791637cf

由 Huihuang Zheng 提交于 1月 17, 2023

Support 0d Tensor in ConditionalBlockOp

1. Add dygraph 0d tensor support for ConditionalBlockOp
2. Set scalar loss shape when `append_backward`

791637cf

姜

rm flag retain grad (#49835) · 73f97de0

由姜永久提交于 1月 17, 2023

* rm retain grad

* fix zero_dim

* fix zero_dim for xpu

* reset zero dim for xpu

* reset xpu

* reset custom_relu

* Reset flip

* fix zero dim

73f97de0

Fix build ci (#49879) · 60ee518a

由 risemeup1 提交于 1月 17, 2023

* fix build ci bug

* fix build ci bug,test=test=document_fix

* fix build ci bug,test=document_fix

60ee518a

Z

Fix the paddle/staitc/amp/__init__.py (#49791) · fcc90531
由 zhangkaihuo 提交于 1月 17, 2023

fcc90531
disable scatter zero_dim test (#49853) · 86fa1715
由 zhouweiwei2014 提交于 1月 17, 2023

86fa1715
H

SetDevice when parse TensorBase (#49860) · 4c576870
由 HongyuJia 提交于 1月 17, 2023

4c576870
W
[Dy2St]Support call backward() without params in dy2st (#49812) · 2f24b2d8
由 WangZhen 提交于 1月 17, 2023
```
* Support call backward() without params in dy2st
```
2f24b2d8
L

Modified compute and amplifier interceptor (#42044) · 989e39a5
由 LiYuRio 提交于 1月 17, 2023

989e39a5

【Prim】Add multiply,expand,div vjp rules (#49831) · 39c6765a

由 Xiaoxu Chen 提交于 1月 17, 2023

* support elementwise base func

* fix compiling error and add test

* support vjp for div using comp

* remove additional change

* fix dy2st error with magic num

* fix dy magic num

* another magic

* another magic

* another magic

* add skip rename strategy

* support add vjp

* support add with new axis cal

* support sub vjp

* [prim] add multiply vjp rules

* [prim] add multiply vjp rules

* [prim] fix no infershape with composite in _append_backward_ops

* [prim] add expand vjp rule

* [prim] add exp vjp rule

* uncomment infer shape for reshape/sum static prim api

* [prim] fix tanh nullptr error

* remove some print message

* fix magic number in run_program relative tests @JiaBinYang

* [prim] add expand,multiply,exp vjp rules

* fix only support single direction reduce error

* infer reduce dims using out dims
Co-authored-by: NJiabinYang <360788950@qq.com>

39c6765a

16 1月, 2023 6 次提交
- Support the 'data_transform' for generating static graph ops (#49772) · 28864137
  由 HappyHeavyRain 提交于 1月 16, 2023
```
* support the 'data_transform' for generating static graph ops

* reset 'pow' code

* change the 'GetKernelTypeForVar'
```
  28864137
- Z
  CUDA12.0 integration (#49539) · 1885d55a
  由 zlsh80826 提交于 1月 16, 2023
```
* Update warpctc for cuda-12

* Deprecate cudaProfilerInitialize for CUDA > 11

* Deprecate CUSPARSE_MV_ALG_DEFAULT for CUDA_VERSION >= 11040

* Add the missing thrust header
```
  1885d55a
- Z
  [inference] Use output var name to mark the NVTX flag (#49825) · ea2e2495
  由 Zhang Jun 提交于 1月 16, 2023
```
* add outvar name for nvtx mark

* nly network created with kEXPLICIT_BATCH can setsetMaxBatchSize
```
  ea2e2495
- W
  
  [PHI] channel_shuffle add yaml (#49808) · 56dbe426
  由 Weilong Wu 提交于 1月 16, 2023
  
  56dbe426
- W
  
  add add_n for the 0d tensor (#49854) · 65b0181e
  由 wawltor 提交于 1月 16, 2023
  
  65b0181e
- R
  
  optimize build_type (#49826) · 8fdb9087
  由 risemeup1 提交于 1月 16, 2023
  
  8fdb9087

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功