提交 · 5bf3dec9804e2f0e4489f69394ee0c1ef6cc0b4b · 机器未来 / Paddle

12 8月, 2022 4 次提交

[Auto Parallel] Pybind ProcessMesh and DeviceMesh (#45013) · 5bf3dec9

由 Yulong Ao 提交于 8月 12, 2022

* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh

* [Auto Parallel] Fix the unittest problem

* [Auto Parallel] Explicitly add the src file for auto_parallel target

* [Auto Parallel] Add the proto depedency explicitly

* [Auto Parallel] Fix the cmake bug on windows and mac

* [Auto Parallel] Remove the pybind11 header file in process_mesh.h

5bf3dec9

D
enhance grid_sampler to support 3d input (#45015) · 1773fbba
由 duanyanhui 提交于 8月 12, 2022
```
* enhance grid_sampler to support 3d input
```
1773fbba
Z

fix extra output of kernels for inference (#45048) · 1cb883da
由 zyfncg 提交于 8月 12, 2022

1cb883da

[geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3

由 Siming Dai 提交于 8月 12, 2022

* add init file

* add op definition and infermeta

* add kernel definition funcs

* add broadcast infer shape

* add gpu forward kernel

* delete SUB and DIV

* add x_grad

* add template

* add e_grad for min and max

* fix small bug

* temp commit

* temp commit

* add e_grad for sum and mean

* fix some compile bug

* fix compile bugs

* fix compile problem

* add sum forward unittest

* fix broadcast error, add kernel sig, register e_grad, change unit test

* fix grad

* add temp grad fix

* temp commit

* add min max unittest

* add max, min unittest, fix mul bug

* add cpu forward sum and mean

* add forward min max, fix mean unittest

* add cpu backward min max

* fix code-style

* add backward sum mean

* fix rocm ci

* set uniitest timeout

* fix bug of x broadcast to e, gpu grad

* fix bug of x broadcast to e, cpu grad

* rename BOOST_GET_CONST macro

* fix rocm ci

* mv graph_send_e_recv to graph_send_ue_recv

* move out_size to IntArray

* add eager op test

* fix max pool type bug, add unittest for api

* revise api doc

* add fp16 for atomic min and max, add unittest

* add unittest

* add fp16 support for graph_send_recv

* fix unittest fp16 bug

* change OutSizeTensor to Out_size

* move E to Y

* add copyright, fix comment

* review code

* fix thread block size

* fix thread block size

* change api attribute name: pool_type to reduce_op, compute_type to message_op

* change api attribute name, move pool_type to reduce_op, move compute_type to message_op

615b15a3

11 8月, 2022 6 次提交
- C
  make affine_grid_op support 5d input_dim on cpu and gpu (#45012) · 7812522c
  由 carryyu 提交于 8月 11, 2022
```
* make affine_grid_op support 5d_input on cpu and gpu
```
  7812522c
- Z
  Refine cpups cmake (#45055) · 0dd895d2
  由 zhaocaibei123 提交于 8月 11, 2022
```
* first refine

* second refine

* remove some code unuseful
```
  0dd895d2
- C
  Add input shape record for new dygraph operator (#44999) · 8ea83400
  由 chenjian 提交于 8月 11, 2022
```
* fix

* add control flag and input shapes for new dygraph

* fix file mode

* improve code coverage

* fix a bug in statstic

* fix according to review

* optimize performance

* fix
```
  8ea83400
- Z
  Fix submanifold conv (#45060) · 27e3b06f
  由 zhangkaihuo 提交于 8月 11, 2022
```
* fix submanifold conv
```
  27e3b06f
- W
  
  Change bias to persistable in preln_residual_bias_fuse_pass (#45037) · 26c573de
  由 whs 提交于 8月 11, 2022
  
  26c573de
- W
  Polish black_ops_list logic in eager_gen (#44188) · 49d2a778
  由 Weilong Wu 提交于 8月 11, 2022
```
* Polish black_ops_list logic in eager_gen

* update black_ops_list
```
  49d2a778
10 8月, 2022 11 次提交
- W
  [Paddle Inference]Disable skip layernorm half (#45047) · 4805da50
  由 Wangzheee 提交于 8月 10, 2022
```
* disable_skip_layernorm_fp16
```
  4805da50
- Y
  
  fix mkldnn interpolate ops (#45008) · 3f49817a
  由 yeliang2258 提交于 8月 10, 2022
  
  3f49817a
- C
  
  polish backend and layout details (#45029) · 35839aee
  由 Chen Weihang 提交于 8月 10, 2022
  
  35839aee
- F
  1. change the codegen code to avoid conversion from heterogeneous 'initializer... · 083b4eb6
  由 Feiyu Chan 提交于 8月 10, 2022
```
1. change the codegen code to avoid conversion from heterogeneous 'initializer list' to tuple, which fails on gcc 5.4; (#45036)

2. add a template CheckTensorHasNanOrInf to handle arbitary tuple of supported types.
```
  083b4eb6
- D
  [phi] migration of class center sample infermeta (#45025) · b1e33bea
  由 duanboqiang 提交于 8月 10, 2022
```
* add class center sample infershape

* add yaml

* modify unittest

* modify unittest

* remove comment
```
  b1e33bea
- Z
  add macro control in enforce_xpu.h, test=kunlun (#45022) · 9e74211f
  由 zhangxiaoci 提交于 8月 10, 2022
```
* add macro control in enforce_xpu.h, test=kunlun

* minor bugfix

* minor bugfix
```
  9e74211f
- fix bug of adaptive pool2d_grad, *test=kunlun (#45031) · 01d05bc0
  由 z8hanghuan 提交于 8月 10, 2022
```
* fix bug of adaptive pool2d_grad, *test=kunlun

* fix bug of adaptive pool2d_grad, *test=kunlun

* fix bug of adaptive pool2d_grad, *test=kunlun
```
  01d05bc0
- X
  [Paddle Inference] Support cuda_graph. (#44878) · 84bf5c31
  由 xiaoxiaohehe001 提交于 8月 10, 2022
```
* cuda_graph

* cuda_graph_

* cuda_graph_

* cuda_graph_
```
  84bf5c31
- L
  [new-exec] set cuda device before run (#44985) · 68b06ba6
  由 Leo Chen 提交于 8月 10, 2022
```
* set cuda device before run

* add header file

* fix compile
```
  68b06ba6
- L
  fix proto consistency bug (#45017) · 9c98ee3e
  由 Leo Chen 提交于 8月 10, 2022
```
* fix proto bug

* add ut

* reset need_update for var_desc

* refine code

* fix var desc order issue
```
  9c98ee3e
- A
  [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute (#44737) · 81d6fa6c
  由 Aurelius84 提交于 8月 10, 2022
```
* [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute

* add unittest for inference predictor
```
  81d6fa6c
09 8月, 2022 16 次提交

S

[GNN] Fix graph sample and data type bug (#45001) · 59be2f3b
由 Siming Dai 提交于 8月 09, 2022

59be2f3b
R
Fix copy bug for same src and dst Tensor (#44992) · 125e48c3
由 Ruibiao Chen 提交于 8月 09, 2022
```
* Fix copy bug for same src and dst Tensor

* Improve code design

* Fix errors
```
125e48c3
W
[JitLayer]Rename class type Name2XX (#45006) · be931dfe
由 WangZhen 提交于 8月 09, 2022
```
* Rename class type Name2XX

* Fix return type

* Remove EngineMap function in layer
```
be931dfe
Y

fix mkldnn conv add pass when the dims of res and out are not equel (#45018) · 42c694df
由 yeliang2258 提交于 8月 09, 2022

42c694df

[Eager] support final_state_full_ under eager (#44806) · 31909bb5

由 Weilong Wu 提交于 8月 09, 2022

* [Eager] use final_state_fill_constant_

* fill_constant use str_value

* add fill_constant_ to no_amp_list

* use float(value) as input

* support final state full_ same as fill_constant

31909bb5

[geometric]Add paddle.geometric.send_u_recv API (#44580) · 34b43555

由 Siming Dai 提交于 8月 09, 2022

* change out_size to INTArray

* fix out_size eager bug

* add unittest for out_size tensor

* add deprecated for paddle.incubate.graph_send_recv, add paddle.geometric.send_u_recv and unittests

* fix lowest bug

* fix according review comment

* add default value in yaml

* change api file name

* change name

34b43555

C
move api(erfinv) from legacy_api.yaml to api.yaml (#44987) · 76e0926c
由 Charles-hit 提交于 8月 09, 2022
```
* move api(erfinv) from legacy_api.yaml to api.yaml

* change inplace_map key
```
76e0926c

[phi]migrate class center sample kernel (#44949) · a46d7fe6

由 duanboqiang 提交于 8月 09, 2022

* migrate class center sample kernel

* fix Resize ddim error

* set buffer ptr

* add header

* add header

* remove comment

* remove header

a46d7fe6

Y

fix vol2col (#44998) · ecc3098e
由 yeliang2258 提交于 8月 09, 2022

ecc3098e

[Auto Parallel] Add the c++ dist attrs (#44989) · 2c77b575

由 Yulong Ao 提交于 8月 09, 2022

* [Auto Parallel] Add the c++ dist attrs

* [Auto Parallel] Remove some codes to be less than 1000 lines

2c77b575

W
[JitLayer]Pybind Fucniton and hide ExecutorEngine and PEEngine (#44984) · 2832ab22
由 WangZhen 提交于 8月 09, 2022
```
* Pybind Fucniton and hide ExecutorEngine and PEEngine

* Remove FunctionNames in compilation_unit
```
2832ab22

add phi empty kernel for xpu,*test=kunlun (#44745) · cd0b03cd

由 z8hanghuan 提交于 8月 09, 2022

* add phi empty,*test=kunlun

* support empty op in xpu, *test=kunlun

* support empty op in xpu, *test=kunlun

cd0b03cd

D
[phi] migrate margin infer shape and yaml (#44940) · 6d5744b4
由 duanboqiang 提交于 8月 09, 2022
```
* add margin infer

* migrate yaml

* modify unittests script
```
6d5744b4

refine save/load interface for distributed cpups (#44862) · 7b29c89b

由 zhaocaibei123 提交于 8月 09, 2022

* save load

* save load

* add unittest

* first commit

* second commit

* third commit

* remove SaveLocalFS in memory sparse table

* save dense param

* update

* push slot

* fix push show clk: int -> float

* add unittest

* fix sample

* unittest

* add AsExtra for op

* unittest

* modify fs.py

* modify fs.py

* fix some bugs

* add dataset hdfs config

* local change

* dataset use differenct hadoop ugi/fs_name

* add

* fix conflict

* fix

* remove logs

* code style

* fix

* code style

* code style

* fix

* code style

* save_dense_param

* fix

* fix

* fix

* fix

* change momentum in dense optimzer

* fix

* fix

* change fluid => paddle.static

* remove some unuseful code
Co-authored-by: Nesythan <esythan@126.com>

7b29c89b

Y
Fix a bug in transpose2 when run native cpu (#44659) · 8185cecd
由 yeliang2258 提交于 8月 09, 2022
```
* fix a bug in transpose2 about mkldnn

* fix bug
```
8185cecd
A

fix format for paddle/phi/api/lib/tensor.cc (#44972) · b54abbe8
由 Allen Guo 提交于 8月 09, 2022

b54abbe8

08 8月, 2022 3 次提交
- Y
  Add inf and nan support in equal OP (#44667) · 85df6d73
  由 yeliang2258 提交于 8月 08, 2022
```
* add inf and nan support in equal

* add header

* fix nan and update test

* update test

* update test

* update test

* update code

* update compare test

* update func

* update

* update

* fix

* update
```
  85df6d73
- S
  
  fix memory leak (#44971) · 031debb7
  由 ShenLiang 提交于 8月 08, 2022
  
  031debb7
- Z
  
  BN1D inference support large batch_size (#44977) · c42cbb14
  由 zhangkaihuo 提交于 8月 08, 2022
  
  c42cbb14

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致