提交 · 0710f058c0634099d57d695114ff47858c159a63 · BaiXuePrincess / Paddle

29 8月, 2022 16 次提交

C
Migrate assign xpu kernel into phi (#45467) · 0710f058
由 Chen Weihang 提交于 8月 29, 2022
```
* migrate assign xpu kernel, test=kunlun

* remove assign_value xpu, test=kunlun
```
0710f058

support backward refuse forward dygraph (#45250) · 7cf7084b

由 Charles-hit 提交于 8月 29, 2022

* support refuse forward dygraph

* modify backward api exponential__grad yaml

* remove print code

* 当反向复用前向时进行需不需要更高阶的反向判断，如果不需要调用c++ api，需要的话则调用前向动态图生成反向节点

* fix some backward bugs

* modify the generated dygraph function name

7cf7084b

W
[Phi] gather gather_grad gather_nd gaussian_random xpu to Phi (#45465) · 60e1eccb
由 wanghuancoder 提交于 8月 29, 2022
```
* gather gather_grad gather_nd gaussian_random xpu to phi
```
60e1eccb
Z

refine expand_as_v2 XPU kernel, test=kunlun (#45501) · ca5567e1
由 zhangbo9674 提交于 8月 29, 2022

ca5567e1

Fix HardSwish inf (#35386) · fbd83812

由 Zhang Ting 提交于 8月 29, 2022

* fix hard_swish inf

* skip_check_grad for mkldnn op

* 'fix code style'

* fix unittest

fbd83812

C

[MLU] optimize matmul_grad_v2 dy (B,M,K)*(K,N) for better performance (#45336) · 212b51ef
由 cambriconhsq 提交于 8月 29, 2022

212b51ef
W
[Eager] Pylayer set grad (#45452) · edc9952c
由 wanghuancoder 提交于 8月 29, 2022
```
* pylayer set has grad with create_graph
```
edc9952c
A
[OpAttr]num_rows/num_colums of eye support Tensor type (#45427) · b93b710a
由 Aurelius84 提交于 8月 29, 2022
```
* [OpAttr]num_rows/num_colums of eye support Tensor type

* fix attr cast with long type
```
b93b710a

[geometric]Move graph-related incubate api to geometric (#44970) · 8f657f74

由 Siming Dai 提交于 8月 29, 2022

* move incubate to geometric

* add paddle.geometric

* fix unittest bug

* add float16 support for segment op

* change reindex and sample neighbors flag name

* add heter graph reindex

* move sample_neighbors.py to neighbors.py

* delete khop_sampler in geometric

* delete unused code

* change sample_neighbors api input order

* fix en doc

* fix unittest

* fix unittest

* change reindex

* fix division by 0

* delete unnecessary input argument

* delete final_state

8f657f74

[IPU] support depthwise_conv2d ops (#45234) · a237ff8e

由 Allen Guo 提交于 8月 29, 2022

* support depthwise_conv2d ops
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>

* fix duplicate name
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>

a237ff8e

A

fix compile (#45441) · f49f3b4f
由 Allen Guo 提交于 8月 29, 2022

f49f3b4f

[phi] Transfer merged_adam yaml to phi (#45367) · b4f74eed

由 HongyuJia 提交于 8月 29, 2022

* add legacy_api.yaml

* set merged_momentum inplace only

* support inplace optional<vector<tensor>>

* add dygraph_mode api

* add merged_adam yaml

* add merged_adam python api

* change testcase of merged_adam and adam

* fix import of test_merged_adam_op

b4f74eed

Z

Move expand_as_v2 XPU kernel to PHI, test=kunlun (#45474) · 4b749513
由 zhangbo9674 提交于 8月 29, 2022

4b749513
Z

move expand_v2 to phi, test=kunlun (#45469) · 23a79923
由 zhangbo9674 提交于 8月 29, 2022

23a79923

Move matmul_v2 kernel of xpu from fluid to phi (#45446) · de436f07

由 zyfncg 提交于 8月 29, 2022

* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun

* fix complie bug, test=kunlun

* fix complie bug, test=kunlun

* fix complie bug, test=kunlun

de436f07

W

[XPU] migrate bce_loss to phi;test=kunlun (#45459) · d3ec3fe3
由 Weilong Wu 提交于 8月 29, 2022

d3ec3fe3

26 8月, 2022 14 次提交
- R
  
  Move conv2d_transpose_grad XPU kernel to PHI, test=kunlun (#45466) · a635a8a5
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  a635a8a5
- R
  
  Move grid_sample XPU kernel to PHI, test=kunlun (#45425) · 2c89bccb
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  2c89bccb
- R
  
  Move conv2d_transpose XPU kernel to PHI, test=kunlun (#45419) · 1f1a7835
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  1f1a7835
- Z
  [Phi] Delete xpu kernel of fill_any_like and fill_constant in fluid (#45420) · 6ab80b64
  由 zyfncg 提交于 8月 26, 2022
```
* delete fill xpu op in fluid

* delete fill_constant header, test=kunlun

* fix npu header, test=kunlun
```
  6ab80b64
- W
  Layernorm shape bugfix (#45431) · 3ca8cf44
  由 Wang Bojun 提交于 8月 26, 2022
```
* fix bug fix

* add shape size check

* polish code

* multi -1 shape fix

* code style improve

* bug fix

* code style fix
```
  3ca8cf44
- W
  
  [Eager] delete final state pre-name (#45306) · 126940b3
  由 wanghuancoder 提交于 8月 26, 2022
  
  126940b3
- W
  
  fix_multihead (#45429) · fa06d9c3
  由 Wangzheee 提交于 8月 26, 2022
  
  fa06d9c3
- D
  
  fix brpc update compile error; test=develop (#45438) · a5e9ccda
  由 danleifeng 提交于 8月 26, 2022
  
  a5e9ccda
- H
  
  [XPU] add load_combine_op_xpu. test=kunlun (#45436) · 3055d71a
  由 houj04 提交于 8月 26, 2022
  
  3055d71a
- K
  Transfer transfer_layout from fluid to phi (#45261) · 985f2a4a
  由 kangguangli 提交于 8月 26, 2022
```
* remove fluid kernel and activate phi kernel

* fix parameter error

* transfer mkldnn part

* modify header file path

* fix compile error

* transfer special case

* fix lod setting and special case for layout setting

* add testcase and refine code
```
  985f2a4a
- H
  
  Modify PE Engine thread from 2 into 1 in JitLayer (#45356) · 9382159d
  由 Hui Zhang 提交于 8月 26, 2022
  
  9382159d
- H
  fix reduce mean grad bug *test=kunlun (#45401) · 2a992178
  由 haosicheng 提交于 8月 26, 2022
```
* add temporal shift and grad *test=kunlun

* fix reduce mean grad bug *test=kunlun
```
  2a992178
- X
  [ Dy2static ] select input fix and while_op memory bug fixed. (#45380) · 91298884
  由 xiongkun 提交于 8月 26, 2022
```
* while support for python container.
It is convenient to convert more dynamic graph codes into static graphs.

* cond support python container

* 1. make select_input output shape = input[1]
2. add warning in while_loop risky assign

* fix 2 problem in GPT export:
1. a bug in while_op no_need_copy_var, which causes gpu memory leakage
2. a bug in undefined_var where the stop_gradient should be False.

* change name by code review

* format
```
  91298884
- 王
  
  [NPU] fix CI error in new executor. (#45432) · f4193eac
  由王明冬提交于 8月 26, 2022
  
  f4193eac
25 8月, 2022 10 次提交

F

add support for double attributes (#45390) · efab2eb4
由 Feiyu Chan 提交于 8月 25, 2022

efab2eb4

Enable OMP multithreading in lookup_table_v2 (#45249) · 0c363de8

由 piotrekobi 提交于 8月 25, 2022

* Add omp parallel for directives

* Revert "Add omp parallel for directives"

This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6.

* Add #pragma omp parallel for to correct file

* Add check for _OPENMP definition

* Disable omp on gpu

* Trigger CI

* Readd check for _OPENMP definition

* Change macro disabling changes on GPU

* Improve macro readability

0c363de8

A
[OpAttr]axis of Reverse Support Tensor type (#45391) · 91110661
由 Aurelius84 提交于 8月 25, 2022
```
* [OpAttr]axis of Reverse Support Tensor type

* fix coverage

* fix unittest
```
91110661
D
update brpc version to 1.2.0 (#45351) · 9b5b005e
由 danleifeng 提交于 8月 25, 2022
```
* update brpc version;test=develop
```
9b5b005e
A
[OpAttr]min/max of uniform_random support Tensor type (#45417) · c8955d0d
由 Aurelius84 提交于 8月 25, 2022
```
* [OpAttr]min/max of Uniform_rand support Tensor type

* fix typo
```
c8955d0d
C
Fix record operator input shapes segment fault in new dygraph (#45360) · 4d78390e
由 chenjian 提交于 8月 25, 2022
```
* fix segment fault

* fix
```
4d78390e

Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a

由 kangguangli 提交于 8月 25, 2022

* transfer memcpy_d2h from fluid to phi

* refine arg check and add comment

* fix cannot fallback to phi kernel

* fix gpu_context host alloc when tensor size = 0

* add kernel for std::vector<DenseTensor> args

* fix bugs in MemcpyD2HMultiIOKernel

* remove useless header file

* polish format

* fix typo

* add testcase for cudapinned place

* refine check condition in test

* polish error message

* polish error message

* remove header in fluid  directory

* merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation

* fix code style check

0d14e74a

R
[NPU] add run_program_op_npu (#45349) · 64afa638
由 ronnywang 提交于 8月 25, 2022
```
* [NPU] add run_program_op_npu

* add run_program_op_npu ut
```
64afa638
S
make full_like support double_max in dygraph (#45385) · edd66f2e
由 Sing_chan 提交于 8月 25, 2022
```
* make full_like support double_max in dygraph

* fix bug
```
edd66f2e
W
[Eager] sync_batch_norm_grad delete mean and variance (#45411) · 5df464fe
由 wanghuancoder 提交于 8月 25, 2022
```
* sync_batch_norm_grad delete mean and variance
```
5df464fe

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致