提交 · e61b25f930032c9bfa7fcc5ce3b18a190158f305 · Crayon鑫 / Paddle

08 6月, 2022 3 次提交
- X
  
  call_once (#43206) · cad139a7
  由 xiaoxiaohehe001 提交于 6月 08, 2022
  
  cad139a7
- Z
  
  fix tensor copy bug (#43299) · 88216f63
  由 zyfncg 提交于 6月 08, 2022
  
  88216f63
- Y
  [Phi]Move group op kernel into PHI and add yaml / unittest (#43104) · 99c6497b
  由 YuanRisheng 提交于 6月 08, 2022
```
* move_group_norm

* move group norm backward

* fix code format

* modify code according comment
```
  99c6497b
07 6月, 2022 6 次提交
- S
  
  Optimized the performance of activation op in XPU2 (#43187) · d5afc1ba
  由 shixingbo 提交于 6月 07, 2022
  
  d5afc1ba
- L
  
  Allocate and use new memory for temp data in cumsum kernel (#43101) · 5dcebb9b
  由 Leo Chen 提交于 6月 07, 2022
  
  5dcebb9b
- G
  
  add bf16 dtype for flatten kernel (#43264) · 0fdb3ced
  由 Guoxia Wang 提交于 6月 07, 2022
  
  0fdb3ced
- W
  
  [multi-stream] Fix split and concat problem. (#43039) · 8c3777df
  由 Wilber 提交于 6月 07, 2022
  
  8c3777df
- L
  Transpose optimization with assitant of Chengdu Supercomputing Center and... · 71a63f0a
  由 limingshu 提交于 6月 07, 2022
```
Transpose optimization with assitant of  Chengdu Supercomputing Center and auto_tune operation (#42704)
```
  71a63f0a
- N
  
  [XPU KP]Add xpu register, any, amax, amin op test (#43204) · aec49361
  由 niuliling123 提交于 6月 07, 2022
  
  aec49361
06 6月, 2022 1 次提交
- N
  
  Replace ReduceAmax/Amax.part.cu with KP (#43202) · 39903f72
  由 niuliling123 提交于 6月 06, 2022
  
  39903f72
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
04 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：cmake-format (#43057) · 92568edb
  由 Sing_chan 提交于 6月 04, 2022
  
  92568edb
02 6月, 2022 2 次提交
- S
  Support hetergraph reindex (#43128) · ceb20406
  由 Siming Dai 提交于 6月 02, 2022
```
* support heter reindex

* add unittest, fix bug

* add comment

* delete empty line

* refine example

* fix codestyle

* add disable static
```
  ceb20406
- L
  Extend forward fast layer_norm kernel to support more dimensions. (#43118) · 85baa3c0
  由 Li Min 提交于 6月 02, 2022
```
* extend forward fast_ln_kernel to support more column values.
```
  85baa3c0
01 6月, 2022 3 次提交

Y
Add yaml and unittest for instance_norm op (#43060) · 56ae33b6
由 YuanRisheng 提交于 6月 01, 2022
```
* add yaml

* fix infrt compile bugs
```
56ae33b6
A

[fix] split nanmedian fluid deps (#43135) · b23914c2
由 Aganlengzi 提交于 6月 01, 2022

b23914c2

[Yaml]add conv3d, depthwise_conv2d yaml (#42807) · 5f2c251c

由 chentianyu03 提交于 6月 01, 2022

* add conv3d yaml

* add conv3d_grad, conv3d_double_grad

* add final_state_conv3d test case

* add conv3d double test case

* add depthwise_conv2d grad yaml

* add depthwise_conv2d double grad test case

* modify the order of args

* add depthwise_conv2d_grad_grad config

5f2c251c

31 5月, 2022 4 次提交

C
[Phi] Polish assign kernel copy impl (#43061) · c9e7c407
由 Chen Weihang 提交于 5月 31, 2022
```
* fix assign kernel copy impl

* fix test failed
```
c9e7c407

【PaddlePaddle Hackathon 2】16 新增 API RRelu (#41823) · 21e1d10f

由 thunder95 提交于 5月 31, 2022

* rrelu逻辑部分

* unregistered op kernel (unresolved)

* commit before merge

* 丰富测试用例

* 修复rrelu-sig的bug

* 修复cpu环境测试

* 修改拼写错误

* 修改code format

* 尝试优化测试用例timeout的问题

* 优化测试用例

* 移除seed, 优化随机函数

* update en doc for rrelu

* fix rrelu en docs, test=document_fix

* add paper link for en docs, test=document_fix

* udpate en doc

* add r,test=document_fix

21e1d10f

[EinsumOp] Make EinsumOp support bfloat16. (#43085) · a4bb38cb

由 xiongkun 提交于 5月 31, 2022

* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0

* make EInsumOP support bf16

* add unittest for BF16

* add condition for test_BF16

* fix bugs

* fix

a4bb38cb

add embedding yaml (#43029) · 2785f876

由 zyfncg 提交于 5月 31, 2022

* add embedding yaml

* fix infermeta bug

* fix bug of selected_rows infer_meta

* fix selected_rows

* add unittest

2785f876

30 5月, 2022 5 次提交

C

Implement fused_gate_attention operator for AlphaFold. (#42018) · fdcdbec5
由 crystal 提交于 5月 30, 2022

fdcdbec5

【PaddlePaddle Hackathon 2】15 新增 API Nanmedian (#42385) · f87fa3c0

由 thunder95 提交于 5月 30, 2022

* nanmedian op

* 修改cuda kernel的bug

* 修复count_if在其他硬件平台不兼容

* 修复某些cpu硬件不兼容

* 修复某些cpu硬件不兼容

* 修复isnan判断

* 兼容numpy低版本不支持全部nan的情况

* 兼容numpy低版本不支持全部nan的情况

* fix code example

* fix api comment error

* 修改反向传播逻辑以及c++处理逻辑

* 完成修改建议

* typo pre_dim

* update en docs, test=document_fix

* remove numpy in en doc, test=document_fix

* add r,test=document_fix

* 添加api到all

* follow advice from chenwhql

f87fa3c0

L
Optimize memcpy operation in Eigh (#42853) · 806073d6
由 limingshu 提交于 5月 30, 2022
```
* 1st commit

* fix usless change in header transpose_kernel_h file

* add sync
```
806073d6
A
[fix] addmm supports 1-d input (#42959) · 849d937b
由 Aganlengzi 提交于 5月 30, 2022
```
* addmm supports 1-d input

* fix coverage

* fix

* more ut
```
849d937b
Z
Make data transform inplaced when tensor is on GPUPinned (#43055) · 114a5d21
由 zyfncg 提交于 5月 30, 2022
```
* make data transform inplace when tensor is on gpupinned in new dygraph

* fix unittest
```
114a5d21

27 5月, 2022 2 次提交

[Phi] Change optional tensor from `optional<const Tensor&>` to `optional<Tensor>` (#42939) · 6d78524c

由 zyfncg 提交于 5月 27, 2022

* refactor the optional tensor

* remove optiona<MetaTensor> in InferMeta

* fix bug

* fix optional<vector<Tensor>>

* fix bug

* fix rmsprop

* fix amp of eager_gen

* polish code

* fix deleted code

* fix merge conflict

* polish code

* remove is_nullopt_

* fix merge conflict

* fix merge conflict

6d78524c

X

change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 (#43010) · 668e235c
由 xiongkun 提交于 5月 27, 2022

668e235c

26 5月, 2022 2 次提交
- Y
  
  move instance_norm_double_grad (#43021) · b2b78cd4
  由 YuanRisheng 提交于 5月 26, 2022
  
  b2b78cd4
- Y
  [Phi]Refactor InstanceNormKernel and InstanceNormGradKernel (#42978) · cc272afb
  由 YuanRisheng 提交于 5月 26, 2022
```
* move instance_norm

* change mutable_data

* fix compile bugs
```
  cc272afb
25 5月, 2022 2 次提交

fix maybe-uninitialized warning (#42902) · f1f79b0d

由 Leo Chen 提交于 5月 25, 2022

* fix maybe-uninitialized warning

* fix compile

* fix xpu compile

* fix npu compile

* fix infer compile

* fix compile

* fix compile

f1f79b0d

[EinsumOp] Optimize the backward speed of EinsumOp (#42663) · 71b046cd

由 xiongkun 提交于 5月 25, 2022

* change logic for optimize

* modifty

* optimize the backward speed of EinsumOp

* add cache optimizer for einsum op

* EinsumOp: fix new dygraph mode error

* fix bug

* change Cache->InnerCache

* fix code

* fix

* add nan inf utils for einsum op

* add as_extra

* Compatible with v2.3 EinsumOp

* remove dispensable

71b046cd

24 5月, 2022 2 次提交
- Y
  [Phi]Move grad_add op kernel into phi and delete elementwise_add_op file (#42903) · 4d7a9eef
  由 YuanRisheng 提交于 5月 24, 2022
```
* move grad_add

* fix unittest bugs

* fix compile bugs
```
  4d7a9eef
- F
  
  fix cmake command, rm -> remove (#42927) · de735a9a
  由 Feiyu Chan 提交于 5月 24, 2022
  
  de735a9a
23 5月, 2022 4 次提交
- Y
  Add double grad yaml for celu/sqrt/rsqrt/square op (#42895) · 0211a833
  由 YuanRisheng 提交于 5月 23, 2022
```
* add double grad yaml

* fix bugs when compile infrt
```
  0211a833
- Z
  [Phi] Remove Storage (#42872) · fa6b3c9a
  由 zyfncg 提交于 5月 23, 2022
```
* remove storage

* add glog include

* add glog include

* add glog include
```
  fa6b3c9a
- remove is_init_py of RandomGenerator, and use Global RandomGenerator by default (#42876) · 3b488bae
  由 zhouweiwei2014 提交于 5月 23, 2022
```
* remove is_init_py of RandomGenerator, and use Global Generator if not OP seed

* fix comment
```
  3b488bae
- S
  
  Fix a bug in BroadcastConfig for KP XPU2 rec model (#42866) · 106083aa
  由 shixingbo 提交于 5月 23, 2022
  
  106083aa
20 5月, 2022 2 次提交
- N
  
  Delete ElementwiseKernel in BroadcastKernel (#42779) · 0d878f1a
  由 niuliling123 提交于 5月 20, 2022
  
  0d878f1a
- L
  use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output (#42851) · f36a9464
  由 Leo Chen 提交于 5月 20, 2022
```
* use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output

* add flags to control compute type

* default to false

* add unit test

* default to true
```
  f36a9464

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致