提交 · 87388d59677cc94a8d4c528394b9212eb9e6448a · BaiXuePrincess / Paddle

21 11月, 2022 22 次提交
- V
  
  remove lrn which is not used in paddle 2.0 (#47945) · 87388d59
  由 Vvsmile 提交于 11月 21, 2022
  
  87388d59
- L
  mma qk tensor_core (#48087) · d79eda71
  由 lzy 提交于 11月 21, 2022
```
* use mma for QK dot computing in fused_multi_transformer.
* Update fused_multi_transformer_op.cu.h
```
  d79eda71
- W
  refine reduce_all (#48133) · 56f15c43
  由 wanghuancoder 提交于 11月 21, 2022
```
* refine reduce_all
```
  56f15c43
- J
  [Fluid Clean] remove apis in fluid.layers.ops (#47867) · 208f625b
  由 JYChen 提交于 11月 21, 2022
```
* remove apis in fluid.ops

* fix test_activation_nn_grad

* fix circle import error

* fix ops

* fix cos

* fix divide not inplace

* remove lazy-import part
```
  208f625b
- Z
  Fix wrong eigen header include in data_type.h (#48157) · 70589379
  由 zyfncg 提交于 11月 21, 2022
```
* Fix wrong eigen header include

* fix compile bug
```
  70589379
- 傅
  
  [fluid clean] remove fluid.layers.expand_as in nn.py under fluid (#47931) · 69eeaf03
  由傅剑寒提交于 11月 21, 2022
  
  69eeaf03
- V
  Remove API: crop (#47972) · d92daae2
  由 Vvsmile 提交于 11月 21, 2022
```
remove crop which is not used in Paddle 2.0
```
  d92daae2
- P
  [PHI decoupling] move "thread pool" from fluid to phi (#48075) · 3ca7328f
  由 PuQing 提交于 11月 21, 2022
```
* move threadpool

fix cmake

* fix make
```
  3ca7328f
- 傅
  
  （fluid清理）Remove filter by instag in nn.py under fluid (#47929) · 468f8815
  由傅剑寒提交于 11月 21, 2022
  
  468f8815
- T
  
  add adamw suppor xpu, test=kunlun (#48114) · 27e252d9
  由 taixiurong 提交于 11月 21, 2022
  
  27e252d9
- H
  
  add check_xpu_dependence.sh script. (#48154) · 394a7179
  由 houj04 提交于 11月 21, 2022
  
  394a7179
- 傅
  Remove fluid.layers.relu6 under fluid directory (#47876) · 5a45ceb2
  由傅剑寒提交于 11月 21, 2022
```
* remove relu6 test case under fluid

* fix relu6 test case in mkldnn_elt_act_fuse_pass
```
  5a45ceb2
- V
  Remove API: selu (#47969) · 1175a2b9
  由 Vvsmile 提交于 11月 21, 2022
```
replace paddle.fluid.layers.selu with paddle.nn.functional.selu
```
  1175a2b9
- V
  [Clean Fluid API]Remove API: gather (#47954) · 844ab6fe
  由 Vvsmile 提交于 11月 21, 2022
```
* Remove API: gather
	replace the paddle.fluid.layers.gather with paddle.gather

* modify the call of gather from old style to new style
```
  844ab6fe
- Update AUTHORS.md (#48177) · 1ba308f5
  由 engineer1109 提交于 11月 21, 2022
  
  1ba308f5
- W
  
  round (#48107) · b546438c
  由 wenbin 提交于 11月 21, 2022
  
  b546438c
- H
  [PHI decoupling] move cross_entropy from fluid to phi (#48160) · 3501ff7d
  由 huangjiyi 提交于 11月 21, 2022
```
* move cross_entropy from fluid to phi

* replace mutable_data with Alloc

* use .template
```
  3501ff7d
- W
  Unify `ProcessGroupNCCL` APIs underlying implementation (#48163) · 88410225
  由 Wen Sun 提交于 11月 21, 2022
```
* refactor: replace Collective & PointToPoint with NCCLEnv

* refactor: rename to RunFnInNCCLEnv

* refactor: pass std::function by value
```
  88410225
- L
  
  add new map instance (#48145) · 2a47416c
  由 LiYuRio 提交于 11月 21, 2022
  
  2a47416c
- L
  
  return pointer rather than reference (#48152) · 403d58bb
  由 LiYuRio 提交于 11月 21, 2022
  
  403d58bb
- P
  
  remove macros.h (#48069) · 02c51f3b
  由 PuQing 提交于 11月 21, 2022
  
  02c51f3b
- S
  
  add state_dict convert (#48161) · c00f0daf
  由 sneaxiy 提交于 11月 21, 2022
  
  c00f0daf
20 11月, 2022 1 次提交
- C
  remove range from fluid (#48086) · 5675c7d5
  由 ccrrong 提交于 11月 20, 2022
```
* remove range
```
  5675c7d5
19 11月, 2022 2 次提交
- W
  
  refactor: rm redundant funcs (#48149) · f38e09f0
  由 Wen Sun 提交于 11月 19, 2022
  
  f38e09f0
- A
  [CustomPlace] fix amp (#48090) · c775bc69
  由 Aganlengzi 提交于 11月 19, 2022
```
* [CustomPlace] fix amp

* [CustomPlace] fix amp

* fix ut because of too long time matmul fp16
```
  c775bc69
18 11月, 2022 15 次提交

W

refine save hook (#48124) · 04709310
由 wanghuancoder 提交于 11月 18, 2022

04709310

Fused QKVBiasAdd and Transpose with Split Q, KV (#47680) · d595928e

由 MarDino 提交于 11月 18, 2022

* fused qkvBiasAdd and transpose with split qkv

* fix typo

* fix format

* fix name

* add annotation

* fix comment

d595928e

Y
clear fluid apis: fix apis in fleet and passes (#48021) · e5408835
由 yuehuayingxueluo 提交于 11月 18, 2022
```
* clear fluid apis in fleet and passes

* fix model.py

* fix model.py

* fix cpp_pass.py
```
e5408835

[PHI] Migrate matmul_grad kernel (#48023) · 4ab18ada

由 Sławomir Siwek 提交于 11月 18, 2022

* cleanup unused code

* unify is_int8 is_bfloat16

* Simplify matmul_v2 FWD kernel

* remove RunKernel methods

* remove import namespace

* remove headers

* clean fluid/phi cross imports

* remove fluid axpy_handler

* delete fluid methods

* activations

* OneDNNMemDesc

* MKLDNNFormatForSize

* MatchShapeToLayout

* MKLDNNMemoryFormat

* MKLDNNFormat

* ReorderMKLDNNHandler

* to_void_cast

* review suggestions

* interpolate

* remove fluid depedency

* init

* ExecuteMatMulV2

* rm fluid kernel

* matmul_grad

* remove mutable_data

4ab18ada

V
Remove API: pad_constant_like (#47949) · 7073ed5b
由 Vvsmile 提交于 11月 18, 2022
```
remove pad_constant_like which is not used in paddle 2.0
```
7073ed5b

[PHI] Migrate conv_transpose kernel (#48119) · 9aacb31b

由 Zuza Gawrysiak 提交于 11月 18, 2022

* Migrate conv_transpose to phi

* Move handler to kernel

* kernel m

* Fix formatting

* handler

* remove fluid

* revert tcp_store

* tcp_store

* remove unused

* Fix declaration

* add dnn input

* Fix typo
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

9aacb31b

2

delete logical_xor api (#48070) · ec778272
由 201716010711 提交于 11月 18, 2022

ec778272
Z
Fix bug of zero_allocator in HostAlloc (#48108) · 7f92e27e
由 zyfncg 提交于 11月 18, 2022
```
* fix bug of zero_allocator in host

* fix test compile bug

* add unittest

* update test
```
7f92e27e
傅

(fluid清理）remove stack in nn.py under fluid (#47942) · 058aa381
由傅剑寒提交于 11月 18, 2022

058aa381

Optimize FusedBiasAddGelu Kernel (#47679) · b0e28540

由 MarDino 提交于 11月 18, 2022

* Add quick gelu and fused bias add kernel

* fix annotation

* remove useless code

* add fast gelu option and set it in multi transformer op

* add flag to restrict if use fast gelu approximate

* fix flags conflict

* fix use tanh function instead

* add cudart version limit

* use phi fast tanh func

* fix comment

b0e28540

[PHI decoupling] move "gpu_device_function.h" from fluid to phi (#48097) · 27ee6e71

由 huangjiyi 提交于 11月 18, 2022

* move "paddle/phi/backends/gpu/gpu_device_function.h" to phi

* update copyright years

* rm "fluid/platform/device/gpu/gpu_device_function.h" in phi

* fix rocm-complie bugs

27ee6e71

W

Refactor collective communication reduce, scatter, reduce_scatter C++ API (#48115) · edda13cd
由 Wen Sun 提交于 11月 18, 2022

edda13cd
Z
[AutoParallel] selective recompute (#48111) · d7f7963f
由 zhaoyingli 提交于 11月 18, 2022
```
* [AutoParallel] selective recompute

* add cmakelist
```
d7f7963f

correct sync behavior for XPU distributed training (#47882) · aafa9820

由 james 提交于 11月 18, 2022

* correct sync behavior for XPU distributed training

XPU support event mechanism similar to cuda event, so it is advisable to
use an event to sync compute/comm streams for performance. However this
mechanism is never fully tested, and inconsistent loss/ending_epochs are
reported. Therefore, this PR replaces event sync with stream waiting as
a temporary solution.

* remove compile warning

aafa9820

D

Add description to `nn.functional.celu` (#48074) · 1fb4d90b
由 Dandelight 提交于 11月 18, 2022

1fb4d90b

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致