Commits · c7d6d9f4659d206dc52d2992c24c5070d52431fe · PaddlePaddle / Paddle

09 Dec, 2022 4 commits
- move ops_extra_info_gen.py from phi to fluid (#48926) · c7d6d9f4
  Committed by huangjiyi on Dec 09, 2022
  
  c7d6d9f4
- Support static graph code-gen for scalar and int_array (#48792) · 58f08924
  Committed by zyfncg on Dec 09, 2022
```
* add support_tensor for code_gen to static graph

* support code-gen for int_array

* polish code

* fix bug of data_type
```
  58f08924
- move share_buffer kernel to phi (#48858) · c2e77ba3
  Committed by Leo Chen on Dec 09, 2022
```
* move share_buffer kernel to phi

* fix ut

* add source file

* fix window links
```
  c2e77ba3
- [PHI decoupling] move "flags.h" from fluid to phi (#48696) · 39ffef0d
  Committed by PuQing on Dec 09, 2022
  
  39ffef0d
08 Dec, 2022 1 commit
- first commit (#38143) · 2e7c172c
  Committed by limingshu on Dec 08, 2022
  
  2e7c172c
07 Dec, 2022 3 commits
- [PHI] Migrate squeeze and squeeze_grad kernels (#48634) · ad41fce8
  Committed by Sławomir Siwek on Dec 07, 2022
```
* squeeze kernel

* squeeze fwd

* whitespace
```
  ad41fce8
- [phi::DenseTensor] Replace Tensor with phi::DenseTensor (#48682) · 65420271
  Committed by 张春乔 on Dec 07, 2022
  
  65420271
- modify d2d copy to xpu::copy in xpu kernel, test=kunlun (#48710) · 0d8ddf9f
  Committed by zhangyikun02 on Dec 07, 2022
  
  0d8ddf9f
06 Dec, 2022 2 commits

Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38

Committed by zyfncg on Dec 06, 2022

* delete Bias and ResidualData in OpMaker of conv2d

* delete extra input of conv3d

* refactor pass of conv_bias_fusion

* fix mkldnn dependency

* fix mkldnn compile

* fix test_conv_bias_mkldnn_fuse_pass

* polish some code

* remove useless log

* fix analyzer_vit_ocr_tester

* fix conv_activation_mkldnn_fuse_pass

* fix test_analyzer_ocr

* add fused_conv_sig

* fix performance regression

* fix performance regression

0a2dfa38

[PHI] Migrate elementwise_(add/mul) kernels (#48625) · 7575d37c
Committed by Sławomir Siwek on Dec 06, 2022
```
* remove fluid code

* init

* typo

* fix merge conflicts
```
7575d37c

05 Dec, 2022 8 commits
- Transpose optimization for AlphaFold2 (#45230) · a0f43889
  Committed by limingshu on Dec 05, 2022
```
* first commit

* fix bugs according to ci

* add some changes

* change file name into function.cu.h

* remove const_cast
```
  a0f43889
- support nhwc in conv2d_fusion (#48642) · 30f4ef7f
  Committed by zhoutianzi666 on Dec 05, 2022
  
  30f4ef7f
- move device_memory_aligment from fluid to phi (#48694) · 796499fd
  Committed by huangjiyi on Dec 05, 2022
  
  796499fd
- fix bug in paddle/phi/api/yaml/generator (#48659) · 595338c6
  Committed by 六个骨头 on Dec 05, 2022
```
* fix bug

* fix bugs in api_gen tools
```
  595338c6
- Replace mutable_data with DeviceContext.Alloc in phi kernels (#48500) · 34a957e3
  Committed by Ruibiao Chen on Dec 05, 2022
```
* Replace mutable_data with DeviceContext.Alloc in phi kernels

* Fix CI errors

* Fix CI errors

* Fix CI errors, test=kunlun

* Fix CI errors, test=kunlun

* Handle rnn_functor

* Update approvals
```
  34a957e3
- Generate static graph code of some ops by yaml (#48698) · 97aa938f
  Committed by HappyHeavyRain on Dec 05, 2022
```
* generate static graph code of some ops by yaml, test = develop

* generate static graph code of some ops by yaml, test = develop
```
  97aa938f
- [PHI decoupling] migrate poly_util.h to phi (#48499) · d6aa0d43
  Committed by Netpunk on Dec 05, 2022
```
* rm poly_util.h

* format code

* fix some problems

* format code
```
  d6aa0d43
- DenseTensor (#48419) · 6cdaa371
  Committed by 柠檬味~ on Dec 05, 2022
  
  6cdaa371
02 Dec, 2022 3 commits

[PHI] Migrate elementwise_sub kernel (#48611) · 493825a5
Committed by Piotr Paturej on Dec 02, 2022
```
* Add migrations

* Fix build errors

* Remove elementwise_mul from migration
```
493825a5

Migrate mul_mkldnn_op to phi matmul_kernel (#48299) · e8edbb09

Committed by Hulek on Dec 02, 2022

* Migrate mul_mkldnn_op to matmul_kernel

* Review fixes - changed mutable_data, changed ctx to dev_ctx, fixed namespaces

* switched some funcs to phi

* Deleted not needed phi:: and changed place checking according to standards

e8edbb09

fix boardcasting superlink (#48434) · c34812ac

Committed by Infinity_lee on Dec 02, 2022

* fix boardcasting superlink

* Update bitwise_op.cc

* fix typo errors(from 48186)

* Update python/paddle/distribution/uniform.py
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>

* Update math.py

* Update math.py

* refix

* Update logic.py

* BaseTransform api doc; test=docs_preview

* Update python/paddle/vision/transforms/transforms.py

* for text block; test=docs_preview

* Update transforms.py
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>

c34812ac

01 Dec, 2022 2 commits
- fuse-mt passes compatible with structured pruning (#48585) · a365024c
  Committed by minghaoBD on Dec 01, 2022
```
* fuse-mt passes compatible with structured pruning
```
  a365024c
- [Fix Type] Fix typo error (#48391) · 47e7b7a5
  Committed by HongyuJia on Dec 01, 2022
```
* fix typo error

* pass CI-coverage
```
  47e7b7a5
30 Nov, 2022 8 commits
- Fix error log for yaml check (#48126) · f62b3fc8
  Committed by zyfncg on Nov 30, 2022
```
* fix error log for yaml check

* remove grad_op of increment
```
  f62b3fc8
- [PHI decoupling] migrate transpose_op.cu.h and gpu_utils.h to phi (#48286) · 8a9bef70
  Committed by Netpunk on Nov 30, 2022
```
* migrate transpose_op.cu.h and gpu_utils.h

* format code style

* fix some problems

* format code

* reset transpose_op.cc

* test commit

* recover transpose_op.h

* delete transpose_op.h

* adjust header files order in transpose_op.cc
```
  8a9bef70
- [Perf]Fix interploate OutSize data transform problem (#48498) · 0b2a66bb
  Committed by Aurelius84 on Nov 30, 2022
```
* [Perf]Fix interploate OutSize data transform problem

* fix code style

* fix grad

* fix phi kernel
```
  0b2a66bb
- Support more activation in fused multi transformer (#48371) · 8a717a3e
  Committed by MarDino on Nov 30, 2022
```
* add activation support
* fix cublasLt bug
* remove useless code and fix test random range
```
  8a717a3e
- Add fuse_act_add_grad_pass (#48346) · ca552933
  Committed by zhangbo9674 on Nov 30, 2022
```
* add fuse act add grad pass

* polish code

* refine code

* add test

* refine code
```
  ca552933
- Fix the name map of operator from Phi to fluid (#48496) · e337d280
  Committed by zyfncg on Nov 30, 2022
```
* rename some kernel name

* fix compile problem
```
  e337d280
- Add int8 support in fused_multi_transformer_pass and fuse_multi_transformer_layer_pass (#48209) · 12486712
  Committed by RichardWooSJTU on Nov 30, 2022
```
* delete unnecessary shape and slice op
Co-authored-by: Your Name <you@example.com>
```
  12486712
- use correct xpu stream for synchronization (#48470) · 16562a9d
  Committed by james on Nov 30, 2022
```
Some legacy code still uses xpu_wait() for stream sync, but it only syncs
the default stream. This PR replaces those calls with dev_ctx.Wait() to
ensure that the correct stream is always used.
```
  16562a9d
29 Nov, 2022 9 commits

fix mma_tensorcore (#48386) · bf4d1792

Committed by lzy on Nov 29, 2022

* fix mma_tensorcore (__CUDA_ARCH__)

* disable tensorcore by default.

Disable tensorcore by default, because the check on __CUDA_ARCH__ will cause undefined behavior in some environments; it can be enabled manually on a machine that supports tensorcore.

bf4d1792

[PHI] traspose2 kernel migration (#47748) · d86aa4ca

Committed by Paulina Gacek on Nov 29, 2022

* transpose2 kernel migrated

* Got rid of mutable_data

* x modification added

* ops added in extra info file

* Formatting fix

* 2 fuse passes with transpose2 commented

* nr of outs changed in 2 passes, passes uncommented

* Changes in passes reverted

* transpose changed in operator.cc

* MKLDNN check in operator.cc

* Transpose fixes

* Fix deleted from operato

* template corrected
Co-authored-by: Paulina Gacek <paulinagacek@intel.com>

d86aa4ca

Replace LoDTensor with phi::DenseTensor in fluid\operators (#48417) · 91dd8a2e

Committed by 张春乔 on Nov 29, 2022

* replace LoDTensor with phi::DenseTensor in fluid\operators

* replace LoDTensor with phi::DenseTensor in fluid\operators

* Update split_lod_tensor_op.cc

* Update warpctc_op.cc

* Update broadcast_tensors_op.cc

* Update crf_decoding_op.cc

* Update lstm_op.cc

* Update lstm_op.cc

* Update lod_reset_op.cc

* Update gru_op.cc

* Update linear_chain_crf_op.cc

* resume 2 files for conflict

* Update gru_op.cc

* Update linear_chain_crf_op.cc

* Update lstm_op.cc

91dd8a2e

[CodeStyle][isort] introduce isort (part4) (#48402) · f85def97
Committed by Nyakku Shigure on Nov 29, 2022
```
* isort all files

* revert conflicting files

* revert conflicting files

* revert conflicting files
```
f85def97

eltwise_div + scale [PHI] (#48484) · fa10524d
Committed by Sławomir Siwek on Nov 29, 2022

fa10524d

[PHI] Migrate matmul kernel (#48162) · f41ccbd5

Committed by Sławomir Siwek on Nov 29, 2022

* cleanup unused code

* unify is_int8 is_bfloat16

* Simplify matmul_v2 FWD kernel

* remove RunKernel methods

* remove import namespace

* remove headers

* clean fluid/phi cross imports

* remove fluid axpy_handler

* delete fluid methods

* activations

* OneDNNMemDesc

* MKLDNNFormatForSize

* MatchShapeToLayout

* MKLDNNMemoryFormat

* MKLDNNFormat

* ReorderMKLDNNHandler

* to_void_cast

* review suggestions

* interpolate

* remove fluid dependency

* init

* ExecuteMatMulV2

* rm fluid kernel

* matmul_grad

* remove mutable_data

* mul_grad

* matmul fwd

* add extra attr

* temp disable passes

* re-enable passes

* workaround for matmul+act

* fix for matmul+eltwise_add

* fix typo

* merge bugfix #48364

* remove merge conflict

f41ccbd5

[Control Flow] replace executor in while op with InterpreterCore (#47573) · 6dbfbfa5

Committed by kangguangli on Nov 29, 2022

* fix:add no support for cuda_arch<700

* replace Executor in while op with InterpreterCore

* cache InterpreterCore as the member of WhileOp

* fix bug: tensor place changed because of assign op in while loop

* refine code

* refine code

* refine code

* hot fix

* fix compile

* merge develop

* follow comments

* add log for test

* remove LoDTensor

* set flag control_flow_use_new_executor false
Co-authored-by: fengshuai <fengshuai03@baidu.com>
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>

6dbfbfa5

Bugfix for Collective default calc stream (#48308) · a66bb67a
Committed by JZ-LIANG on Nov 29, 2022
```
* get default calc stream from execution ctx instead of global dev ctx pool.
```
a66bb67a

[Fluid API]Remove multiple APIs in control_flow (#48279) · c0d31dac

Committed by LiYuRio on Nov 29, 2022

* remove lod_tensor_to_array, array_to_lod_tensor, DynamicRNN

* remove less_equal, greater_than, greater_equal, equal, not_equal

c0d31dac

PaddlePaddle / Paddle synced successfully about 1 year ago
