Commits · 5bac67d4916a5191f5e8a75a53f90dc91bd6f35a · PaddlePaddle / Paddle

04 Apr, 2023 1 commit

Improve new executor static build (#51149) · 5bac67d4

Committed by Ruibiao Chen on Apr 04, 2023

* Improve new executor static build

* Skip GC for static build

* Skip infershape for static build

* Handle read_op

* Add fused_attention to OpsWithFluidKernelNeedMoveToPhi

* Fix argsort typos

* Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi

* Fix skip share lod errors

* Fix errors for adam

* Fix errors for eigvals, memcpy and fake_quantize

* Add static_build.cc

* Add black list

* Fix CI errors

* Fix CI errors

* Fix CI errors

* Fix TensorArray

* Fix TensorArray

* Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel

* Fix copy

* Fix errors

* Fix momentum

* Skip mkldnn

* Fix CI errors

* Fix c_sync_calc_stream_op

* Fix CINN

* Fix while op

* All CI pass, disable FLAGS to merge code, enable it after more tests in future

* Add UTs

* Fix typos

* Fix typos

* Add mkldnn UT

* Remove mkldnn test

* Fix typos

* Fix dist test

* Fix typos

* Fix CI errors

* Fix CI errors

* Add UTs

* Fix typos

* Fix typos

* Add sparse tests

* ToComplexType -> ToComplex

* Add test_matmul_op_static_build to disable_win_inference_test

03 Mar, 2023 1 commit

[PHI Decoupling]Remove memory header (Part2) (#50870) · 558068cc

Committed by YuanRisheng on Mar 03, 2023

* decouple memory copy

* fix ci bugs

* fix ci compile bugs

* fix rocm compile

* fix ci bugs

05 Dec, 2022 1 commit

Replace mutable_data with DeviceContext.Alloc in phi kernels (#48500) · 34a957e3

Committed by Ruibiao Chen on Dec 05, 2022

* Replace mutable_data with DeviceContext.Alloc in phi kernels

* Fix CI errors

* Fix CI errors

* Fix CI errors, test=kunlun

* Fix CI errors, test=kunlun

* Handle rnn_functor

* Update approvals

06 Sep, 2022 1 commit

[PHI]Add TensorArray for PHI (#45479) · 68f99b78

Committed by YuanRisheng on Sep 06, 2022

* add tensor array

* fix ci bugs

* fix ci bugs

* fix ci bugs

* fix ci bugs

* update by comment

* update code

05 Sep, 2022 1 commit

[phi] Migrate memcpy kernel to PHI, hold NPU op (#45622) · 2f19a364

Committed by HongyuJia on Sep 05, 2022

* migrate memcpy to phi

* fix typo error

* fix typo error

* fix bug and testcase

* fix typo, uniform_random_kernel.cc header

* fix Alloc pinned bug

* change GPUContext::GetPinnedPlace

* add GetPinnedPlace function

* add GetPinnedPlace function

* restore default throw error

* fix Unimplemented error

* skip StandaloneExecutor testcase

* delete memcpy_sig

30 Aug, 2022 1 commit

fix memcpy_h2d bug related to cuda stream setting when allocate memory (#45450) · 10abdb8f

Committed by kangguangli on Aug 30, 2022

* fix memcpy_h2d bug related to cuda stream setting when allocate memory

* add header file

* fix compile error for cpu only

25 Aug, 2022 1 commit

Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a

Committed by kangguangli on Aug 25, 2022

* transfer memcpy_d2h from fluid to phi

* refine arg check and add comment

* fix cannot fallback to phi kernel

* fix gpu_context host alloc when tensor size = 0

* add kernel for std::vector<DenseTensor> args

* fix bugs in MemcpyD2HMultiIOKernel

* remove useless header file

* polish format

* fix typo

* add testcase for cudapinned place

* refine check condition in test

* polish error message

* polish error message

* remove header in fluid directory

* merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation

* fix code style check

PaddlePaddle / Paddle synced successfully about 2 years ago