提交 · 17318c1a1d1491ce2ce24e9c21ff1c4774f91f8b · PaddlePaddle / Paddle

09 2月, 2023 2 次提交

[PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a

由 Huang Jiyi 提交于 2月 09, 2023

* decouple strided_memcpy

* move strided_memcpy

* move strided_memcpy to phi

* fix namespace

* update

* fix gpu compile bugs

17318c1a

Add MultiTenosrAdam OP (#49220) · 10654c77

由 yuehuayingxueluo 提交于 2月 09, 2023

* add multi_tenosr_adam

* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py

* fix adam.py optimizer.py

* fix adamw.py

* fix test_multi_tensor_adam.py

* fix CI bug

* fix CI coverage

* fix ci bug

* fix betapow

* fix some bugs

* fix test_adamw_op.py

* fix CI coverage

* fix multi_tensor_adam_kernel.cc

* fix CI bug

* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py

* fix code style

* update C++ parts

* remove python parts modification temporarily

* add C++ ut

* update betapow copy code logic

* fix ci ut

* fix windows ci

* fix coverage ci

* improve coverage rate

---------
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

10654c77

15 12月, 2022 1 次提交
- H
  
  [PHI decoupling] move softmax from fluid to phi and remove cpu_vec.h in fluid (#48970) · 344b99e1
  由 huangjiyi 提交于 12月 15, 2022
  
  344b99e1
28 11月, 2022 1 次提交
- P
  
  add cpu_info.h (#48403) · 923ad5dc
  由 PuQing 提交于 11月 28, 2022
  
  923ad5dc
10 11月, 2022 1 次提交

[PHI Decoupling] remove dependency on "paddle/fluid/platform/errors.h" and... · 4c375454

由 huangjiyi 提交于 11月 10, 2022

[PHI Decoupling] remove dependency on "paddle/fluid/platform/errors.h" and "paddle/fluid/platform/fast_divmod.h" in phi. (#47815)

* rm "paddle/fluid/platform/errors.h" in phi

* rm "paddle/fluid/platform/fast_divmod.h" in phi

4c375454

08 11月, 2022 1 次提交
- C
  
  normalize autotune tests dir (#47726) · 6bab3343
  由 Chen Weihang 提交于 11月 08, 2022
  
  6bab3343
26 10月, 2022 1 次提交
- C
  
  clean useless api tests in phi (#47321) · c334405f
  由 Chen Weihang 提交于 10月 25, 2022
  
  c334405f
12 10月, 2022 1 次提交
- Z
  
  [Sparse] Rename and fix doc (#46853) · a9cc5482
  由 zhangkaihuo 提交于 10月 12, 2022
  
  a9cc5482
30 9月, 2022 1 次提交
- 六
  
  【Hackathon No.21】为 Paddle 新增 paddle.incubate.sparse.transpose 稀疏 API (#45849) · 2b879a69
  由六个骨头提交于 9月 30, 2022
  
  2b879a69
28 9月, 2022 1 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

19 9月, 2022 1 次提交

Performance fix for broadcast kernel [Part3] (#46071) · 46e4fb2a

由 limingshu 提交于 9月 19, 2022

* first commit

* refine code with template argument

* refine code with template argument

* add ternary broadcast test file

* add ternary broadcast test file

* fix accoriding to ci

* fix op-benchmark ci error

46e4fb2a

07 9月, 2022 1 次提交
- Z
  
  [Sparse]Rename sparse kernel (#45730) · 36739748
  由 zhangkaihuo 提交于 9月 07, 2022
  
  36739748
05 9月, 2022 1 次提交

[PHI] Move oneDNN helper classes to new location (#45626) · 269bd1fe

由 piotrekobi 提交于 9月 05, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Move more functions from mkldnn_helper.h to onednn_helpper.h

* Change MKLDNN to OneDNN in VLOG message
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>

269bd1fe

31 8月, 2022 1 次提交

Fix split api bug (#45396) · 4a25b60d

由 Charles-hit 提交于 8月 31, 2022

* fix split bug

* solve function redefine

* fix fluid.layers.split and add unit test

* delete splitInferMeta register in unary.cc

* modify test_split_op GPU unit test

* modify test_split_op GPU unit test place param

* refactor split op and fix infershape bugs

* add () in && and ||

* fix split C++ unit test

* fix split infershape

4a25b60d

30 8月, 2022 1 次提交
- W
  [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod (#45078) · 32f42e94
  由 WangZhen 提交于 8月 30, 2022
```
* [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod
```
  32f42e94
26 8月, 2022 1 次提交

Transfer transfer_layout from fluid to phi (#45261) · 985f2a4a

由 kangguangli 提交于 8月 26, 2022

* remove fluid kernel and activate phi kernel

* fix parameter error

* transfer mkldnn part

* modify header file path

* fix compile error

* transfer special case

* fix lod setting and special case for layout setting

* add testcase and refine code

985f2a4a

25 8月, 2022 1 次提交

Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a

由 kangguangli 提交于 8月 25, 2022

* transfer memcpy_d2h from fluid to phi

* refine arg check and add comment

* fix cannot fallback to phi kernel

* fix gpu_context host alloc when tensor size = 0

* add kernel for std::vector<DenseTensor> args

* fix bugs in MemcpyD2HMultiIOKernel

* remove useless header file

* polish format

* fix typo

* add testcase for cudapinned place

* refine check condition in test

* polish error message

* polish error message

* remove header in fluid  directory

* merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation

* fix code style check

0d14e74a

01 8月, 2022 2 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

W
infer context fix place error. (#44726) · 74e46a93
由 Wilber 提交于 8月 01, 2022
```
* infer context fix place error.

* update

* update
```
74e46a93

26 7月, 2022 1 次提交
- Z
  
  Optimize sparse convolution (#43576) · 9841b308
  由 zhangkaihuo 提交于 7月 26, 2022
  
  9841b308
19 7月, 2022 1 次提交
- Z
  
  Standard name of sparse pool (#44344) · 9e307229
  由 zhangkaihuo 提交于 7月 19, 2022
  
  9e307229
15 7月, 2022 1 次提交
- Z
  
  Standard sparse conv name (#44353) · 87443831
  由 zhangkaihuo 提交于 7月 15, 2022
  
  87443831
13 7月, 2022 1 次提交
- Z
  Add sparse.coalesce (#44256) · fd6b1a02
  由 zhangkaihuo 提交于 7月 13, 2022
```
* add sparse api coalesce
```
  fd6b1a02
12 7月, 2022 1 次提交
- [Sparse]add sparse unary api(sin/tan/pow/neg/log1p/square/cast...) (#44022) · 682acd22
  由 zhouweiwei2014 提交于 7月 12, 2022
  
  682acd22
02 7月, 2022 2 次提交

unify cpu context, part2 (#44012) · 755438a7

由 Leo Chen 提交于 7月 02, 2022

* fix init()

* delete test_device_context

* replace CPUDeviceContext with CPUContext

* fix test_scalar

* remove dot_op.cc

* fix compile

755438a7

unify cpu context (#43989) · 09096aeb

由 Leo Chen 提交于 7月 01, 2022

* unify cpu context

* fix init()

* delete test_device_context

* fix test_scalar

09096aeb

24 6月, 2022 1 次提交

[Phi]Change Copy from Kernel to basic component utils (#43622) · 2739bd73

由 YuanRisheng 提交于 6月 24, 2022

* perfect copy

* deal with conflict

* deal with conflict

* fix compile bugs

* fix unittest bugs

* change code format

* deal with conflict

* modify code by review

* fix ce bugs

* fix ce bugs

* add lo

* perfect code format

* deal with conflicts

2739bd73

23 6月, 2022 1 次提交
- M
  
  【Hackathon No.56 57 58 59】sparse elementwise add sub mul div (#41857) · e3d94fc5
  由 Matsumoto Ruko 提交于 6月 23, 2022
  
  e3d94fc5
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
04 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：cmake-format (#43057) · 92568edb
  由 Sing_chan 提交于 6月 04, 2022
  
  92568edb
12 5月, 2022 1 次提交
- T
  
  【Hackathon No.60】refactor unary sparse ops and add sparse sqrt, tanh, sin (#41356) · f1eda7d0
  由 tiancaishaonvjituizi 提交于 5月 12, 2022
  
  f1eda7d0
19 4月, 2022 1 次提交

[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from... · 2cb19d8f

由 YuanRisheng 提交于 4月 19, 2022

[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from ElementwiseKernel（Part1） (#41806)

* seperate add/div/sub/mul from elementwise

* delete code

* fix compile bugs

* deal with conflict

* fix bugs when compile

* fix windows unit test bug

* fix ci converage bugs

2cb19d8f

15 4月, 2022 1 次提交

[Phi]Reduce kernels into multiply files (#41747) · 1927aff9

由 chentianyu03 提交于 4月 15, 2022

* split reduce_kernel

* rm reduce_kernel in cmake

* split reduce_grad kernels

* fix cmake build error

* format code

* fix standalone_executor_test error

1927aff9

02 4月, 2022 2 次提交
- Z
  
  Sparse conv and pool support indices as template (#41137) · 5d3fd4fe
  由 zhangkaihuo 提交于 4月 02, 2022
  
  5d3fd4fe
- Z
  
  Fix sparse conv and verify sparse conv backward (#40961) · ad0c106c
  由 zhangkaihuo 提交于 4月 02, 2022
  
  ad0c106c
01 4月, 2022 2 次提交

[Eager] Support pinned (#41035) · f3270fc8

由 wanghuancoder 提交于 4月 01, 2022

* support pinned, test=develop

* support async_write, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine,test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

f3270fc8

Z

Add Sparse Op: copy_sparse_coo and copy_sparse_csr (#41193) · 3a29e4f8
由 zhangkaihuo 提交于 4月 01, 2022

3a29e4f8

31 3月, 2022 1 次提交
- Z
  
  Opt the compilation of sparse kernel (#41086) · b9da48da
  由 zhangkaihuo 提交于 3月 31, 2022
  
  b9da48da
29 3月, 2022 1 次提交
- Z
  
  Add Sparse op sparse_relu (#40959) · c544a181
  由 zhangkaihuo 提交于 3月 29, 2022
  
  c544a181
27 3月, 2022 1 次提交

Add StringTensor (#39830) · 0695e1ac

由 Jack Zhou 提交于 3月 27, 2022

* add string tensor and case convert kernels

* Add strings empty kernel; Reorganize the structure of case convert kernel

* Add string infermeta

* Update mutable_data of string tensor

* rename kernel name

* add string copy tmp

* Fix strings copy device bug

* add utf8 gpu converter

* add string tensor c++ api

* Remove mutable_data of string tensor

* update string tensor interface

* remove charcases_flag.h

* remove some fluid headers

* Add make_ddim

* __HIPCC__ -> PADDLE_WITH_HIP

* remove fluid headers

* fix cpu compile

* remove std::hash

* Fix cudaMalloc

* Remove strings/impl directory

* Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps

* Add empty kernel test

* Remove some comments

* Modify lower/upper api encoding type: string->bool

* STRING->PSTRING; Add CreateInferLikeMeta

* Add code gen for C++ String API

* remove strings_api_utils.h

* Add ignore file (strings_api.h, strings_api.cc)

* update strings gen script

* change args order of case convert kernels

* Add comments for pstring, StringTensor

* cpstring_internal.h -> cpstring_impl.h

* Update accordding to comments:

1. Remove fluid headers
2. paddle::platform::errors -> phi::errors
3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
4. Use camel code style

* Remove all singletons in strings kernels

* fix rocm compile

* Fix py3 compile

* Fix c++ coverage

* 1. Add pstring proto type
2. Add StringTensor debug info
3. Rename case_convert_kernel to strings_lower_upper
4. Remove serialize derialize strings kernel

* DataLayout::PSTRING -> DataLayout::PSTRING_UNION

* Register pstring data type

* Fix strings api gen

* Fix dense tensor register pstring dtype

* Fix error messages

* remove line

* add pstring unittest

* remove test string api unitest

* remove empty line

* Remove some headers to decrease the size of executable file

0695e1ac

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功