提交 · 0d46a1085f58d20e8b9d3693172d5739e73cc08d · BaiXuePrincess / Paddle

17 1月, 2022 1 次提交

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

由 Wilber 提交于 1月 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

c48a9ad5

20 12月, 2021 1 次提交
- F
  
  [MLU]add mlu backend (#38207) · 76514a1f
  由 fwenguang 提交于 12月 20, 2021
  
  76514a1f
09 12月, 2021 1 次提交
- J
  
  add ipu device p2 (#37840) · cb636a48
  由 jianghaicheng 提交于 12月 09, 2021
  
  cb636a48
27 11月, 2021 1 次提交

[NPU] reorganization for device API abstraction (#37110) · 72241a6a

由 Aganlengzi 提交于 11月 27, 2021

* [NPU] reorganization for device API abstraction

* [NPU] delete old files

* [NPU] fix npu_collective_helper

* [NPU] fix collective_helper

* [NPU] fix ut

* [NPU] mod memory allocation and hccl_helper

* [NPU] fix place_type

* [NPU] split enfoce.h

* move acl* call into npu_info

* merge conflict

* fix merge

* merge conflict

* merge conflict

72241a6a

12 5月, 2021 1 次提交
- L
  
  [NPU] Support npu pinned allocator and manage Tensor on NPUPinnedPlace (#32840) · 6b3bb796
  由 liym27 提交于 5月 12, 2021
  
  6b3bb796
09 4月, 2021 1 次提交

[NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d

由 Leo Chen 提交于 4月 09, 2021

* [feature] support npu allocator (#30840)

[feature] support npu allocator

* [feature] support npu operator (#30951)

[feature] support npu operator

* [feature] support npu allocator, part 2 (#30972)

* support npu allocator

* add npu device context

* fix some compile problem

* fix some compile problem

* add npu info

* compile ok

* fix include dir

* support naive_best_fit_allocator

* run ut ok, bug failed to exit

* call aclrtResetDevice before exit

* fix aclFinilize

* add system allocatot test

* add selected_gpus in gtest

* add tensor_test for npu

* support npu op, initial commit

* add npu stream

* add elementwise_add_op

* compile ok

* fix typo

* fix elementwise_add_op_npu_test

* support op run

* test can run but failed

* change aclopExecuteV2 to aclopCompileAndExecute

* support parsing ascend rank table file (#31000)

support parsing ascend rank table file

* Fix reshape on GE graph. (#31084)

Fix reshape on GE graph

* add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)

* add npu sub op

* fix typo

* rename test

* fix bug

* fix bug

* add fp16 kernel

* fix typo

* support sub grad op

* support elementwise_sub_grad op
Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>

* Fix compilation problem (#31100)

Fix compilation problem (#31100)

* fix compile

* fix code stype

* remove const_cast

* support adding correct npu op in pybind.h (#31143)

* support adding correct npu op in pybind.h

* refine code

* [NPU] Support executor with NPU (#31057)

* [NPU] Support executor with NPU

* Fix code according to reviews

* Fix code

* Add unittest for sub op npu

* refactor npu device manager (#31154)

refactor npu device manager (#31154)

* fix selected npus

* fix compile

* fix reading flags from env

* format
Co-authored-by: Nxiayanming <41795079@qq.com>
Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>

ccf5709d

07 2月, 2021 1 次提交
- Q
  
  [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) · 34f1628c
  由 Qi Li 提交于 2月 07, 2021
  
  34f1628c
21 8月, 2020 1 次提交

support Baidu Kunlun AI Accelerator (#25959) · 138ecf24

由 QingshuChen 提交于 8月 21, 2020

* support Baidu AI Accelerator
  * test=kunlun

* minor
 * test=kunlun

* support xpu op in separate file
 * test=kunlun

* update XPU error message and remove duplicated code

 * test=kunlun

* minor
 * test=kunlun

* minor
 * test=kunlun

138ecf24

08 4月, 2020 1 次提交
- Z
  
  API(place-related) error message enhancement (#23515) · 480530c4
  由 Zhang Ting 提交于 4月 08, 2020
  
  480530c4
12 11月, 2019 1 次提交
- C
  Further simplify the C++ error info stack (#21093) · edd6680a
  由 Chen Weihang 提交于 11月 12, 2019
```
* simplify C++ error stack by rewrite Place, test=develop

* polish assignment overload func, test=develop
```
  edd6680a
10 7月, 2019 1 次提交
- Z
  Clean unused code of dim and place (#18565) · be24e5b3
  由 Zeng Jinle 提交于 7月 10, 2019
```
* clean code of dim and place, test=develop

* fix failed unittests, test=develop
```
  be24e5b3
23 10月, 2018 1 次提交
- Y
  Remove place hash · 1d4d4e73
  由 Yu Yang 提交于 10月 22, 2018
```
test=develop
```
  1d4d4e73
18 10月, 2018 1 次提交
- S
  
  add unittest for allocator_facade.cc · 21fdf8e8
  由 sneaxiy 提交于 10月 18, 2018
  
  21fdf8e8
03 7月, 2018 1 次提交
- Y
  
  Use std::map for Place <--> DeviceContext · 2d0e5592
  由 yuyang18 提交于 7月 03, 2018
  
  2d0e5592
07 4月, 2018 1 次提交
- Y
  
  Fix cpplint errors paddle/fluid/platform/place.* (#9711) · 55ffceaa
  由 Yi Wang 提交于 4月 07, 2018
  
  55ffceaa
27 3月, 2018 1 次提交
- C
  
  Add CUDAPinnedPlace · ab601c19
  由 chengduoZH 提交于 3月 26, 2018
  
  ab601c19
26 3月, 2018 1 次提交
- C
  
  add CUDAPinnedPlace · 18eb7730
  由 chengduoZH 提交于 3月 26, 2018
  
  18eb7730
22 3月, 2018 1 次提交
- Y
  
  Enhance device context pool (#9293) · 1d8fe2a2
  由 Yu Yang 提交于 3月 22, 2018
  
  1d8fe2a2
21 3月, 2018 1 次提交
- Y
  
  Add dctor for dev_ctx · 5c333e41
  由 Yu Yang 提交于 3月 21, 2018
  
  5c333e41
14 3月, 2018 1 次提交
- Y
  
  ParallelExecutor And dependency engine · baef1124
  由 Yu Yang 提交于 3月 14, 2018
  
  baef1124
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
08 1月, 2018 2 次提交

Y

Refine get_places · 63ff0b4b
由 Yang Yu 提交于 1月 08, 2018

63ff0b4b

cpu gpu transform function (#7191) · 0f353ab4

由 Qiao Longfei 提交于 1月 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean

0f353ab4

27 12月, 2017 1 次提交
- Y
  
  Add API for HasNAN HasInf · 15309fde
  由 Yang Yu 提交于 12月 27, 2017
  
  15309fde
25 12月, 2017 2 次提交
- Q
  remove unused place (#6972) · efd37269
  由 QI JUN 提交于 12月 25, 2017
```
* remove unused place

* fix ci
```
  efd37269
- D
  
  GPUPlace to CUDAPlace (#6960) · 0d2235aa
  由 dzhwinter 提交于 12月 25, 2017
  
  0d2235aa
24 12月, 2017 2 次提交

Q
refine OpKernelType (#6879) · 37e96264
由 QI JUN 提交于 12月 24, 2017
```
* refine OpKernelKey

* refine codes

* fix code style

* follow comments
```
37e96264

Feature/operator run place (#6783) · 735eba29

由 dzhwinter 提交于 12月 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

735eba29

18 12月, 2017 1 次提交
- Q
  add more place test and rename Cudnn to CUDNN (#6621) · 93a2d9c5
  由 QI JUN 提交于 12月 18, 2017
```
* add more place_test and rename Cudnn to CUDNN

* fix ci
```
  93a2d9c5
15 12月, 2017 2 次提交
- T
  
  fix conflict of Place · a92f057e
  由 tensor-tang 提交于 12月 15, 2017
  
  a92f057e
- T
  
  fix undefined issue when with_gpu · f2712105
  由 tensor-tang 提交于 12月 15, 2017
  
  f2712105
14 12月, 2017 2 次提交
- T
  
  add MKLDNNPlace · e0c33176
  由 tensor-tang 提交于 12月 14, 2017
  
  e0c33176
- D
  "derived cudnnDevice context" (#6585) · 0e9b393b
  由 dzhwinter 提交于 12月 14, 2017
```
* "derived cudnnDevice context"

* "leave remove cudnn handle from CUDADeviceContext"

* "fix math function error"
```
  0e9b393b
24 10月, 2017 1 次提交

Feature/nccl dso (#5001) · 43c6ff21

由 Yu Yang 提交于 10月 23, 2017

* "add nccl enforce"

* Dev

* Update comment

* Add nccl test

* Follow comments

43c6ff21

14 10月, 2017 1 次提交
- D
  
  "nccl add interface" · d1443104
  由 Dong Zhihong 提交于 10月 13, 2017
  
  d1443104
29 9月, 2017 1 次提交
- Y
  
  Follow comments · f2feb333
  由 Yu Yang 提交于 9月 28, 2017
  
  f2feb333
10 8月, 2017 1 次提交
- Y
  Fix the bug between nvcc and boost · 2df628af
  由 Yu Yang 提交于 8月 10, 2017
```
Fix #3386
```
  2df628af
05 8月, 2017 1 次提交
- Y
  
  Add explicit to some constructors · a40b755b
  由 Yi Wang 提交于 8月 04, 2017
  
  a40b755b

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致