提交 · 23290929238d86a8a01590495fac0f1888ad0be6 · 机器未来 / Paddle

09 6月, 2021 1 次提交
- L
  
  Check the installed openblas version in cmake (#33440) · 23290929
  由 Leo Chen 提交于 6月 09, 2021
  
  23290929
01 6月, 2021 1 次提交
- Z
  
  Fix duplicate download when incremental compilation (#33230) · e939236e
  由 Zhou Wei 提交于 6月 01, 2021
  
  e939236e
27 5月, 2021 2 次提交

[PsCore] support ssd (#33031) · 988b5fe1

由 Thunderbrook 提交于 5月 27, 2021

* support ssd in PsCore

* remove log

* remove bz2

* defalut value

* code style

* parse table class

* code style

* add define

988b5fe1

Z
Unify all external API error message mechanism and enhance third-party API error msg (#33003) · b425215a
由 Zhou Wei 提交于 5月 27, 2021
```
* Unify all external API error message mechanism and enhance third-party API error msg

* fix some comment

* fix some comment
```
b425215a

10 5月, 2021 1 次提交
- T
  [pslib] pslib with cmake (#32800) · fbbc3394
  由 Thunderbrook 提交于 5月 10, 2021
```
* pslib with cmake

* heter util

* vlog

* heter server test

* add dtor

* cmake
```
  fbbc3394
21 4月, 2021 1 次提交

【NPU】Merge NPU ccl code (#32381) · c3158527

由 zhang wenhui 提交于 4月 21, 2021

* add allreduce and broadcast without test (#31024)

add allreduce and broadcast without test

* Refactor HCCLCommContext to be compatible with Paddle (#31359)

Refactor HCCLCommContext to be compatible with Paddle (#31359)

* [NPU] add npu kernel for communication op (#31437)

* add allreduce and broadcast without test

* add c_broadcast_test case

* build c_comm_init and c_create_group operators

* make the whole thing compile

* add broadcast and init op test case but run failed

* make unit test compile

* fix broadcast test bug and change into hcom for ccl

* change c_comm_init and c_create_group ops accordingly

* make tests compile

* transfer code to 27

* compiled successfully in 28, but run failed

* test broadcast in 28, but failed

* make hcom primitives work

* change hccl data type for base.h

* fix broadcast bug

* make attributes work

* fix group name bug

* add allreduce but test failed

* allreduce bug for qiuliang

* allreduce finished

* add allgather and reducescatter

* merge all op code

* add allgather test

* finish run all ccl op test exclude send/recv

* all all op and test exclude send/recv

* send_v2_npu.cc recv_v2_npiu.cc compiled

* fix ccl core dump bug and test allgather, reducescatter, broadcast op

* fix allreduce bug just for test

* hcom send&recv test pass, without hcom_destroy

* for qiuliang test

* Ascend Send&Recv Test Pass

* all op (ex send/recv) ok

* fix bug

* merge all ccl op

* style merge to PaddlePaddle

* merge style

* new merge style

* merge style 2

* insert an empty at the end

* disable ctest for hcom to pass ci
Co-authored-by: Nvoid-main <voidmain1313113@gmail.com>
Co-authored-by: Nf2hkop <f2huestc@outlook.com>

* Add auto-increasing tag id for Hcom OPs (#31702)

* add c_reduce_sum op (#31793)

add c_reduce_sum op

* update Ascendrc hccl to 20.3 (#32126)

update Ascendrc hccl to 20.3 (#32126)

* fix merge code

* change cmake.txt1

* [NPU] Support npu kernel for c sync stream op (#31386)

* sync stream npu op

* add with_ascend_acl

* update c++ unittest

* compile all failed

* try to pre commit

* after pre commit

* merge&compile&test hccl successfully!

* fix code style

* fix code style

* fix bugs about hccl

* fix some bugs

* fix code style

* fix style

* fix style

* fix

* fixed

* merge develop
Co-authored-by: Nlw921014 <liuwei921014@yeah.net>
Co-authored-by: NVoid Main <voidmain1313113@gmail.com>
Co-authored-by: Nf2hkop <f2huestc@outlook.com>
Co-authored-by: Nxiayanming <41795079@qq.com>

c3158527

09 4月, 2021 1 次提交

[NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d

由 Leo Chen 提交于 4月 09, 2021

* [feature] support npu allocator (#30840)

[feature] support npu allocator

* [feature] support npu operator (#30951)

[feature] support npu operator

* [feature] support npu allocator, part 2 (#30972)

* support npu allocator

* add npu device context

* fix some compile problem

* fix some compile problem

* add npu info

* compile ok

* fix include dir

* support naive_best_fit_allocator

* run ut ok, bug failed to exit

* call aclrtResetDevice before exit

* fix aclFinilize

* add system allocatot test

* add selected_gpus in gtest

* add tensor_test for npu

* support npu op, initial commit

* add npu stream

* add elementwise_add_op

* compile ok

* fix typo

* fix elementwise_add_op_npu_test

* support op run

* test can run but failed

* change aclopExecuteV2 to aclopCompileAndExecute

* support parsing ascend rank table file (#31000)

support parsing ascend rank table file

* Fix reshape on GE graph. (#31084)

Fix reshape on GE graph

* add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)

* add npu sub op

* fix typo

* rename test

* fix bug

* fix bug

* add fp16 kernel

* fix typo

* support sub grad op

* support elementwise_sub_grad op
Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>

* Fix compilation problem (#31100)

Fix compilation problem (#31100)

* fix compile

* fix code stype

* remove const_cast

* support adding correct npu op in pybind.h (#31143)

* support adding correct npu op in pybind.h

* refine code

* [NPU] Support executor with NPU (#31057)

* [NPU] Support executor with NPU

* Fix code according to reviews

* Fix code

* Add unittest for sub op npu

* refactor npu device manager (#31154)

refactor npu device manager (#31154)

* fix selected npus

* fix compile

* fix reading flags from env

* format
Co-authored-by: Nxiayanming <41795079@qq.com>
Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>

ccf5709d

04 3月, 2021 1 次提交
- W
  
  Windows system supports Ninja compilation (#31161) · 4d6d2db8
  由 wuhuanzhou 提交于 3月 04, 2021
  
  4d6d2db8
01 3月, 2021 1 次提交
- W
  
  Fix xpu compile and cipher symbol problem. (#31271) · e2023409
  由 Wilber 提交于 3月 01, 2021
  
  e2023409
25 2月, 2021 1 次提交
- W
  
  enable lite ut. (#30890) · 7d91974c
  由 Wilber 提交于 2月 25, 2021
  
  7d91974c
18 1月, 2021 1 次提交
- H
  
  Ascend Framework Part1: OP & Wrapper (#30281) · 40ede126
  由 hutuxian 提交于 1月 18, 2021
  
  40ede126
12 1月, 2021 1 次提交

Fix/distributed proto (#29981) · 25f80fd3

由 tangwei12 提交于 1月 12, 2021

* rename sendrecv.proto to namespace paddle.distributed

* split ps with distributed

25f80fd3

07 1月, 2021 1 次提交
- T
  
  down openssl (#29958) · 7564d43b
  由 tianshuo78520a 提交于 1月 07, 2021
  
  7564d43b
24 12月, 2020 1 次提交

[Feature] one ps (3/4) (#29604) · 032414ca

由 tangwei12 提交于 12月 24, 2020

* oneps (3/4)
Co-authored-by: NMrChengmo <cmchengmo@163.com>
Co-authored-by: Nmalin10 <malin10@baidu.com>
Co-authored-by: Nchengmo <chengmo@baidu.com>

032414ca

16 12月, 2020 1 次提交

添加rocm平台支持代码 (#29342) · 76738504

由 Y_Xuan 提交于 12月 16, 2020

* 添加rocm平台支持代码

* 修改一些问题

* 修改一些歧义并添加备注

* 修改代码格式

* 解决冲突后的代码修改

* 修改operators.cmake

* 修改格式

* 修正错误

* 统一接口

* 修改日期

76738504

25 9月, 2020 2 次提交

add xpu in heter mode (#27000) · 6f69a4cb

由 Thunderbrook 提交于 9月 25, 2020

* add xpu in heter mode
test=develop

* BOOST_CONST_GET; PADDLE_THROW
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* refine
test=develop

* refine
test=develop

* refine
test=develop

* refine code
test=develop

6f69a4cb

update gcc8 in python3 ci docker (#26979) · a2e0b7cb

由 tianshuo78520a 提交于 9月 25, 2020

* update gcc8 in python3 ci docker

* change cuda 10.2

* update cudnn8

* nvidia error cuda10.2-cudnn8-centos6 images

* fix third cache

a2e0b7cb

09 9月, 2020 1 次提交
- W
  
  [cuda11 support] change the CMakeLists to support the cuda11 (#27124) · c71d79b1
  由 wangchaochaohu 提交于 9月 09, 2020
  
  c71d79b1
21 8月, 2020 1 次提交

support Baidu Kunlun AI Accelerator (#25959) · 138ecf24

由 QingshuChen 提交于 8月 21, 2020

* support Baidu AI Accelerator
  * test=kunlun

* minor
 * test=kunlun

* support xpu op in separate file
 * test=kunlun

* update XPU error message and remove duplicated code

 * test=kunlun

* minor
 * test=kunlun

* minor
 * test=kunlun

138ecf24

14 8月, 2020 1 次提交
- 【paddle.fleet】upgrade fleet: modify role_maker (#26038) · 935da32d
  由 vslyu 提交于 8月 14, 2020
```
* add unittest for paddlerolemaker with gloo
```
  935da32d
28 5月, 2020 1 次提交
- Z
  
  add WITH_GPU for cudaerror download (#24056) · d1047d0a
  由 Zhou Wei 提交于 5月 28, 2020
  
  d1047d0a
27 5月, 2020 1 次提交
- Y
  
  Add crypto api (#24694) · 5a7a517c
  由 Yanghello 提交于 5月 27, 2020
  
  5a7a517c
22 4月, 2020 1 次提交
- Z
  
  Add note about the time cost and change HTTPS to HTTP to avoid unable to download(#24043) · 6f5669f9
  由 Zhou Wei 提交于 4月 22, 2020
  
  6f5669f9
20 4月, 2020 1 次提交

Optimize the error messages of paddle CUDA API (#23816) · 78170037

由 Zhou Wei 提交于 4月 20, 2020

* Optimize the error messages of paddle CUDA API, test=develop

* fix the error messages of paddle CUDA API, test=develop

* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop

* remove build_ex_string,test=develop

* merge conflict,test=develop

78170037

09 4月, 2020 1 次提交

Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a

由 mozga-intel 提交于 4月 09, 2020

* Remove the NGraph engine from PDPD repository
1. Each operator was removed from the operator's directory
2. Each test was removed from the unittest directory
3. The parallel executor support was removed from the PDPD
4. The CMake file was removed from the PDPD
5. The NG flags were removed from the repository
test=develop

* Remove ngraph from:
1. Cmake file
2. Python file
test=develop

3baaee9a

02 3月, 2020 1 次提交
- Z
  fix bug that sourcecode of third_party can't be cached correctly,and add cache... · 0fb5ea78
  由 zhou wei 提交于 3月 02, 2020
```
fix bug that sourcecode of third_party can't be cached correctly,and add cache for xbyak and openblas (#22772)
```
  0fb5ea78
28 2月, 2020 1 次提交
- T
  
  fix typo word (#22784) · 433cef03
  由 tianshuo78520a 提交于 2月 28, 2020
  
  433cef03
14 1月, 2020 1 次提交
- X
  add collective communication library in fleet (#22211) · e3a457d3
  由 xujiaqi01 提交于 1月 14, 2020
```
* add collective communication library in fleet to replace mpi
* test=develop
```
  e3a457d3
09 1月, 2020 2 次提交
- 石
  
  [Feature] Lite subgraph (#22114) · ad0dfb17
  由石晓伟提交于 1月 09, 2020
  
  ad0dfb17
- Z
  tweak the interface of cache_third_party function - expose the SOURCE_DIR for... · 4f7a2bd0
  由 zhouwei25 提交于 1月 09, 2020
```
tweak the interface of cache_third_party function - expose the SOURCE_DIR for each external library (#21899)
```
  4f7a2bd0
25 12月, 2019 1 次提交
- Z
  
  remove patch command and file of cares to Improved quality of Paddle Repo (#21776) · a01663ca
  由 zhouwei25 提交于 12月 25, 2019
  
  a01663ca
05 12月, 2019 1 次提交

Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e

由 Leo Chen 提交于 12月 05, 2019

* test=develop, fix docker with paddle nccl problem

* don't expose numerous Tensor.set(), test=develop

* fix condition, test=develop

* fix float16 bug, test=develop

* feed should be Tensor or np.array, not Variable or number, test=develop

* use forcecast to copy numpy slice to new array, test=develop

* remove float16-uint16 hacking, test=develop

* add variable method to varbase and refactor to_variable to support return varbase

* support kwargs in varbase constructor

* add VarBase constructor to support default python args

* refine varbase initial method

* reset branch

* fix ut for change VarBase error info to PaddleEnforce

* cherry is parameter change before

* overload isinstance to replace too many change of is_variable

* rm useless files

* rm useless code merged by git

* test=develop, fix some ut failed error

* test=develop, fix test_graph_wrapper

* add some tests, test=develop

* refine __getitem__, test=develop

* add tests, test=develop

* fix err_msg, test=develop

cdd46d7e

25 11月, 2019 1 次提交
- Z
  
  Cache 3rd source code, improve stability, reduce the compilation time (#21190) · 341dee06
  由 zhouwei25 提交于 11月 25, 2019
  
  341dee06
18 11月, 2019 1 次提交
- Z
  fix bug when build openblas with a computer that has installed openblas... · 5d821578
  由 zhouwei25 提交于 11月 18, 2019
```
fix bug when build openblas with a computer that has installed openblas before,test=develop (#21160)
```
  5d821578
12 11月, 2019 1 次提交
- Z
  
  Remove useless code of openblas and fix the previous incorrect message (#21092) · d2573550
  由 zhouwei25 提交于 11月 12, 2019
  
  d2573550
11 11月, 2019 1 次提交
- M
  Add Shallow clone to ExternalProjects (#21060) · 6cc544aa
  由 Michał Gallus 提交于 11月 11, 2019
```
test=develop
```
  6cc544aa
08 11月, 2019 1 次提交
- Z
  
  move more third party library related logic to third_party.cmake (#20927) · 89bc18ee
  由 zhouwei25 提交于 11月 08, 2019
  
  89bc18ee
04 11月, 2019 1 次提交
- Z
  
  fix mklml and cblas bug,test=develop (#20970) · 394edd86
  由 zhouwei25 提交于 11月 04, 2019
  
  394edd86
31 10月, 2019 1 次提交
- Z
  
  Integration of third_party compilation structure (#20887) · b7417610
  由 zhouwei25 提交于 10月 31, 2019
  
  b7417610

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致