提交 · 526456678613101797f931fd64a0aa4e8d051942 · 机器未来 / Paddle

11 11月, 2021 2 次提交
- L
  
  Get global cluster information (#37084) · 31673a92
  由 LiYuRio 提交于 11月 11, 2021
  
  31673a92
- W
  fix 2 bug: 1.skip lodtensorarray; 2.delete feed op (#37090) · d5df6bdf
  由 wanghuancoder 提交于 11月 11, 2021
```
* fix 2 bug: 1.skip lodtensorarray; 2.delete feed op, test=develop

* program clone, test=develop
```
  d5df6bdf
10 11月, 2021 1 次提交
- A
  
  Fix inner_program in Executor (#37083) · 8a2ce0f2
  由 Aurelius84 提交于 11月 10, 2021
  
  8a2ce0f2
09 11月, 2021 1 次提交
- A
  
  fix CompileProgram in Executor (#37036) · 77a8c94b
  由 Aurelius84 提交于 11月 09, 2021
  
  77a8c94b
03 11月, 2021 1 次提交
- L
  
  executor framework (#36892) · 10b039b7
  由 LiYuRio 提交于 11月 03, 2021
  
  10b039b7
29 10月, 2021 1 次提交
- W
  fix some bug in new executor (#36822) · b5af9575
  由 wanghuancoder 提交于 10月 29, 2021
```
* fix some bug in new executor, test=develop

* fix error message, test=develop
```
  b5af9575
20 10月, 2021 1 次提交

Add FasterTokenizer Operator (#34491) · 3f2d6a3f

由 Steffy-zxf 提交于 10月 20, 2021

Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent.

* support the text string as an input Tensor
* support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens
* Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization.
* It first applies basic tokenization, followed by wordpiece tokenization.

3f2d6a3f

13 10月, 2021 1 次提交
- H
  Remove RunFromCinn in PE because We Will Call CinnRunner in Compute of SubgraphOp (#36385) · e051bba0
  由 Huihuang Zheng 提交于 10月 13, 2021
```
Remove RunFromCinn method in PE because We Will Call CinnRunner in Compute method of SubgraphOp
```
  e051bba0
11 10月, 2021 1 次提交

Add use_cinn Flag and RunFromCinn in PE (#36107) · 5690666c

由 Huihuang Zheng 提交于 10月 11, 2021

Add use_cinn flag and use it to control whether we run PaddlePaddle using CINN.

Also add:

Replace PaddlePaddle graph with a CINN graph in a pass
PE Method to feed data and run the graph by CINN

5690666c

08 10月, 2021 1 次提交

Support CUDA Graph on ParallelExecutor (#36250) · f9591bb1

由 Zeng Jinle 提交于 10月 08, 2021

* support CUDA Graph on PE

* add ut, fix CI compile

* reduce memory consumption

* fix CUDA 10 CI

* improve coverage

* improve python coverage

f9591bb1

22 9月, 2021 1 次提交
- W
  fix feed for new executor (#35803) · 4c2a06df
  由 wanghuancoder 提交于 9月 21, 2021
```
* fix feed, test=develop

* delete one test case, test=develop
```
  4c2a06df
15 9月, 2021 1 次提交
- A
  Enhance Check mechanism and Support single tuple/list of fetch_list in Executor (#35726) · cae050e8
  由 Aurelius84 提交于 9月 15, 2021
```
* Enhance Check mechanism of fetch_list in Executor

* support single tuple

* fix typo

* fix typo
```
  cae050e8
14 9月, 2021 1 次提交

Intergrate StandaloneExecutor in Static.Executor Interface with... · 4bc08530

由 Aurelius84 提交于 9月 14, 2021

Intergrate StandaloneExecutor in Static.Executor Interface with FLAGS_USE_STANDALONE_EXECUTOR (#35628)

* Intergrate StandaloneExecutor in Static.Executor Interface with FLAGS_USE_STANDALONE_EXECUTOR

* Enhance unittest and clean code in StandaloneExecutor

* polish unittest

4bc08530

28 7月, 2021 1 次提交

graph_to_program save parameter and stop_gradient information (#33771) · 8a7dee31

由 jiangcheng 提交于 7月 28, 2021

This PR added optional boolean is_parameter and stop_gradient in the VarDesc proto, and remove them during save_inference_model

8a7dee31

27 7月, 2021 1 次提交
- W
  
  [hybrid parallel] pipeline support adamw and LRScheduler (#34402) · 6ab0a6a8
  由 WangXi 提交于 7月 27, 2021
  
  6ab0a6a8
09 7月, 2021 1 次提交
- Y
  
  [hybrid performance] pipeline cache trainer (#33998) · 98c7191d
  由 Yuang Liu 提交于 7月 09, 2021
  
  98c7191d
06 7月, 2021 1 次提交
- W
  
  [hybrid performance] pipeline add program cache (#33954) · c9ae1362
  由 WangXi 提交于 7月 06, 2021
  
  c9ae1362
05 7月, 2021 1 次提交
- W
  
  [hybrid performance] optimize pipeline performance · 9914dff7
  由 WangXi 提交于 7月 05, 2021
  
  9914dff7
08 5月, 2021 1 次提交
- D
  【heterps】support cuda11 for heterps; add profiler in oneps (#32640) · beab9563
  由 danleifeng 提交于 5月 08, 2021
```
* add trainprofiler for heterps in oneps; test=develop

* add set_use_ps_gpu; test=develop
```
  beab9563
23 4月, 2021 1 次提交
- B
  solve hccl communicate conflict (#32447) · 0e74eea2
  由 Baibaifan 提交于 4月 23, 2021
```
solve hccl communicate conflict (#32447)
```
  0e74eea2
15 4月, 2021 1 次提交

heterps support pscore (#32093) · 9f8c8f96

由 Thunderbrook 提交于 4月 15, 2021

* pscore support heterps

* fleet cmake

* fleet wrapper

* macro

* solve conflict

* solve conflict

* add unitest

* paddle enforce

* unitest

* unitest

* unitest

9f8c8f96

09 4月, 2021 1 次提交

[NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d

由 Leo Chen 提交于 4月 09, 2021

* [feature] support npu allocator (#30840)

[feature] support npu allocator

* [feature] support npu operator (#30951)

[feature] support npu operator

* [feature] support npu allocator, part 2 (#30972)

* support npu allocator

* add npu device context

* fix some compile problem

* fix some compile problem

* add npu info

* compile ok

* fix include dir

* support naive_best_fit_allocator

* run ut ok, bug failed to exit

* call aclrtResetDevice before exit

* fix aclFinilize

* add system allocatot test

* add selected_gpus in gtest

* add tensor_test for npu

* support npu op, initial commit

* add npu stream

* add elementwise_add_op

* compile ok

* fix typo

* fix elementwise_add_op_npu_test

* support op run

* test can run but failed

* change aclopExecuteV2 to aclopCompileAndExecute

* support parsing ascend rank table file (#31000)

support parsing ascend rank table file

* Fix reshape on GE graph. (#31084)

Fix reshape on GE graph

* add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)

* add npu sub op

* fix typo

* rename test

* fix bug

* fix bug

* add fp16 kernel

* fix typo

* support sub grad op

* support elementwise_sub_grad op
Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>

* Fix compilation problem (#31100)

Fix compilation problem (#31100)

* fix compile

* fix code stype

* remove const_cast

* support adding correct npu op in pybind.h (#31143)

* support adding correct npu op in pybind.h

* refine code

* [NPU] Support executor with NPU (#31057)

* [NPU] Support executor with NPU

* Fix code according to reviews

* Fix code

* Add unittest for sub op npu

* refactor npu device manager (#31154)

refactor npu device manager (#31154)

* fix selected npus

* fix compile

* fix reading flags from env

* format
Co-authored-by: Nxiayanming <41795079@qq.com>
Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>

ccf5709d

26 3月, 2021 1 次提交
- L
  [3D-parallel] Reformat pipeline parallel (#31786) · c3974d0e
  由 lilong12 提交于 3月 26, 2021
```
* update, test=develop
```
  c3974d0e
07 1月, 2021 1 次提交
- W
  
  refine the paddle place support using str (#28769) · 7dd551e0
  由 wangchaochaohu 提交于 1月 07, 2021
  
  7dd551e0
23 12月, 2020 1 次提交

heter box (#29734) · 09b6e719

由 Thunderbrook 提交于 12月 23, 2020

* 　add heter box

* add trainer, worker, wrapper...

* format

* for ci

* format

* remove boost get

* boost & copyright

* rename

* 　rename

* format

* format

* format
Co-authored-by: Nyaoxuefeng6 <yaoxuefeng@baidu.com>

09b6e719

23 11月, 2020 2 次提交

L
enable pipeline to run with Executor.run() (#28373) · f77a78cd
由 lilong12 提交于 11月 23, 2020
```
* update, test=develop
```
f77a78cd

support ps-gpu (#28752) · 0073f9bd

由 Thunderbrook 提交于 11月 23, 2020

* ps gpu transpile

* ps gpu

* remove op

* gps trainer

* local ps

* add macro

* HeterBox

* def cuda

* tab

* code style

* style

Co-authored-by: Thunderbrook <a754913769#163.com>

0073f9bd

30 10月, 2020 1 次提交
- 石
  update the version of pybind, test=develop (#28284) · d9b5f126
  由石晓伟提交于 10月 30, 2020
```
* update version pybind to v2.4.3, test=develop

* update unittests, test=develop
```
  d9b5f126
14 10月, 2020 1 次提交
- H
  Refine Executor API English Doc for 2.0rc (#27857) · 426de255
  由 Huihuang Zheng 提交于 10月 14, 2020
```
As the title
```
  426de255
13 10月, 2020 1 次提交

fix english doc, unittest, and remove useless alias of 2.0 lr_scheduler (#27686) · e122e164

由 Zhou Wei 提交于 10月 13, 2020

* fix doc and unittest of 2.0 lr_scheduler

* fix doc of 2.0 lr_scheduler

* fix unittest

* fix english doc of lr_scheduler

* fix api name of lr scheduler

* fix api name of lr scheduler

e122e164

10 10月, 2020 1 次提交
- W
  
  [API 2.0]Update 2.0 api from fluid to paddle (#27802) · c425cf18
  由 Wilber 提交于 10月 10, 2020
  
  c425cf18
30 9月, 2020 1 次提交
- W
  
  [API 2.0]Update 2.0 api from fluid to paddle. (#27598) · 488152a6
  由 Wilber 提交于 9月 30, 2020
  
  488152a6
25 9月, 2020 1 次提交

add xpu in heter mode (#27000) · 6f69a4cb

由 Thunderbrook 提交于 9月 25, 2020

* add xpu in heter mode
test=develop

* BOOST_CONST_GET; PADDLE_THROW
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* refine
test=develop

* refine
test=develop

* refine
test=develop

* refine code
test=develop

6f69a4cb

25 8月, 2020 1 次提交

optimized transformation form tensor to numpy (#26447) · c1f5df52

由 wanghuancoder 提交于 8月 25, 2020

* optimized transformation form tensor to numpy, test=develop

* optimized transformation form tensor to numpy, pass pre-commit, test=develop

* modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop

* modify py:array construct, test=develop

* fix _fetch_var to use deep copy, test=develop

c1f5df52

24 8月, 2020 1 次提交

[2.0API] Reconstruct all API related to LR Scheduler, unify dygraph and static (#26550) · 407de039

由 Zhou Wei 提交于 8月 24, 2020

* Reconstruct all API related to lr scheduler, unify dygraph and static

* Reconstruct all API related to lr scheduler, unify dygraph and static

* fix doc

* fix doc

* fix doc of lr_scheduler

* fix unittest and english doc

* fix english doc

* fix confilt

* fix doc

407de039

19 8月, 2020 1 次提交
- W
  Add check if fluid.data() variable no feed data (#25858) · ea6716a5
  由 wanghuancoder 提交于 8月 19, 2020
```
* add check if fluid.data() variable no feed data, test=develop

* Add testcase for feed check, test=develop
```
  ea6716a5
16 8月, 2020 1 次提交
- W
  
  [API2.0] add Device api (set_device and get_device)(#26103) · bb11cbc2
  由 wangchaochaohu 提交于 8月 16, 2020
  
  bb11cbc2
08 8月, 2020 1 次提交
- G
  
  Save checkpoint automatically (#25917) · 0067a2e4
  由 gongweibao 提交于 8月 08, 2020
  
  0067a2e4
06 8月, 2020 1 次提交

add heter ps mode (#25682) · 0cb60c70

由 Thunderbrook 提交于 8月 06, 2020

* add heter ps mode

* code style
test=develop

* add with_pslib
test=develop

* unitest
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* test monitor
test=develop

* prepare trainer
test=develop

* code style
test=develop

0cb60c70

04 8月, 2020 1 次提交
- H
  Modify Executor Example Code to Use fluid.data, test=document_fix (#25893) · e5514935
  由 Huihuang Zheng 提交于 8月 04, 2020
```
As the title
```
  e5514935

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致