提交 · 1435b4c0961a6d6206904a315cb6bfbabfbe6f72 · PaddlePaddle / Paddle

23 2月, 2021 1 次提交

[NPU] Support executor with NPU (#31057) · 1435b4c0

由 liym27 提交于 2月 23, 2021

* [NPU] Support executor with NPU

* Fix code according to reviews

* Fix code

* Add unittest for sub op npu

1435b4c0

24 12月, 2020 1 次提交

[Feature] one ps (3/4) (#29604) · 032414ca

由 tangwei12 提交于 12月 24, 2020

* oneps (3/4)
Co-authored-by: NMrChengmo <cmchengmo@163.com>
Co-authored-by: Nmalin10 <malin10@baidu.com>
Co-authored-by: Nchengmo <chengmo@baidu.com>

032414ca

20 11月, 2020 1 次提交
- G
  
  Fix gpu memory allocation bug. (#28703) · 1dad8cea
  由 gongweibao 提交于 11月 20, 2020
  
  1dad8cea
30 10月, 2020 1 次提交
- L
  
  hide some logs of p2p (#28307) · 18c86fb2
  由 Leo Chen 提交于 10月 30, 2020
  
  18c86fb2
28 9月, 2020 2 次提交

A
Add support for mkldnn ops types selection with FLAGS in dygraph (#27482) · 0ecf441a
由 arlesniak 提交于 9月 28, 2020
```
* Add support for mkldnn ops types selection with FLAGS in dygraph

* use regex to match DNNL verbose

* python3 encoding fix
```
0ecf441a

add paddle.fluid._cuda_synchronize (#27595) · c68a0313

由 wanghuancoder 提交于 9月 28, 2020

* add paddle.fluid._cuda_synchronize, test=develop

* fix bug about core_avx core_noavx, test=develop

* delete CPUPlace and XPUPlace, test=develop

c68a0313

21 9月, 2020 1 次提交

[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112) · aba759ba

由 Leo Chen 提交于 9月 21, 2020

* support use add instead of sum to do gradient accumulation

* add inplace addto pass

* add grad_add op and inplace addto pass

* remove debug code

* code refine

* fix bug when sereral sum ops inserts at same op_idx

* fix Flags type

* add addto attribute for conv3d

* fix ut

* code clean

* fix type

aba759ba

28 8月, 2020 1 次提交

Update the demo code and the doc of varbase.backward. (#26506) · f9066e6a

由 Zhen Wang 提交于 8月 28, 2020

* update the demo code and the doc of varbase.backward.

* update the doc of the fake interface `paddle.fluid.Variable`.

* remove BackwardStrategy.

f9066e6a

21 8月, 2020 1 次提交

support Baidu Kunlun AI Accelerator (#25959) · 138ecf24

由 QingshuChen 提交于 8月 21, 2020

* support Baidu AI Accelerator
  * test=kunlun

* minor
 * test=kunlun

* support xpu op in separate file
 * test=kunlun

* update XPU error message and remove duplicated code

 * test=kunlun

* minor
 * test=kunlun

* minor
 * test=kunlun

138ecf24

18 8月, 2020 1 次提交
- Y
  
  add cpu random Generator (#26013) · 23261ff4
  由 yaoxuefeng 提交于 8月 18, 2020
  
  23261ff4
07 8月, 2020 1 次提交
- L
  Add flags to control call stack of error message (#25997) · 751305ec
  由 Leo Chen 提交于 8月 07, 2020
```
* add flags_call_stack_level

* update

* refine code
```
  751305ec
18 6月, 2020 1 次提交

add new API: set_global_initializer (#24378) · 542a226c

由 Zhou Wei 提交于 6月 18, 2020

* add new api (set_global_initializer/reset_global_initializer),test=develop

* add new api (set_global_initializer/reset_global_initializer),test=develop

* fix doc and example code of set_global_initializer,test=develop

542a226c

13 5月, 2020 1 次提交
- H
  
  add enable_imperative, disable_imperative alis; test=develop (#24392) · f0df9026
  由 hong 提交于 5月 13, 2020
  
  f0df9026
20 4月, 2020 1 次提交
- Z
  
  remove high level api (#23854) · 6bd200db
  由 zhangchunle 提交于 4月 20, 2020
  
  6bd200db
09 4月, 2020 1 次提交

Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a

由 mozga-intel 提交于 4月 09, 2020

* Remove the NGraph engine from PDPD repository
1. Each operator was removed from the operator's directory
2. Each test was removed from the unittest directory
3. The parallel executor support was removed from the PDPD
4. The CMake file was removed from the PDPD
5. The NG flags were removed from the repository
test=develop

* Remove ngraph from:
1. Cmake file
2. Python file
test=develop

3baaee9a

04 4月, 2020 1 次提交

Dev/fix init flags (#23465) · f297a332

由 Leo Chen 提交于 4月 04, 2020

* fix init_gflags with 'python -c', test=develop

* add test, test=develop

* use sys.executable instead of python, test=develop

* keep dummy, test=develop

f297a332

02 4月, 2020 1 次提交
- Z
  
  add new method of gradient_clip, better to use,test=develop (#23224) · 7fda333a
  由 Zhou Wei 提交于 4月 02, 2020
  
  7fda333a
04 3月, 2020 1 次提交

Add flags to limit gpu memory (#22793) · d41d802b

由 Zeng Jinle 提交于 3月 04, 2020

* add recorded cuda memory apis, fix typo, test=develop

* add more ut, test=develop

* follow comments, test=develop

* fix py35 incompatible issues, test=develop

d41d802b

03 3月, 2020 1 次提交

Add functional dygraph mode api (#22745) · df87e79f

由 songyouwei 提交于 3月 03, 2020

* functional dygraph enable/disable
test=develop

* use context manager instead
test=develop

* refine sample code
test=develop

* rename api & expose to fluid
test=develop

* fix sample code
test=develop

df87e79f

17 1月, 2020 1 次提交
- T
  integrated HALF_ASYNC to communicator (#21869) · 82bc814a
  由 tangwei12 提交于 1月 17, 2020
```
* add half_async in the communicator
* fix DistributedStrategy
```
  82bc814a
19 12月, 2019 1 次提交
- Z
  Add some debug flags to auto growth allocator (#21766) · aa4d6a5d
  由 Zeng Jinle 提交于 12月 18, 2019
```
* add some debug flags to auto growth allocator, test=develop

* add comments about auto growth, test=develop
```
  aa4d6a5d
05 12月, 2019 1 次提交

Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e

由 Leo Chen 提交于 12月 05, 2019

* test=develop, fix docker with paddle nccl problem

* don't expose numerous Tensor.set(), test=develop

* fix condition, test=develop

* fix float16 bug, test=develop

* feed should be Tensor or np.array, not Variable or number, test=develop

* use forcecast to copy numpy slice to new array, test=develop

* remove float16-uint16 hacking, test=develop

* add variable method to varbase and refactor to_variable to support return varbase

* support kwargs in varbase constructor

* add VarBase constructor to support default python args

* refine varbase initial method

* reset branch

* fix ut for change VarBase error info to PaddleEnforce

* cherry is parameter change before

* overload isinstance to replace too many change of is_variable

* rm useless files

* rm useless code merged by git

* test=develop, fix some ut failed error

* test=develop, fix test_graph_wrapper

* add some tests, test=develop

* refine __getitem__, test=develop

* add tests, test=develop

* fix err_msg, test=develop

cdd46d7e

02 12月, 2019 1 次提交
- Z
  
  add fraction of cpu memory to use, test=develop (#21453) · 2a54c359
  由 Zeng Jinle 提交于 12月 02, 2019
  
  2a54c359
29 11月, 2019 1 次提交

add unused input vars check for OpWithKernel, test=develop (#21169) · e0c9d856

由 Leo Chen 提交于 11月 29, 2019

* add unused input vars check for OpWithKernel, test=develop

* remove unused vars in some ops, test=develop

* fix batch_norm, test=develop

* add white list, test=develop

* add CI check for white list, test=develop

* :ove white list to c++, test=develop

* solve failure of CI, test=develop

* add unittest for unused_var_check, test=develop

* refine code, enable check in operator_test, test=develop

* skip mkldnn, test=develop

* extend white list, test=develop

* refine condition of mkldnn, test=develop

* fix paddle_build, test=develop

* follow comments, test=develop

* fix GetExpectedKernelType

* add wiki ref to err_msg, test=develop

* follow comment, test=develop

e0c9d856

28 11月, 2019 1 次提交

Use system allocator in OpTest (#21335) · 09696d5d

由 Zeng Jinle 提交于 11月 28, 2019

* use system allocator in unittests, test=develop

* fix op bugs, test=develop

* fix tensor copy bug when src and dst are the same, test=develop

09696d5d

29 10月, 2019 1 次提交

save load problem fix and new feature add (#20823) · ff0886a9

由 hong 提交于 10月 29, 2019

* fix persistable;

* fix save load bugs; test=develop

* fix bug; test=develop

* add example for new io api; test=develop

* addd example; test=develop

ff0886a9

20 10月, 2019 1 次提交
- 1
  test=develop, add communicator_is_sgd_optimizer flag (#20677) · 95e90aa1
  由 123malin 提交于 10月 20, 2019
```
* test=develop, communicator_is_sgd_optimizer flags
```
  95e90aa1
16 10月, 2019 1 次提交
- G
  
  Retry when failed to bind address. (#20642) · f3f52fc1
  由 gongweibao 提交于 10月 16, 2019
  
  f3f52fc1
10 10月, 2019 1 次提交

New save load interface (#20148) · fa43e80e

由 hong 提交于 10月 10, 2019

* add new save load interface; test=develop

* add new save interface; test=develop

* add save load interface ;

* fix save load error;

* fix dygraph set dict bug;

* add save load unit test; test=develop

* fix test_imperative_optimizer bug; test=develop

* fix unitest optimizer bug; test=develop

* fix code coverage; test=develop

* fix converage; test=develop

* add document for apis; test=develop

* fix unitest error; test=develop

* fix save load unit test error; test=develop

* fix error message; test=develop

* change set_parameter set_optimizer to save_dygraph; test=develop

* add load_graph check; test=develop

* fix api spec; test=develop

fa43e80e

07 10月, 2019 1 次提交
- T
  Trainer heartbeat for async mode (#19600) · b5a41046
  由 tangwei12 提交于 10月 07, 2019
```
Heartbeat for distributed async training.
```
  b5a41046
30 9月, 2019 1 次提交
- C
  Add GEO-SGD distribute training algorithm (#20018) · 728ec1b4
  由 Chengmo 提交于 9月 30, 2019
```
* refector geo sgd & communicator
```
  728ec1b4
26 9月, 2019 1 次提交

Add new data layer (#19916) · 88af4ab6

由 Huihuang Zheng 提交于 9月 26, 2019

The new "fluid.data" changes old "fluid.layers.data":

1. Add shape and dtype check.
2. Remove "append_batch_size" parameter. We won't offer this in the new data layer because other deep learning platforms don't have this kind of data layer pre-processing. It may confuse users.
3. Remove "stop gradient" parameter because the data layer doesn't do back-propagation

TODO：
Now data layer feeded by executor is checked, will we want to check the feed data of readers in the future?

88af4ab6

24 9月, 2019 1 次提交

Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735) · 039b9710

由 Aurelius84 提交于 9月 24, 2019

* Remove constraint that last dimension is forced to be 1 by add
lookup_table_v2 test=develop

* modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop

* Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop"

This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9.

* move api into fluid.embedding test=develop

* fix example code test=develop

* move one_hot into fluid.one_hot

* modify api.spec test=develop

* fix loss shape test=develop

039b9710

23 9月, 2019 1 次提交
- C
  Delete local execution scopes (#19749) · d7251a8e
  由 chengduo 提交于 9月 23, 2019
```
* Add RecordHistoryLocalExecScopes
test=develop
```
  d7251a8e
18 9月, 2019 2 次提交
- Z
  
  remove some flags and add comments to some flags, test=develop (#19813) · 13ca364c
  由 Zeng Jinle 提交于 9月 18, 2019
  
  13ca364c
- 1
  add retry function to try to solve grpc error code 14 (#19661) · 1bc285a5
  由 123malin 提交于 9月 18, 2019
```
* rpc retry for asycsend/get/prefetch

* test=develop, change retry vlog level to 3

* test=develop, set default grpc_retry_times is 3
```
  1bc285a5
12 9月, 2019 1 次提交

Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716) · 8c7e4119

由 Aurelius84 提交于 9月 12, 2019

* add one_hot_v2_op to remove last_dims==1 test=develop

* add api unittest code for CI_Coverage test=develop

* improve CI_Coverage rate by adding test_with_depth test=develop

8c7e4119

11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

09 9月, 2019 1 次提交
- Z
  
  add gpu_allocator_try_time config, test=develop (#19675) · a7691603
  由 Zeng Jinle 提交于 9月 09, 2019
  
  a7691603
26 8月, 2019 1 次提交
- M
  delete recordio writer (#19406) · c2e5eaa2
  由 mapingshuo 提交于 8月 26, 2019
```
test=develop
```
  c2e5eaa2

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功