提交 · 443a713c9e75a3de97c511a71ed322b911dcb280 · PaddlePaddle / Paddle

10 1月, 2020 1 次提交

Add bn and relu fuse pass (#22048) · 46189b16

由 Zhen Wang 提交于 1月 10, 2020

* add bn and relu fuse pass

* add op attr assert and dtype assert

* fix some inputs&&outputs bugs for the fused op and pattern.

* add the unittest for fuse_bn_act_pass. test=develop

* use normative enforce statements. test=develop

* add the cpu test. test=develop

* add the support of batch_size=1 for the bn with relu op. test=develop

* add the error type for paddle throws. test=develop

* add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop

46189b16

09 1月, 2020 4 次提交

test Optimizer in dygraph (#21949) · d0f0a252

由 zhongpu 提交于 1月 09, 2020

* test Optimizer in dygraph, test=develop

* add optest for Optimizer in dygraph, test=develop

* fix adagrad optimizer, test=develop

* fix dpsgd optimizer, test=develop

* fix test_optimizer.py, test=develop

* fix dpsgd optimizer, this op only support cpu, test=develop

* add optest for optimizer, test=develop

* add description for dpsgd, test=develop

* add rmsprop to white_list in unused_var_check.cc, test=develop

* polish code style, test=develop

* polish code style, test=develop

* delete seed attribute for DpsgdOptimizer, test=develop

* change testing to debugging, test=develop

d0f0a252

J

Add multiple quantize operators fuse (#22062) · 5b2e98aa
由 joanna.wozna.intel 提交于 1月 09, 2020

5b2e98aa

Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144) · 96980c22

由 Yiqun Liu 提交于 1月 09, 2020

* Polish the PADDLE_ENFORCE in fusion_group pass related codes.
test=develop

* Correct the unittest because of the change relu_grad's formula.
test=develop

96980c22

W
add support for nested profiling event and printing in different level (#22061) · c3876cf8
由 wangchaochaohu 提交于 1月 09, 2020
```
* add support for nested profiling event and printing in different level
```
c3876cf8

07 1月, 2020 3 次提交
- L
  
  fix xception precision problem, test=develop (#22124) · 724b13e4
  由 liu zhengxi 提交于 1月 07, 2020
  
  724b13e4
- Y
  Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094) · b1401fb7
  由 Yiqun Liu 提交于 1月 07, 2020
```
test=develop
```
  b1401fb7
- B
  
  fix format in operator.cc (#22101) · 4b4a9cc8
  由 bingyanghuang 提交于 1月 07, 2020
  
  4b4a9cc8
06 1月, 2020 3 次提交
- S
  
  test=develop, remove unused parameter from class RuntimeInferShapeContext constructors (#22046) · 6c20e7c4
  由 silingtong123 提交于 1月 06, 2020
  
  6c20e7c4
- J
  
  [MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088) · b0b27ff6
  由 Jacek Czaja 提交于 1月 06, 2020
  
  b0b27ff6
- H
  
  Add ParallelExecutor Test for Cond API and Fix PE Checks Shape Bug (#22029) · dd436156
  由 Huihuang Zheng 提交于 1月 06, 2020
  
  dd436156
05 1月, 2020 1 次提交
- J
  
  [MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747) · ad8a9cb8
  由 Jacek Czaja 提交于 1月 05, 2020
  
  ad8a9cb8
03 1月, 2020 2 次提交

Add the first implememtation of fusion_group op (#19621) · d4832077

由 Yiqun Liu 提交于 1月 03, 2020

* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
test=develop

* Call CUDA driver api to launch the kernel compiled by nvrtc.
test=develop

* Disable for mac and windows.
test=develop

* Refine the codes to support manually specified num_threads and workload_per_thread.
test=develop

* Refine the CUDA kernel to support large dims.
test=develop

* Add DeviceCodePool to manage all device codes.

* Add the first implementation fusion_group op.

* Add unit-test for fusion_group op.

* Add the check of result.

* Add the check of nvrtc in unit-test.
test=develop

* Add comment to explain the inputs, outputs and features of fusion_group op.
test=develop

* Disable fusion_group op for mac and windows.
test=develop

* Make the compiling of device code return status instead of hanging up.
test=develop

* Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.

* Unify fusion_group_op's input and output names.
test=develop

* Add the check of CUDA driver library in unittest.
test=develop

* Refine the calling of PADDLE_ENFORCE.
test=develop

d4832077

M

[DNNL] 3D Fully-Connected (#21746) · 61921084
由 Michał Gallus 提交于 1月 03, 2020

61921084

29 12月, 2019 1 次提交

Fix multi-threads memory out of bounds error for passes (#21920) · 196e20df

由 liu zhengxi 提交于 12月 29, 2019

* fix seqconv_eltadd_relu pass during multi-threads predictor, test=develop

* fix attention_lstm_fuse_pass during multi-threads inference, test=develop

* fix embedding_fc_lstm_fuse_pass during multi-threads inference, test=develop

* fix fc_lstm_fuse_pass during multi-threads inference, test=develop

* fix seq_concat_fc_fuse_pass during multi-threads inference, test=develop

196e20df

27 12月, 2019 1 次提交
- 石
  fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841) · 03479469
  由石晓伟提交于 12月 27, 2019
```
* fix multi-thread error of fc_gru_fuse_pass.cc, test=develop

* export FLAGS and GLOG symbols, test=develop
```
  03479469
25 12月, 2019 2 次提交
- P
  
  fix trt calib not working bug, test=develop (#21934) · 3e5008ad
  由 Pei Yang 提交于 12月 25, 2019
  
  3e5008ad
- Q
  Pack imperative/layer into paddle_framework.so (#21921) · 20667458
  由 qingqing01 提交于 12月 25, 2019
```
* Pack imperative/layer into paddle_framework.so
```
  20667458
24 12月, 2019 1 次提交

Optimize adam speed (#21777) · 51a86d2b

由 Aurelius84 提交于 12月 24, 2019

* optimize adam speed by removing _finish_update test=develop

* fix SparseAdamFunctor param list test=develop

* Remove scale_op in expect_list of adam_op test=develop

* fix test optimizer loss assert error test=develop

* fix test optimizer loss assert error test=develop

* modify PADDLE_ENFORCE usage test=develop

* fix op_type in lamb_op.cc test=develop

* fix errors ostream format bug test=develop

* add betaPowOut in ngraph op test=develop

* fix ngraph::op api for gcc8 test=develop

* clean code test=develop

* modify struct into class test=develop

* remove code of beta1Tensor in lamb_op test=develop

51a86d2b

20 12月, 2019 1 次提交

add table id in cache shuffle (#21585) · c3cf42d0

由 Thunderbrook 提交于 12月 20, 2019

* general table

* add sparse table
test=develop

* no cvm
test=develop

* add no_cvm
test=develop

* add note
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* add key of optimizer
test=develop

* solve pslib stop core
test=develop

* barrier
test=develop

* add notes
test=develop

* add table id in cache shuffle
test=develop

* table id
test=develop

* code style
test=develop

c3cf42d0

19 12月, 2019 1 次提交
- W
  
  fix batch_norm_grad infer shape=0 & add allreduce enforce shape, test=develop (#21801) · 17299b8d
  由 WangXi 提交于 12月 19, 2019
  
  17299b8d
18 12月, 2019 2 次提交

Fix Backward Bugs in Conditional Block (#21809) · 557bce77

由 Huihuang Zheng 提交于 12月 18, 2019

The fixed bugs:

1. The condition sub-graph is not pruned
2. When backward graph is extremely simple, the whole backward ops are pruned.

557bce77

X
fix compiled error when with_pslib=on (#21769) · 0eb4d990
由 xujiaqi01 提交于 12月 18, 2019
```
* fix compiled error of butil when with_pslib=on and with_testing=on
* test=develop
```
0eb4d990

16 12月, 2019 1 次提交

Add fc-dequantize squash in cpu_quantize_squash_pass for ernie model (#21714) · d3a96632

由 lidanqing 提交于 12月 16, 2019

* fc-dequantize squash
test=develop

* change according to reviews
test=develop

* change PADDLE_ENFORCE
test=develop

* add second test when fc-dequant do not fuse
test=develop

* change all related PADDLE_ENFORCE
test=develop

d3a96632

15 12月, 2019 1 次提交
- W
  
  fix std::min type in nan_inf, test=develop (#21725) · 8754cbd1
  由 WangXi 提交于 12月 15, 2019
  
  8754cbd1
12 12月, 2019 3 次提交

Add reshape int8 mkldnn op (#21428) · d419b859

由 joanna.wozna.intel 提交于 12月 12, 2019

* Add reshape int8 op

test=develop

* Change test to CPUPlace

test=develop

* Correct tests

test=develop

d419b859

W

Rewrite check nan inf tools (#21076) · 8a0f611b
由 WangXi 提交于 12月 12, 2019

8a0f611b

memory leak for cpu (#21174) · 9ad940fd

由 tangwei12 提交于 12月 12, 2019

* add fake init for the trainer, fix large memory hold in the trainer
* do not merge recv vars from a remote endpoint, test=develop
* add recv and save op, merge slice var in one op, save memory
* remove hsigmoid with pull sparse, test=develop

9ad940fd

11 12月, 2019 2 次提交
- Z
  Make OperatorWithKernel::InferShape abstract (#21633) · 73461a7a
  由 Zeng Jinle 提交于 12月 11, 2019
```
* make OperatorWithKernel::InferShape virtual, test=develop

* fix test_prepare_op by relu, test=develop
```
  73461a7a
- Z
  
  fix op_registry, add ignore op_function_impl.h, test=develop (#21654) · 6828f368
  由 Zeng Jinle 提交于 12月 11, 2019
  
  6828f368
10 12月, 2019 3 次提交

MKL-DNN 1.0 Update (#20162) · e81f0228

由 Adam 提交于 12月 10, 2019

* MKLDNN v1.0 rebase to Paddle 1.6
test=develop

* Add hacky paddle::string::to_string() implementation

* vectorize<int64-t>() -> vectorize() cleanup
test=develop

* PADDLE_ENFORCE and void_cast fixes
test=develop

* Rebase changes
test=develop

* Cosmetics
test=develop

* Delete MKL from mkldnn.cmake
test=develop

* CMake debug commands
test=develop

* Delete MKLDNN_VERBOSE and rebase fixes
test=develop

* Rebase fixes
test=develop

* Temporarily disable int8 resnet101 vgg16 and vgg19 tests
test=develop

* Add libmkldnn.so.1 to python setup
test=develop

* Add libmkldnn.so.1 to inference_lib cmake after rebase
test=develop

* Post rebase fixes + FC int8 changes
test=develop

* Fix LRN NHWC
test=develop

* Fix NHWC conv3d
test=develop

* Windows build fix + next conv3d fix
test=develop

* Fix conv2d on AVX2 machines
test=develop

e81f0228

X
fix master patch when slot is dense (#21580) · f4041572
由 xujiaqi01 提交于 12月 10, 2019
```
* fix master patch when slot is dense
* test=develop
```
f4041572
X
fix code style of fleet_wrapper (#21639) · c05706fe
由 xujiaqi01 提交于 12月 10, 2019
```
* fix code style of fleet_wrapper
* test=develop
```
c05706fe

07 12月, 2019 1 次提交
- X
  rm optimize_for in framework.proto (#21571) · 88960684
  由 xujiaqi01 提交于 12月 07, 2019
```
* remove optimize_for in framework.proto
* test=develop
```
  88960684
06 12月, 2019 3 次提交

Polish op registry codes (#21561) · 0f888836

由 Zeng Jinle 提交于 12月 06, 2019

* polish infer shape registry, test=develop

* modify some operators registry, test=develop

0f888836

H
Paddlebox Related to Framework (#21586) · c5aec2fe
由 hutuxian 提交于 12月 06, 2019
```
* Add a single_process_multi_thread transpiler.
* Add some UTs.
* Fix some API description.
```
c5aec2fe

add file check_op_desc.py and add interface to get default value. (#21530) · 9da7e6b4

由 liym27 提交于 12月 06, 2019

* add file check_op_desc.py and add interface to get default value. test=develop

* add test for c++ coverage rate. test=develop

* Correct typo. test=develop

9da7e6b4

04 12月, 2019 1 次提交
- P
  make config option DisableGlogInfo() able to mute all inference logs (#21318) · 122b37ce
  由 Pei Yang 提交于 12月 04, 2019
```
* make DisableGlogInfo able to mute all logs in inference. 
```
  122b37ce
03 12月, 2019 2 次提交
- J
  
  [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466) · 18a5d307
  由 Jacek Czaja 提交于 12月 03, 2019
  
  18a5d307
- Z
  NV jetson(nano, tx2, xavier) inference compile support (#21393) · c5f0293c
  由 Zhaolong Xing 提交于 12月 03, 2019
```
* add jeston compile support
test=develop

* refine the cmake
test=develop
```
  c5f0293c

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功