提交 · e2d849b98928828e7ab7d8566bcc761d896aa5b0 · PaddlePaddle / Paddle

10 12月, 2019 7 次提交

M
Dropout with seed (#21590) · e2d849b9
由 mapingshuo 提交于 12月 10, 2019
```
* add seed op
```
e2d849b9

MKL-DNN 1.0 Update (#20162) · e81f0228

由 Adam 提交于 12月 10, 2019

* MKLDNN v1.0 rebase to Paddle 1.6
test=develop

* Add hacky paddle::string::to_string() implementation

* vectorize<int64-t>() -> vectorize() cleanup
test=develop

* PADDLE_ENFORCE and void_cast fixes
test=develop

* Rebase changes
test=develop

* Cosmetics
test=develop

* Delete MKL from mkldnn.cmake
test=develop

* CMake debug commands
test=develop

* Delete MKLDNN_VERBOSE and rebase fixes
test=develop

* Rebase fixes
test=develop

* Temporarily disable int8 resnet101 vgg16 and vgg19 tests
test=develop

* Add libmkldnn.so.1 to python setup
test=develop

* Add libmkldnn.so.1 to inference_lib cmake after rebase
test=develop

* Post rebase fixes + FC int8 changes
test=develop

* Fix LRN NHWC
test=develop

* Fix NHWC conv3d
test=develop

* Windows build fix + next conv3d fix
test=develop

* Fix conv2d on AVX2 machines
test=develop

e81f0228

R
fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461) · 7f5d532a
由 rensilin 提交于 12月 10, 2019
```
* ZeroCopyTensor::mutable_data in the right device, test=develop

* add unittest for zerocopy, test=develop
```
7f5d532a
X
fix master patch when slot is dense (#21580) · f4041572
由 xujiaqi01 提交于 12月 10, 2019
```
* fix master patch when slot is dense
* test=develop
```
f4041572
X
fix code style of fleet_wrapper (#21639) · c05706fe
由 xujiaqi01 提交于 12月 10, 2019
```
* fix code style of fleet_wrapper
* test=develop
```
c05706fe
W
Mean gpu optimize (#21643) · 95b95a28
由 wangchaochaohu 提交于 12月 09, 2019
```
* accelerate mean op test=develop
```
95b95a28

Add op function generator for dygraph (#21569) · 48600d7f

由 Leo Chen 提交于 12月 10, 2019

* add op function generator, test=develop

* add unittest, test=develop

* follow comments, test=develop

* fix windows compilation problem, test=develop

48600d7f

09 12月, 2019 3 次提交

QAT Int8 document (#21360) · fbf9eca0

由 lidanqing 提交于 12月 09, 2019

* update benchmark for int8v2, QAT1, QAT2 accuracy and performance
test=document_fix

* change according to reviews
test=develop test=document_fix

* improve some descriptions and some models
test=develop test=document_fix

* update models benchmark data
test=develop test=document_fix

* update int8v2 and qat2 performance
test=develop test=document_fix

fbf9eca0

Refine VarBase init function (#21587) · 4f81d1bd

由 Leo Chen 提交于 12月 09, 2019

* refine init function, test=develop

* add tests, test=develop

* remove extern, which may cause symbol error in gcc-4.8, test=develop

4f81d1bd

dygraph_grad_maker supports varbase without grad_var (#21524) · 84b72671

由 Leo Chen 提交于 12月 09, 2019

* dygraph_grad_maker supports varbase without grad_var, test=develop

* fix compile, test=develop

* fix test_tracer, test=develop

* follow comments, test=develop

84b72671

07 12月, 2019 1 次提交
- X
  rm optimize_for in framework.proto (#21571) · 88960684
  由 xujiaqi01 提交于 12月 07, 2019
```
* remove optimize_for in framework.proto
* test=develop
```
  88960684
06 12月, 2019 8 次提交

Polish op registry codes (#21561) · 0f888836

由 Zeng Jinle 提交于 12月 06, 2019

* polish infer shape registry, test=develop

* modify some operators registry, test=develop

0f888836

A

Set lod_level of Out in compile time of sequence_pool_op (#21604) · 3d9dee57
由 Aurelius84 提交于 12月 06, 2019

3d9dee57
Z

refine dev_ctx.Wait() exception throw, test=develop (#21600) · 97e76cb9
由 Zeng Jinle 提交于 12月 06, 2019

97e76cb9

Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72

由 Huihuang Zheng 提交于 12月 06, 2019

Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests.

Fix bugs:

1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op.

2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var.

This PR also did some code clean up:
1. Print the var name when sgd_op catches shape error so that it is easier to debug
2. Fix a typo: dicta -> dict

1dcf6a72

H
Paddlebox Related to Framework (#21586) · c5aec2fe
由 hutuxian 提交于 12月 06, 2019
```
* Add a single_process_multi_thread transpiler.
* Add some UTs.
* Fix some API description.
```
c5aec2fe

add file check_op_desc.py and add interface to get default value. (#21530) · 9da7e6b4

由 liym27 提交于 12月 06, 2019

* add file check_op_desc.py and add interface to get default value. test=develop

* add test for c++ coverage rate. test=develop

* Correct typo. test=develop

9da7e6b4

J
- Fix to regression in performance of ResNet-50 training (#21588) · 8f5a93a0
由 Jacek Czaja 提交于 12月 06, 2019
```
test=develop
```
8f5a93a0

[MKL-DNN] Batch norm mkl-dnn NHWC support (#21553) · 9ce0e29d

由 Jacek Czaja 提交于 12月 06, 2019

* - BAtch norm mkl-dnn NHWC

test=develop

- compilation fix

test=develop

- UT fix

- cosmetics

test=develop

- Fix to Batch Norm MKL-DNN NHWC UT

test=develop

Conflicts:
	paddle/fluid/operators/batch_norm_op.h

* - Lint fixes

test=develop

9ce0e29d

05 12月, 2019 5 次提交

Z

add grad maker assert, test=develop (#21564) · 3a7caf48
由 Zeng Jinle 提交于 12月 05, 2019

3a7caf48
H
Refine a Warning Which Can Occur Not Only During Init (#21546) · b241c732
由 Huihuang Zheng 提交于 12月 05, 2019
```
As the title
```
b241c732
P

fix glog warning, test=develop (#21573) · 20d61414
由 Pei Yang 提交于 12月 05, 2019

20d61414
W
Add Branch to avoid CPU profiler warning print (#21556) · 932aca16
由 wangchaochaohu 提交于 12月 05, 2019
```
* fix profiler warning message in cpu profile mode test=develop
```
932aca16

Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e

由 Leo Chen 提交于 12月 05, 2019

* test=develop, fix docker with paddle nccl problem

* don't expose numerous Tensor.set(), test=develop

* fix condition, test=develop

* fix float16 bug, test=develop

* feed should be Tensor or np.array, not Variable or number, test=develop

* use forcecast to copy numpy slice to new array, test=develop

* remove float16-uint16 hacking, test=develop

* add variable method to varbase and refactor to_variable to support return varbase

* support kwargs in varbase constructor

* add VarBase constructor to support default python args

* refine varbase initial method

* reset branch

* fix ut for change VarBase error info to PaddleEnforce

* cherry is parameter change before

* overload isinstance to replace too many change of is_variable

* rm useless files

* rm useless code merged by git

* test=develop, fix some ut failed error

* test=develop, fix test_graph_wrapper

* add some tests, test=develop

* refine __getitem__, test=develop

* add tests, test=develop

* fix err_msg, test=develop

cdd46d7e

04 12月, 2019 7 次提交
- Y
  dygraph Embedding layer use lookuptable v2 (#21209) · cdba41af
  由 Youwei Song 提交于 12月 04, 2019
```
* dygraph Embedding layer use lookuptable v2
test=develop

* fix test_nce
test=develop
```
  cdba41af
- P
  make config option DisableGlogInfo() able to mute all inference logs (#21318) · 122b37ce
  由 Pei Yang 提交于 12月 04, 2019
```
* make DisableGlogInfo able to mute all logs in inference. 
```
  122b37ce
- W
  fill_constant_batch_size_like OP precious problem fix (#21337) · 4c9b3daf
  由 wangchaochaohu 提交于 12月 04, 2019
```
* fix fill_constant_batch_size_like_op precious problem  test=develop
```
  4c9b3daf
- Z
  add conv, depthwise_conv, pooling (#20966) · da7748c5
  由 Zhaolong Xing 提交于 12月 04, 2019
```
test=develop
```
  da7748c5
- W
  
  Fix dgc clip & rampup step, test=develop (#21491) · 768f9242
  由 WangXi 提交于 12月 04, 2019
  
  768f9242
- H
  add overrider for virtual function to avoid warning (#21503) · 0b75a0c1
  由 hong 提交于 12月 04, 2019
```
* add overrider for virtual function; test=develop

* fix layer.h OutputName bug; test=develop
```
  0b75a0c1
- A
  Add get_all_kernels api of registered data_type in pybind.cc (#21499) · 54382ce4
  由 Aurelius84 提交于 12月 04, 2019
```
* add _get_all_register_op_kernels api test=develop

* refine usage of check_op_register_type test=develop

* add import in core test=develop
```
  54382ce4
03 12月, 2019 9 次提交

Z

remove eval() calls in Eigen, test=develop (#21498) · 3662fb71
由 Zeng Jinle 提交于 12月 03, 2019

3662fb71
J

[MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466) · 18a5d307
由 Jacek Czaja 提交于 12月 03, 2019

18a5d307
G
Add ernie large c++ inference test (#21365) · 250a1921
由 GaoWei8 提交于 12月 03, 2019
```
* add ernie-large test
test=develop

* add ernie large c++ inference test
test=develop
```
250a1921

support SelectedRows in dygraph, test=develop (#21078) · 6ebf0f47

由 zhongpu 提交于 12月 03, 2019

* support SelectedRows in dygraph, test=develop

* fix bug of _grad_ivar interface, test=develop

* add optest for support seletedrows, test=develop

* fix bug for gradient_accumulator in GPU mode, test=develop

* fix error when Selectedrows addto LodTensor in sorted_gradient mdoe in dygraph, test=develop

* refine and simplify gradient accumulator code, test=develop

* add optest, test=develop

* add optest and simplify code, test=develop

* fix bug for test_imperative_selected_rows, test=develop

* add optest for Coverage, test=develop

* fix gradient interface and simplify code, test=develop

* update api for gradient, test=develop

* fix ShareDim's bug in DygraphExecutionContext class, test=develop

* add optest, test=develop

6ebf0f47

T
remove unused snappy/snappystream depends in distributed codes (#21484) · 70eb3976
由 Tao Luo 提交于 12月 03, 2019
```
test=develop
```
70eb3976

set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402) · 0bc8bdf7

由 lilong12 提交于 12月 03, 2019

* set dim[0] to -1 if dim[0] < 0 and remove assertion to runtime, test=develop

* modify ENFORCE message, test=develop

* add validation for x.shape[0] > 0, test=develop

* add ut, test=develop

0bc8bdf7

Z
NV jetson(nano, tx2, xavier) inference compile support (#21393) · c5f0293c
由 Zhaolong Xing 提交于 12月 03, 2019
```
* add jeston compile support
test=develop

* refine the cmake
test=develop
```
c5f0293c
Z
specify the auto growth allocator for inference. (#21448) · b39c0116
由 Zhaolong Xing 提交于 12月 03, 2019
```
test=develop
```
b39c0116
T

fix async mode, test=develop (#21367) · 0bddb951
由 tangwei12 提交于 12月 03, 2019

0bddb951

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功