Commits · d623e863c9348c19d9438f2f0b14f3b011877eda · 机器未来 / Paddle

19 Nov, 2019 · 2 commits

fix data_norm op to avoid impractical normalization result test=develop (#21152) · b5d8ba83

Committed by yaoxuefeng on Nov 19, 2019

* fix auc drop first commit test=develop

* update datanorm op

* update datanorm with enforce test=develop

* update test=develop

* update format test=develop

* update format

* update format test=develop

* add unit test test=develop

* update unit test test=develop

* update format test=develop

* update format test=develop

* update API description test=develop

* update API description test=develop

* update format test=develop

* fix codes as comments test=develop

* fix description as comments test=develop

* fix description as comments test=develop

* update codes.. test=develop

b5d8ba83

Polish jit trace codes (#21218) · 67e88424

Committed by Zeng Jinle on Nov 19, 2019

* polish jit trace codes, test=develop

* polish codes again by removing var_id, test=develop

67e88424

18 Nov, 2019 · 8 commits

Fix warn of gcc8 (#21205) · cdb3d279

Committed by Zeng Jinle on Nov 18, 2019

* fix warnings of gcc 8 compilation, test=develop

* fix boost::bad_get, test=develop

* refine PADDLE_ENFORCE, test=develop

cdb3d279

Better TensorRT support (#20858) · 330b173c

Committed by Jeng Bai-Cheng on Nov 18, 2019

* Fix TensorRT detection bug

1. Add new search path for TensorRT at tensorrt.cmake
2. Add better debug message
3. Fix the bug of detection of TensorRT version

In the official NVIDIA Docker image, TensorRT headers are located at
`/usr/include/x86_64-linux-gnu` and TensorRT libraries are located
at `/usr/lib/x86_64-linux-gnu`, so using `-DTENSORRT_ROOT` fails
to detect TensorRT.

There was no debug or warning message to tell the developer that
TensorRT was not detected.

In later versions of TensorRT (e.g. v6), `NV_TENSORRT_MAJOR` is
defined in `NvInferVersion.h` instead of `NvInfer.h`, so add a
compatibility fix.

* Fix TensorRT variables in CMake

1. Replace `${TENSORRT_ROOT}/include` with `${TENSORRT_INCLUDE_DIR}`
2. Replace `${TENSORRT_ROOT}/lib` with `${TENSORRT_LIBRARY}`

A manually typed path may point to the wrong TensorRT location. Use the
paths detected by the system instead.

* Fix TensorRT library path

1. Add new variable - `${TENSORRT_LIBRARY_DIR}`
2. Fix TensorRT library path

inference_lib.cmake and setup.py.in need the directory of the TensorRT
library rather than the library file itself, so add a new variable for it.

* Add more general search rule for TensorRT

Let the system detect the architecture instead of assigning it manually, so
replace `x86_64-linux-gnu` with `${CMAKE_LIBRARY_ARCHITECTURE}`.

* Add more general search rule for TensorRT

Remove duplicate search rules for TensorRT libraries. Use
`${TENSORRT_LIBRARY_DIR}` to get full path of libnvinfer.so

test=develop

330b173c
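
The version-detection fix described in the commit above (look for `NV_TENSORRT_MAJOR` in `NvInferVersion.h`, fall back to `NvInfer.h`) can be illustrated outside CMake. The following is only a minimal Python sketch of that lookup order, not the code from `tensorrt.cmake`; the helper name `detect_tensorrt_major` and the regular expression are assumptions made for illustration.

```python
import re
from pathlib import Path
from typing import Optional

# Matches lines such as "#define NV_TENSORRT_MAJOR 6".
_MAJOR_RE = re.compile(
    r"^\s*#\s*define\s+NV_TENSORRT_MAJOR\s+(\d+)", re.MULTILINE
)


def detect_tensorrt_major(include_dir: str) -> Optional[int]:
    """Return the TensorRT major version found under include_dir, or None.

    Hypothetical helper: NvInferVersion.h is tried first (newer releases),
    then NvInfer.h (older releases), following the commit description.
    """
    for header in ("NvInferVersion.h", "NvInfer.h"):
        path = Path(include_dir) / header
        if not path.is_file():
            continue  # header not shipped in this location
        match = _MAJOR_RE.search(path.read_text(errors="ignore"))
        if match:
            return int(match.group(1))
    # Nothing found: the caller should report this instead of failing silently.
    return None
```

Returning `None` rather than raising leaves it to the caller to print the kind of warning the commit says was previously missing.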

add store_true to use_paddlecloud argument in launch.py (#21168) · 3fe63d67

Committed by danleifeng on Nov 18, 2019

3fe63d67
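
For readers unfamiliar with `store_true`, here is a minimal sketch of how such a flag behaves in Python's standard `argparse`. It is not the actual `launch.py` code: the parser, help text and the `parse_launch_args` wrapper are assumptions for illustration, and only the argument name comes from the commit title.

```python
import argparse


def parse_launch_args(argv):
    """Parse a hypothetical launcher command line."""
    parser = argparse.ArgumentParser(description="illustrative launcher options")
    # With action="store_true" the flag takes no value:
    # present on the command line -> True, absent -> False.
    parser.add_argument(
        "--use_paddlecloud",
        action="store_true",
        help="illustrative boolean switch",
    )
    return parser.parse_args(argv)


if __name__ == "__main__":
    print(parse_launch_args(["--use_paddlecloud"]).use_paddlecloud)
    print(parse_launch_args([]).use_paddlecloud)
```

A general reason to prefer this form is that a `type=bool` argument turns any non-empty string, including `"False"`, into `True`.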

modified error message and API doc for channel_last supported Op (#21002) · 9cbe7bcc

Committed by Zhang Ting on Nov 18, 2019

* modified error message for conv and conv_transpose, test=develop

* modified doc of conv and conv_transpose op, test=develop

* modified the expression for error message, test=develop

* modified error message for group_norm op, test=develop

* modified detail of Attr(data_format) or Attr(data_layout)

* add ValueError in API doc for maxout op, test=develop

9cbe7bcc

Control flow API: switch_case (#21103) · 92475282

Committed by liym27 on Nov 18, 2019

* add API switch_case. test=develop

add Nest

* modify code according to reviews:
1. Attr(branch_index) supports 'uint8' and 'int64' besides 'int32'.
2. Remove useless code.
test=develop

* replace fluid.layers.data with fluid.data and polish API document. test=develop

92475282

Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118) · 56b5d147

Committed by guofei on Nov 18, 2019

56b5d147

Fix INF bug of softmax_cross_entropy_op (#21165) · 3c98ec90

Committed by WangXi on Nov 18, 2019

3c98ec90

fix dygraph trace bug, test=develop (#21193) · 0f30d3a2

Committed by Zeng Jinle on Nov 18, 2019

0f30d3a2

16 Nov, 2019 · 1 commit

Support more ops in post training quantization, test=develop (#21073) · 00b11a4a

Committed by juncaipeng on Nov 16, 2019

* Support more ops in post training quantization, and save the output scale in quantized op.
* Update docs in post training quantization and qat

00b11a4a

15 Nov, 2019 · 3 commits

fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052) · 23876de5

Committed by xujiaqi01 on Nov 15, 2019

* fix cache table bug
* add save_paddle_inference_model
* fix hdfs util bug
* test=develop

23876de5

add copy table (#21086) · 9e045170

Committed by xujiaqi01 on Nov 15, 2019

* copy some feasigns and corresponding embeddings from one sparse table to another
* copy all feasigns and corresponding embeddings from one sparse table to another
* copy all dense params from one table to another
* copy some local vars to other local vars

9e045170

Refine edit distance cn (#21121) · aeb88791

Committed by ruri on Nov 15, 2019

aeb88791

14 Nov, 2019 · 6 commits

fix elementwise_mod float point kernel. test=develop (#21183) · 98b59cb8

Committed by Kaipeng Deng on Nov 14, 2019

98b59cb8

disable reshape inplace in dygraph model; test=develop (#21157) · 835119c7

Committed by hong on Nov 14, 2019

835119c7

Add friendly dygraph trace API (#21091) · 5fdfbe34

Committed by Zeng Jinle on Nov 14, 2019

* friendly trace interface, test=develop

* refine TracedLayer, test=develop

* add some docs, test=develop

5fdfbe34

Fix warpctc in padding mode. (#21033) · cfdd1fc2

Committed by whs on Nov 14, 2019

cfdd1fc2

add input type and dtype check template, and update some APIs check (#21161) · 3976bbe2

Committed by Tao Luo on Nov 14, 2019

* add input type and dtype check template, and update some APIs check

* refine check template, and update some APIs check in nn.py

* update some APIs check in loss.py

test=develop

3976bbe2

QAT int8 accuracy little improvement (#21074) · 37e0e7a9

Committed by joanna.wozna.intel on Nov 14, 2019

test=develop

37e0e7a9

13 Nov, 2019 · 1 commit

Use 2 cards for hallreduce unit test. (#21085) · a5fc291f

Committed by gongweibao on Nov 13, 2019

use 2 cards test=develop

a5fc291f

12 Nov, 2019 · 7 commits

Split some APIs from nn.py to loss.py (#21117) · 8f659d43

Committed by Tao Luo on Nov 12, 2019

* Split some APIs from nn.py to loss.py

test=develop

* fix test_detection unit-test

test=develop

8f659d43

Add Asypadding for conv fusion. (#21041) · 4a544762

Committed by zhaoyuchen2018 on Nov 12, 2019

* Add Asypadding for conv fusion.

test=develop

reference: pr/20042

* Fix eigen build link error

* Change back file mode

* Use math function & add more checks.

4a544762

Fix dgc buffer illegal & reuse velocity (#21012) · de5d3ff6

Committed by WangXi on Nov 12, 2019

de5d3ff6

modify the implementation of save_persistables and save_inference_model for... · 53148e06

Committed by lilong12 on Nov 12, 2019

modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802)

* modify the implementation of save_persistables and save_inference_model functions for fleet collective, test=develop

* add ut, test=develop

53148e06

fix distiller typo, test=develop (#21070) · bd8b0eba

Committed by Bai Yifan on Nov 12, 2019

bd8b0eba

fix instance norm (#21042) · f62a9291

Committed by ceci3 on Nov 12, 2019

* fix instance norm

* update unit test, test=develop

f62a9291

fix the computation for dx (grad for x) for prelu operation. (#20949) · e249d9a3

Committed by lilong12 on Nov 12, 2019

* set the default value of alpha for prelu to 0.25, test=develop

* add the call to __syncthreads(), test=develop

* fix the implementation of cpu prelu, test=develop

* repair the implementation of element mode prelu, test=develop

* modify test_prelu_op.py, test=develop

e249d9a3
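
As background for the `dx` fix above, the standard PReLU definition is `y = x` for `x > 0` and `y = alpha * x` otherwise, so the gradient with respect to `x` is `dout` on the positive side and `alpha * dout` elsewhere. The sketch below shows that rule in plain Python for a single shared `alpha`; it is an illustration of the textbook formula, not the Paddle CPU or CUDA kernel, and the function name is hypothetical. The default of 0.25 is the value mentioned in the commit.

```python
from typing import List, Tuple


def prelu_backward(
    x: List[float], dout: List[float], alpha: float = 0.25
) -> Tuple[List[float], float]:
    """Return (dx, dalpha) for PReLU with one alpha shared by all elements.

    Hypothetical reference helper based on the textbook definition.
    """
    if len(x) != len(dout):
        raise ValueError("x and dout must have the same length")
    dx = []
    dalpha = 0.0
    for xi, gi in zip(x, dout):
        if xi > 0:
            dx.append(gi)          # identity branch: gradient passes through
        else:
            dx.append(alpha * gi)  # scaled branch: gradient is scaled by alpha
            dalpha += xi * gi      # only the scaled branch depends on alpha
    return dx, dalpha
```

A reference function of this kind is mainly useful for checking an optimized kernel against known values in a unit test.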

11 Nov, 2019 · 3 commits

Add basic Python Cond Layer (#21050) · e64d55f0

Committed by Huihuang Zheng on Nov 11, 2019

e64d55f0

Disable cudnn_conv in unit tests. (#21080) · dcf371b6

Committed by Huihuang Zheng on Nov 11, 2019

dcf371b6

Add the check of lod_level between compile-time and runtime. (#20961) · 35f17ae2

Committed by Yiqun Liu on Nov 11, 2019

* Add the check of lod_level between compile-time and runtime.
test=develop

* Fix bug in check_compile_vs_runtime.
test=develop

* Fix the check of output when it is dispensable or intermediate.
test=develop

* Share lod of x to out in match_matrix_tensor op in compile-time.

* Implement GetLoDLevel in InferShapeContext.

* Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op.
test=develop

* Enable check_compile_vs_runtime in test_match_matrix_tensor.

* Add the implementation of SetLoDLevel in InferShapeContext.

* Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead.

* Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead.

* Refine some ops and unittests.
test=develop

* Fix a typo.
test=develop

* Remove the check of var type, and change int to int32_t.
test=develop

* Add unittest for Get/SetLoDLevel.
test=develop

35f17ae2

08 Nov, 2019 · 5 commits

Split some APIs from nn.py to rnn.py and sequence_lod.py (#21030) · 78cc1ca6

Committed by Tao Luo on Nov 08, 2019

* split some APIs from nn.py to rnn.py

* split some APIs from nn.py to sequence_lod.py

test=develop

* fix unit-test bug

test=develop

* fix test_layers unit-test bug

test=develop

78cc1ca6

Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835

Committed by joanna.wozna.intel on Nov 08, 2019

* Add transpose2 INT8 for mkl-dnn

test=develop

* Fix test_transpose_int8_mkldnn

test=develop

* Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"

This reverts commit 34011bdb, reversing
changes made to 2ce6473f.

* Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""

This reverts commit 23754dd7.

* Add template to TransposeMKLDNNHandler

test=develop

* Resolve conflict

test=develop

* Restore get_size and refactor

test=develop

77c20835

delete test resnet50 in post train quantization to avoid timeout error, test=develop (#21081) · 2c07727f

Committed by juncaipeng on Nov 08, 2019

2c07727f

add op locality_aware_nms, test=develop (#20976) · 06063b70

Committed by LielinJiang on Nov 08, 2019

06063b70

fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation,... · 26a6e27a

Committed by liym27 on Nov 08, 2019

fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)

* fix bug in pool/conv/conv_transpose:
1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation;
2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py;
3. fix bug of the computation process in function conv2dtranspose_forward_naive.
test=develop

* change test to make the data of different dimensions different. test=develop

26a6e27a
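
The first item of the commit above (use `stride[i]`, not `stride[0]`) matters whenever the strides differ between dimensions. Below is a minimal sketch of the usual "SAME" padding rule computed per dimension. It follows the common convention (output size is the ceiling of input size over stride) and is not claimed to match `UpdatePaddingAndDilation` line for line; the function name is hypothetical.

```python
from typing import List, Sequence, Tuple


def same_padding(
    data_shape: Sequence[int], ksize: Sequence[int], strides: Sequence[int]
) -> List[Tuple[int, int]]:
    """Return (pad_before, pad_after) per spatial dimension for SAME padding."""
    paddings = []
    for i in range(len(data_shape)):
        # Each dimension must use its own stride: strides[i], not strides[0].
        out_size = (data_shape[i] + strides[i] - 1) // strides[i]
        pad_sum = max((out_size - 1) * strides[i] + ksize[i] - data_shape[i], 0)
        pad_before = pad_sum // 2
        pad_after = pad_sum - pad_before  # any odd remainder goes to the end
        paddings.append((pad_before, pad_after))
    return paddings
```

With an input of shape `(5, 8)`, a `3x3` kernel and strides `(1, 2)`, the second dimension needs one padding element in total. Using the first stride for both dimensions would give two, which is why a test whose dimensions all share the same values cannot catch this kind of bug.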

07 Nov, 2019 · 2 commits

Add support for asymmetric padding in MKLDNN pool, conv and conv_transpose (#21062) · 3fda695b

Committed by Adam on Nov 07, 2019

* Add asymmetric padding support for mkldnn pooling
test=develop

* Add asymmetric padding support for mkldnn conv
test=develop

* Add asymmetric padding support for mkldnn conv_transpose
test=develop

3fda695b

Add select_input_op and select_output_op (#21016) · 1957192f

Committed by Huihuang Zheng on Nov 07, 2019

These ops are useful in control flow.

1957192f

06 Nov, 2019 · 2 commits

fix uniform random (#21009) · 72e0969b

Committed by hong on Nov 06, 2019

* fix uniform random; test=develop

* add uniform random test; test=develop

72e0969b

Remove fuse_with_relu argument from batch_norm constructor (#21028) · 226bc22a

Committed by Wojciech Uss on Nov 06, 2019

test=develop

226bc22a

机器未来 / Paddle (in sync with the fork's source project)
