提交 · c918788ba9a94ea55409af2176a934ab15f0972b · PaddlePaddle / Paddle

24 11月, 2019 1 次提交

Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310) · c918788b

由 Yiqun Liu 提交于 11月 24, 2019

* Disable fusion_group pass for windows and mac. We will do some experiments on Linux first.
test=develop

* Print the subgraph when check failed.
test=develop

c918788b

22 11月, 2019 7 次提交

Fix the crash issue when scale or bias was null-pointer. (#21284) · 69dd5152

由 Yihua Xu 提交于 11月 22, 2019

* Fix the crash issue when scale or bias was null-pointer.

test=develop

* Add the error message for passing CI.

test=develop

69dd5152

Z

optimize lod_reset op to avoid data transform · 698b8b73
由 Zhang Ting 提交于 11月 22, 2019

698b8b73

add dequantize_abs_max op and modify lookup_table op (#20899) · f0b15184

由 Liufang Sang 提交于 11月 22, 2019

* add int8 kernel to lookup_table op and add dequantize op test=develop

* change paddle_enforce to paddle_enforce_eq test=develop

* change copyright and change some not suitable code test=develop

* remove debug log test=develop

* replace GetInputType with IndicateVarDataType test=develop

* fix EmptyGradMaker test=develop

* fix diff between cpu and gpu test=develop

* use memcopy when int8_t test=develop

f0b15184

support cvm_op run in gpu (#21300) · a6ce2306

由 hutuxian 提交于 11月 22, 2019

Previously, CVM OP was only able to run in CPU. This PR implements its GPU kernel.
What's more, we improve the UTs about CVM OP.

a6ce2306

Avoid the string as the key of map to improve the jit performance (#21292) · b085ecc2

由 Yihua Xu 提交于 11月 22, 2019

* Avoid the string as the key of map to improve the jit performance.

test=develop

* Use map to replace unordered_map.

test=develop

b085ecc2

C
Polish some PE code details (#21274) · 95250852
由 Chen Weihang 提交于 11月 22, 2019
```
* polish code details, test=develop

* futher polish hint msg, test=develop
```
95250852
Y
fix bug of issue #21259 (#21287) · 0fd1281e
由 Yi Liu 提交于 11月 22, 2019
```
pass the argument `allow_out_of_range` of one_hot op to c++ back end.
```
0fd1281e

21 11月, 2019 8 次提交

fix fs_client_param bug (#21212) · 319d2ba9

由 xujiaqi01 提交于 11月 21, 2019

* fix fs_client_param bug， user can set this config through fleet_desc_file or fleet config
* test=develop

319d2ba9

solve pslib core in stop worker (#21263) · 0d17c1b8

由 Thunderbrook 提交于 11月 21, 2019

* general table

* add sparse table
test=develop

* no cvm
test=develop

* add no_cvm
test=develop

* add note
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* add key of optimizer
test=develop

* solve pslib stop core
test=develop

* barrier
test=develop

* add notes
test=develop

0d17c1b8

Z

fix bug for python/paddle/fluid/tests/unittests/test_elementwise_mul_op.py, test=develop (#21289) · fa4d0550
由 zhongpu 提交于 11月 21, 2019

fa4d0550

open dygraph op test, test=develop (#19787) · c4ede95c

由 zhongpu 提交于 11月 21, 2019

* open dygraph op test, test=develop

* modify to_variable, test=develop

* modify input and output for dygraph, test=develop

* modify input and output for dygraph(fix bug), test=develop

* fix input processing of dygraph op test, test=develop

* fix bug, test=develop

* fix op test, test=develop

* fix forward bug for dygraph, test=develop

* fix mkldnn op test for forward, test=develop

* update nn.py for dygraph, test=develop

* fix crop_tensor_op, test=develop

* fix elementwise_mul_op, test=develop

* fix fill_op, test=develop

* fix some mkldnn op, test=develop

* open backward op test for dygraph, test=develop

* delete log, test=develop

* close backward op test for dygraph, test=develop

* fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop

* fix optest backward bug for dygraph, test=develop

* fix optest backward bug for dygraph, test=develop

* close backward op test for dygraph, test=develop

* close backward op test for dygraph, test=develop

* open dygraph op test, test=develop

* fix op test for dygraph, fix GradOpDescMaker, test=develop

* fix bug for linear_chain_crf_op.h, test=develop

* remove log, test=develop

* remove log, test=develop

* remove log for op_test.py, test=develop

* remove log for op_test.py, test=develop

* fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop

* fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop

* fix bug for test_increment_ngraph_op.py, test=develop

* fix lod for op test in dygraph, test=develop

* refactor op_test.py to reduce redundant code, test=develop

* fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop

* remove debug log, test=develop

* remove redundant code in base.py, test=develop

* fix some error in optest, test=develop

* fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop

* refactor op_test.py, test=develop

* remove redundant writing, test=develop

* fix error(get tensor of the grad variable), test=develop

* fix test_concat_mkldnn test_conv2d_mkldnn, test=develop

* fix optest.py for get tensor of LoDTensor, test=develop

* fix optest.py for get tensor of LoDTensor, test=develop

* fix optest.py for get tensor of LoDTensor, test=develop

* fix some redundant code, test=develop

* reslove conflict and rewrite paddle error message, test=develop

c4ede95c

K
fix mkldnn include. test=develop (#21247) · 3ab60f5b
由 Kaipeng Deng 提交于 11月 21, 2019
```
* fix mkldnn include. test=develop

* fix mkldnn inlcude. test=develop
```
3ab60f5b
X
fix fleet util bug (#21254) · eca66f31
由 xujiaqi01 提交于 11月 21, 2019
```
* fix fleet util bug in save paddle inference model
* test=develop
```
eca66f31
S

fix the bug of scatter_nd, test=develop (#21257) · 1f39a9f1
由 ShenLiang 提交于 11月 21, 2019

1f39a9f1
L
add input type and input data type check for Print_op test=develop (#21250) · 382cf5d7
由 lijianshe02 提交于 11月 21, 2019
```
* add input type and input data type check for Print_op test=develop
```
382cf5d7

20 11月, 2019 13 次提交

D

edit elementwise_mul doublegrad inplace (#21245) · 6fc3e8ec
由 danleifeng 提交于 11月 20, 2019

6fc3e8ec
T
change api document review (#21255) · 508b898d
由 tianshuo78520a 提交于 11月 20, 2019
```
* change api document review;test=develop;test=document_fix

* test=develop;test=document_fix
```
508b898d

support general embedding params (#21217) · 349e82d6

由 Thunderbrook 提交于 11月 20, 2019

* general table

* add sparse table
test=develop

* no cvm
test=develop

* add no_cvm
test=develop

* add note
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* add key of optimizer
test=develop

349e82d6

Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240) · 3cb6c0a0

由 liu zhengxi 提交于 11月 20, 2019

* fix the CAPI ZeroCopy shape error and reconstruct the output obtain

* use an anonymous namespace to cover the functor

* fix unit tests because of the output of typeid(T).name() is different from linux and windows, test=develop

3cb6c0a0

Add control flow api: case (#21114) · b0fc8227

由 liym27 提交于 11月 20, 2019

* add control flow API: case. test=develop

* delete 'raise TypeError' in _error_message() and return a string. test=develop

* polish API document. test=develop

b0fc8227

Enable generating code for a given subgraph. (#21126) · 6b1e1f0d

由 Yiqun Liu 提交于 11月 20, 2019

* Enable generating code for a given subgraph.

* Support sorting the subgraph.

* Remove the rearange of expressions because we use the sorted subgraph directly.

* Enable generating code for a subgraph which is composed of grad ops.

* Use expression information to check the accuracy in unittest.

* Separate load and store from computation expressions.
test=develop

* Improve the loading statements in generated codes.
test=develop

* Remove unused arguments from formal list.
test=develop

6b1e1f0d

Z
Fix topk compile failed on windows (#21243) · 3ff5cc2d
由 zhaoyuchen2018 提交于 11月 20, 2019
```
* Fix topk compile failed on windows
* Use explicit cast for assign data
```
3ff5cc2d
P
fix trt weight bug (#21231) · 2e2f92a5
由 Pei Yang 提交于 11月 20, 2019
```
added splitter "__" between weight name and suffix number to avoid conflicts.
```
2e2f92a5
J
support set model_filename and params_filename in post_training_quantization, test=develop (#21213) · 29b63f0a
由 juncaipeng 提交于 11月 20, 2019
```
* support set model_filename and params_filename in post_training_quantization, test=develop
```
29b63f0a
D
update worker_num for MPISymetricRoleMaker (#20798) · ccbdd7aa
由 Dong Daxiang 提交于 11月 20, 2019
```
test=develop
```
ccbdd7aa

optimize assign op to avoid copy data from GPU to GPU (#21181) · 01a96463

由 Zhang Ting 提交于 11月 20, 2019

* optimize assign op to avoid copy data from GPU to GPU, test=develop

* modified GetkernelTypeForVar and just avoid device transform, test=develop

01a96463

L

fix load checkpoint error in test_reader (#20924) · c91cb6c5
由 Liufang Sang 提交于 11月 20, 2019

c91cb6c5
Z
Change GCC version to be 8.2 in Dockerfile.GCC8 (#21222) · 925280b9
由 Zeng Jinle 提交于 11月 20, 2019
```
* make Docker to gcc 8.2, test=develop

* add -std=c11 to grpc.cmake, test=develop
```
925280b9

19 11月, 2019 8 次提交
- Z
  
  Determine whether to copy and link inference lib by ON_INFER (#20931) · c0dcb090
  由 zhouwei25 提交于 11月 19, 2019
  
  c0dcb090
- C
  Fix PADDLE_ENFORCE ci check bug (#21233) · 2dfcbb8b
  由 Chen Weihang 提交于 11月 19, 2019
```
* fix PADDLE_ENFORCE ci check bug, test=develop, test=document_fix

* fix PADDLE_ENFORCE match error, test=develop, test=document_fix
```
  2dfcbb8b
- K
  
  add custom_op include: imperative, error_codes.pb.h, mkldnn.h. test=develop (#21227) · 4747940b
  由 Kaipeng Deng 提交于 11月 19, 2019
  
  4747940b
- D
  
  extend elementwise broadcast function (#20957) · 0e7baabe
  由 danleifeng 提交于 11月 19, 2019
  
  0e7baabe
- A
  Fix GELU grad error (#21204) · d623e863
  由 Adam 提交于 11月 19, 2019
```
test=develop
```
  d623e863
- Z
  
  refine Tensor method, test=develop (#21031) · a152315b
  由 Zeng Jinle 提交于 11月 19, 2019
  
  a152315b
- Y
  fix data_norm op to avoid impractical normalization result test=develop (#21152) · b5d8ba83
  由 yaoxuefeng 提交于 11月 19, 2019
```
* fix auc drop first commit test=develop

* update datanorm op

* update datanorm with enforce test=develop

* update test=develop

* update format test=develop

* update format

* update format test=develop

* add unit test test=develop

* update unit test test=develop

* update format test=develop

* update format test=develop

* update API description test=develop

* update API description test=develop

* update format test=develop

* fix codes as comments test=develop

* fix description as comments test=develop

* fix description as comments test=develop

* update codes.. test=develop
```
  b5d8ba83
- Z
  Polish jit trace codes (#21218) · 67e88424
  由 Zeng Jinle 提交于 11月 19, 2019
```
* polish jit trace codes, test=develop

* polish codes again by removing var_id, test=develop
```
  67e88424
18 11月, 2019 3 次提交

Fix warn of gcc8 (#21205) · cdb3d279

由 Zeng Jinle 提交于 11月 18, 2019

* fix warnings oof gcc 8 compilation, test=develop

* fix boost::bad_get, test=develop

* refine PADDLE_ENFORCE, test=develop

cdb3d279

Z
fix bug when build openblas with a computer that has installed openblas... · 5d821578
由 zhouwei25 提交于 11月 18, 2019
```
fix bug when build openblas with a computer that has installed openblas before,test=develop (#21160)
```
5d821578

Better TensorRT support (#20858) · 330b173c

由 Jeng Bai-Cheng 提交于 11月 18, 2019

* Fix TensorRT detection bug

1. Add new search path for TensorRT at tensorrt.cmake
2. Add better debug message
3. Fix the bug of detection of TensorRT version

In NVIDIA official docker image, TensorRT headers are located at
`/usr/include/x86_64-linux-gnu` and TensorRT libraries are located
at `/usr/lib/x86_64-linux-gnu`, so using `-DTENSORRT_ROOT` will
fail to detect TensorRT.

There is no debug/warning message to tell developer that TensorRT
is failed to be detected.

In later version of TensorRT (e.g. v6), `NV_TENSORRT_MAJOR` is
defined at `NvInferVersion.h` instead of `NvInfer.h`, so add
compatibility fix.

* Fix TensorRT variables in CMake

1. Replace `${TENSORRT_ROOT}/include` with `${TENSORRT_INCLUDE_DIR}`
2. Replace `${TENSORRT_ROOT}/lib` with `${TENSORRT_LIBRARY}`

Manually type path may locate incorrect path of TensorRT. Use the
paths detected by system instead.

* Fix TensorRT library path

1. Add new variable - `${TENSORRT_LIBRARY_DIR}`
2. Fix TensorRT library path

inference_lib.cmake and setup.py.in need the path of TensorRT library
instead of the file of TensorRT library, so add new variable to fix it.

* Add more general search rule for TensoRT

Let system detect architecture instead of manually assign it, so
replace `x86_64-linux-gnu` with `${CMAKE_LIBRARY_ARCHITECTURE}`.

* Add more general search rule for TensorRT

Remove duplicate search rules for TensorRT libraries. Use
`${TENSORRT_LIBRARY_DIR}` to get full path of libnvinfer.so

test=develop

330b173c

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功