Commits · c0aa13672edf484b280988f3400636b1a3aff050 · PaddlePaddle / Paddle

27 Nov, 2019 3 commits

Support data_norm gpu kernel (#21325) · 47a82e38

Authored by hutuxian on Nov 27, 2019

* support data_norm_op run in CUDA
* add two parameters sync_stats & summary_decay_rate
* add UT

47a82e38

Support numpy bridge (enabled by default in dygraph mode) (#20983) · d5ff79e5

Authored by Youwei Song on Nov 27, 2019

* add numpy bridge

* fix template compile

* add unittest, add default
test=develop

* fix unittest
test=develop

* fix unittest
test=develop

* zero_copy=True for to_variable,
test=develop

* bug fix
test=develop

* disable deprecated NumPy API
test=develop

* use better design of NumpyAllocator
test=develop

* fix Py_None check
test=develop

* reset c++ tracer when jump out dygraph guard
test=develop

* refine PADDLE_ENFORCE_xx format
test=develop

* bug fix of tracer switch
test=develop

* update decref
test=develop

d5ff79e5

INT8 Fully-connected (#17641) · 5d7d5482

Authored by Michał Gallus on Nov 27, 2019

* Implement Int8 FC

* Integrate FC into INT8v2

test=develop

* int8 FC: transpose weights before computing scales

test=develop

* Add support for activation_type string in FC

test=develop

* Disable MKL-DNN's FC in VGG16 and 19

test=develop

* Disable FC quantization when mkldnn FC is disabled

test=develop

* Solve PADDLE_ENFORCES in FC int8

* Fix Paddle enforces and remove const cast

test=develop

* Fix style changes

test=develop

* Fix quantizer_tester test and add fc quantization

test=develop

* Fix FC test fail on CUDA

* Remove unnecessary log from quantize placement pass

test=develop

* Add Thread ID to FC hash key

test=develop

* Add comments to MKL-DNN FC Kernel

test=develop

* Refactor quantizer

test=develop

* Fix linter issues

test=develop

* Fix crash in slim googlenet

test=develop

* Fix PADDLE_ENFORCE messages

test=develop

5d7d5482

26 Nov, 2019 6 commits

add the framework support for distfc (#21197) · 41d13209

Authored by lilong12 on Nov 26, 2019

* add the framework support for distfc and ut, test=develop
* fix the implementation of shard_index_op, test=develop

41d13209

change download log format (#21290) · a214a308

Authored by hong on Nov 26, 2019

* change download log format; test=develop

* add unittest for data download; test=develop

* remove cache before download; test=develop

a214a308

Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8

Authored by GaoWei8 on Nov 26, 2019

* Add fc padding to solve mkl performance
test=develop

* fix gpu pass and error information
test=develop

* fix fc_fuse_pass_test
test=develop

* fix error information
test=develop

* fix error information
test=develop

* fix name and add fc op padding test
test=develop

* fix attributes
test=develop

* optimize fc padding
test=develop

* fix test
test=develop

234060f8

reduce interp op input size to pass CI, test=develop (#21341) · 6cfcbe05

Authored by ruri on Nov 26, 2019

6cfcbe05
[MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207) · f4cf028a

Authored by Jacek Czaja on Nov 26, 2019

f4cf028a

Refactor MKL-DNN ElementwiseMul (#21061) · ed9ceb9f

Authored by Michał Gallus on Nov 26, 2019

* Refactor MKL-DNN ElementwiseMul

remove manual fallback, remove format attrs
test=develop

* Refine PADDLE_ENFORCEs in eltwise_mul_op.h

test=develop

* Make ElementwiseMulOp inherit from ElementwiseOp

* Change type of simd_width to int

test=develop

* Remove Constructor extensions in ElementwiseOp and ElementwiseMulOp

test=develop

* Restore attributes

test=develop

* Fix test coverage for mkldnn eltwise mul

test=develop

* Conform to new is_run_common_broadcast API

test=develop

* Add UT for AreDimsAndFormatCorrect

test=develop

ed9ceb9f

25 Nov, 2019 3 commits

Improve argsort performance. (#21267) · 08c19c58

Authored by zhaoyuchen2018 on Nov 25, 2019

* Improve argsort performance.

- On 200,000 elements on a V100, argsort is ~190x faster:
before optimization: 0.53s
after optimization: 0.0027s

- Add fp16 support

* Refine error message
* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

08c19c58
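For readers unfamiliar with the operation, argsort returns the indices that would sort its input. The snippet below is a plain-Python illustration only, not the CUDA kernel from this commit; the helper name `argsort_sketch` is hypothetical. It also computes the ratio implied by the two timings quoted in the commit message.

```python
def argsort_sketch(values):
    """Return the indices that would sort `values` in ascending order.

    Hypothetical helper for illustration; it is not Paddle's implementation.
    """
    # sorted() is stable, so equal elements keep their original relative order.
    return sorted(range(len(values)), key=values.__getitem__)


order = argsort_sketch([0.3, 0.1, 0.2])

# Ratio of the timings quoted in the commit message (0.53s vs 0.0027s).
speedup = 0.53 / 0.0027
```

The computed ratio is roughly 196, which is in line with the "~190x" figure given in the message.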

Fix dgc accuracy by mv regularization to local (#21278) · 8ac7687e

Authored by WangXi on Nov 25, 2019

8ac7687e
Add global value getter setter (#21285) · b9f8ae84

Authored by Zeng Jinle on Nov 25, 2019
```
* add global value getter setter, test=develop

* fix error messages, test=develop
```
b9f8ae84

24 Nov, 2019 3 commits

Refactor fetch handler (#21264) · 691ced87

Authored by Dong Daxiang on Nov 24, 2019

* fix fetch handler problem and refactor
When a user defines a FetchHandler class, they should initialize the handler
with a variable dict. Each key of the variable dict is a user-defined name,
and each value is a Variable generated from the Python API.

For each fetch, the user should implement the handler function, in which
fetched_result_dict is available, and can access the fetched values
with the user-defined keys.

691ced87
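The pattern described in the fetch handler commit message can be sketched in plain Python. This is a minimal sketch under stated assumptions, not Paddle's actual class: `SketchFetchHandler`, `CollectLoss`, `run_fetch`, and the fake variable store are all hypothetical stand-ins, and strings are used in place of real Variable objects.

```python
class SketchFetchHandler:
    """Hypothetical stand-in for a fetch handler base class (not Paddle's API)."""

    def __init__(self, var_dict):
        # var_dict maps a user-defined name to a variable object.
        if not var_dict:
            raise ValueError("var_dict must map user-defined names to variables")
        self.var_dict = dict(var_dict)

    def handler(self, fetched_result_dict):
        # Subclasses override this; results are keyed by the user-defined names.
        raise NotImplementedError


class CollectLoss(SketchFetchHandler):
    """Example subclass that records the value fetched under the key 'loss'."""

    def __init__(self, var_dict):
        super().__init__(var_dict)
        self.seen = []

    def handler(self, fetched_result_dict):
        self.seen.append(fetched_result_dict["loss"])


def run_fetch(handler, fetch_value):
    # fetch_value is a hypothetical callable standing in for the runtime lookup
    # that turns a variable into its current value.
    results = {name: fetch_value(var) for name, var in handler.var_dict.items()}
    handler.handler(results)
    return results


# Strings stand in for Variables; the dict stands in for the runtime scope.
fake_store = {"loss_var_0": 0.25}
collector = CollectLoss({"loss": "loss_var_0"})
run_fetch(collector, fake_store.get)
```

The point of the design is that the handler never needs to know internal variable names: it reads results through the same keys the user chose when building the variable dict.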

adapt test_collective_base.py for only two GPU cards available. (#21307) · f1b09ba3

Authored by Yi Liu on Nov 24, 2019
```
* adapt test_collective_base.py for only two GPU cards available.
test=develop

* fix bug of issue #21259
test=develop
```
f1b09ba3
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597) · ed2a1852

Authored by gongweibao on Nov 24, 2019

ed2a1852

22 Nov, 2019 3 commits

add dequantize_abs_max op and modify lookup_table op (#20899) · f0b15184

Authored by Liufang Sang on Nov 22, 2019

* add int8 kernel to lookup_table op and add dequantize op test=develop

* change paddle_enforce to paddle_enforce_eq test=develop

* change copyright and change some not suitable code test=develop

* remove debug log test=develop

* replace GetInputType with IndicateVarDataType test=develop

* fix EmptyGradMaker test=develop

* fix diff between cpu and gpu test=develop

* use memcopy when int8_t test=develop

f0b15184

support cvm_op run in gpu (#21300) · a6ce2306

Authored by hutuxian on Nov 22, 2019

Previously, the CVM op could only run on CPU. This PR implements its GPU kernel
and also improves the unit tests for the CVM op.

a6ce2306

Polish some PE code details (#21274) · 95250852

Authored by Chen Weihang on Nov 22, 2019
```
* polish code details, test=develop

* further polish hint msg, test=develop
```
95250852

21 Nov, 2019 4 commits

fix fs_client_param bug (#21212) · 319d2ba9

Authored by xujiaqi01 on Nov 21, 2019

* fix fs_client_param bug; users can set this config through fleet_desc_file or fleet config
* test=develop

319d2ba9

fix bug for python/paddle/fluid/tests/unittests/test_elementwise_mul_op.py, test=develop (#21289) · fa4d0550

Authored by zhongpu on Nov 21, 2019

fa4d0550

open dygraph op test, test=develop (#19787) · c4ede95c

Authored by zhongpu on Nov 21, 2019

* open dygraph op test, test=develop

* modify to_variable, test=develop

* modify input and output for dygraph, test=develop

* modify input and output for dygraph(fix bug), test=develop

* fix input processing of dygraph op test, test=develop

* fix bug, test=develop

* fix op test, test=develop

* fix forward bug for dygraph, test=develop

* fix mkldnn op test for forward, test=develop

* update nn.py for dygraph, test=develop

* fix crop_tensor_op, test=develop

* fix elementwise_mul_op, test=develop

* fix fill_op, test=develop

* fix some mkldnn op, test=develop

* open backward op test for dygraph, test=develop

* delete log, test=develop

* close backward op test for dygraph, test=develop

* fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop

* fix optest backward bug for dygraph, test=develop

* fix optest backward bug for dygraph, test=develop

* close backward op test for dygraph, test=develop

* close backward op test for dygraph, test=develop

* open dygraph op test, test=develop

* fix op test for dygraph, fix GradOpDescMaker, test=develop

* fix bug for linear_chain_crf_op.h, test=develop

* remove log, test=develop

* remove log, test=develop

* remove log for op_test.py, test=develop

* remove log for op_test.py, test=develop

* fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop

* fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop

* fix bug for test_increment_ngraph_op.py, test=develop

* fix lod for op test in dygraph, test=develop

* refactor op_test.py to reduce redundant code, test=develop

* fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop

* remove debug log, test=develop

* remove redundant code in base.py, test=develop

* fix some error in optest, test=develop

* fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop

* refactor op_test.py, test=develop

* remove redundant writing, test=develop

* fix error(get tensor of the grad variable), test=develop

* fix test_concat_mkldnn test_conv2d_mkldnn, test=develop

* fix optest.py for get tensor of LoDTensor, test=develop

* fix optest.py for get tensor of LoDTensor, test=develop

* fix optest.py for get tensor of LoDTensor, test=develop

* fix some redundant code, test=develop

* resolve conflict and rewrite paddle error message, test=develop

c4ede95c

add input type and input data type check for Print_op test=develop (#21250) · 382cf5d7

Authored by lijianshe02 on Nov 21, 2019
```
* add input type and input data type check for Print_op test=develop
```
382cf5d7

20 Nov, 2019 1 commit

Add control flow api: case (#21114) · b0fc8227

Authored by liym27 on Nov 20, 2019

* add control flow API: case. test=develop

* delete 'raise TypeError' in _error_message() and return a string. test=develop

* polish API document. test=develop

b0fc8227
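The `case` control flow API named in this commit selects a branch by predicate. A rough plain-Python analogy follows, assuming first-match-wins semantics with an explicit fallback, similar to an if/elif/else chain. The helper `case_sketch` is hypothetical and is not Paddle's implementation; plain booleans are used here where the real API would take tensor predicates.

```python
def case_sketch(pred_fn_pairs, default):
    """Run the callable paired with the first true predicate.

    Hypothetical illustration of predicate-based branching; `default` is
    called when no predicate is true.
    """
    for pred, fn in pred_fn_pairs:
        if pred:
            return fn()
    return default()


x = 0.3
chosen = case_sketch(
    [(x < 0.1, lambda: "small"), (x < 0.5, lambda: "medium")],
    default=lambda: "large",
)
fallback = case_sketch([(False, lambda: "never")], default=lambda: "large")
```

Passing callables rather than values means only the selected branch is evaluated.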

19 Nov, 2019 2 commits

extend elementwise broadcast function (#20957) · 0e7baabe

Authored by danleifeng on Nov 19, 2019

0e7baabe

fix data_norm op to avoid impractical normalization result test=develop (#21152) · b5d8ba83

Authored by yaoxuefeng on Nov 19, 2019

* fix auc drop first commit test=develop

* update datanorm op

* update datanorm with enforce test=develop

* update test=develop

* update format test=develop

* update format

* update format test=develop

* add unit test test=develop

* update unit test test=develop

* update format test=develop

* update format test=develop

* update API description test=develop

* update API description test=develop

* update format test=develop

* fix codes as comments test=develop

* fix description as comments test=develop

* fix description as comments test=develop

* update codes.. test=develop

b5d8ba83

18 Nov, 2019 6 commits
- Fix warn of gcc8 (#21205) · cdb3d279
  Authored by Zeng Jinle on Nov 18, 2019
```
* fix warnings of gcc 8 compilation, test=develop

* fix boost::bad_get, test=develop

* refine PADDLE_ENFORCE, test=develop
```
  cdb3d279
- add store_true to use_paddlecloud argument in launch.py (#21168) · 3fe63d67
  Authored by danleifeng on Nov 18, 2019
  
  3fe63d67
- Control flow API: switch_case (#21103) · 92475282
  Authored by liym27 on Nov 18, 2019
```
* add API switch_case. test=develop

add Nest

* modify code according to reviews:
1. Attr(branch_index) supports 'uint8' and 'int64' besides 'int32'.
2. remove useless code.
test=develop

* replace fluid.layers.data with fluid.data and polish API document. test=develop
```
  92475282
- Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118) · 56b5d147
  Authored by guofei on Nov 18, 2019
  
  56b5d147
- Fix INF bug of softmax_cross_entropy_op (#21165) · 3c98ec90
  Authored by WangXi on Nov 18, 2019
  
  3c98ec90
- fix dygraph trace bug, test=develop (#21193) · 0f30d3a2
  Authored by Zeng Jinle on Nov 18, 2019
  
  0f30d3a2
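The switch_case commit (#21103) listed above adds index-based branching. As a loose plain-Python analogy only, assuming a mapping from integer index to callable plus an explicit fallback (the helper `switch_case_sketch` is hypothetical and does not reproduce Paddle's signature or its tensor-typed `branch_index`):

```python
def switch_case_sketch(branch_index, branch_fns, default):
    """Call the branch registered for `branch_index`, or `default` if none.

    Hypothetical illustration; a plain Python int stands in for the integer
    index value used by the real operator.
    """
    if not isinstance(branch_index, int):
        raise TypeError("branch_index must be an integer")
    fn = branch_fns.get(branch_index, default)
    return fn()


branches = {0: lambda: "zero", 1: lambda: "one"}
picked = switch_case_sketch(1, branches, default=lambda: "other")
missing = switch_case_sketch(7, branches, default=lambda: "other")
```

Compared with a chain of predicates, an index lookup keeps the branch table explicit and makes the fallback case easy to see.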
15 Nov, 2019 1 commit
- fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052) · 23876de5
  Authored by xujiaqi01 on Nov 15, 2019
```
* fix cache table bug
* add save_paddle_inference_model
* fix hdfs util bug
* test=develop
```
  23876de5
14 Nov, 2019 4 commits
- fix elementwise_mod float point kernel. test=develop (#21183) · 98b59cb8
  Authored by Kaipeng Deng on Nov 14, 2019
  
  98b59cb8
- Add friendly dygraph trace API (#21091) · 5fdfbe34
  Authored by Zeng Jinle on Nov 14, 2019
```
* friendly trace interface, test=develop

* refine TracedLayer, test=develop

* add some docs, test=develop
```
  5fdfbe34
- Fix warpctc in padding mode. (#21033) · cfdd1fc2
  Authored by whs on Nov 14, 2019
  
  cfdd1fc2
- add input type and dtype check template, and update some APIs check (#21161) · 3976bbe2
  Authored by Tao Luo on Nov 14, 2019
```
* add input type and dtype check template, and update some APIs check

* refine check template, and update some APIs check in nn.py

* update some APIs check in loss.py

test=develop
```
  3976bbe2
13 Nov, 2019 1 commit
- Use 2 cards for hallreduce unit test. (#21085) · a5fc291f
  Authored by gongweibao on Nov 13, 2019
```
use 2 cards test=develop
```
  a5fc291f
12 Nov, 2019 3 commits

Add Asypadding for conv fusion. (#21041) · 4a544762

Authored by zhaoyuchen2018 on Nov 12, 2019

* Add Asypadding for conv fusion.

test=develop

reference: pr/20042

* Fix eigen build link error

* Change back file mode

* Use math function & add more checks.

4a544762

Fix dgc buffer illegal & reuse velocity (#21012) · de5d3ff6

Authored by WangXi on Nov 12, 2019

de5d3ff6

modify the implementation of save_persistables and save_inference_model for... · 53148e06

Authored by lilong12 on Nov 12, 2019

modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802)

* modify the implementation of  save_persistables and save_inference_model functions for fleet collective, test=develop

* add ut, test=develop

53148e06

PaddlePaddle / Paddle synced successfully about 1 year ago
