提交 · 857cd9f8516b8a52f580c0388662f92974a31d33 · BaiXuePrincess / Paddle

04 12月, 2019 2 次提交
- P
  make config option DisableGlogInfo() able to mute all inference logs (#21544) · 857cd9f8
  由 Pei Yang 提交于 12月 04, 2019
```
make config option DisableGlogInfo() able to mute all inference logs
```
  857cd9f8
- Z
  [cherry-pick] NV JETSON support and auto_growth strategy for inference. (#21500) · 20a09375
  由 Zhaolong Xing 提交于 12月 04, 2019
```
* ADD NV JETSON SUPPORT
test=release/1.6

* CHERRY_PICK: specify the auto growth allocator for inference.
test=release/1.6
```
  20a09375
03 12月, 2019 1 次提交
- B
  
  cherry-pick LRN and Pool2d (FWD) NHWC support (#21476) · ccb508dc
  由 bingyanghuang 提交于 12月 03, 2019
  
  ccb508dc
02 12月, 2019 1 次提交

[cherry-pick] Improve topk performance. (#21087) (#21441) · 5dbe9e59

由 zhaoyuchen2018 提交于 12月 02, 2019

* Improve topk performance.

give 200000 data to compute topk,
before opt: cost 1s
after opt: cost 0.0028s.

* Refine return value.
* Add cuda util funtions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

5dbe9e59

25 11月, 2019 1 次提交

cherry-pick (#21201) to release/1.6 (#21306) · a91b8014

由 liuwei1031 提交于 11月 25, 2019

cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows

a91b8014

24 11月, 2019 1 次提交
- C
  Further simplify the C++ error info stack (#21093) (#21304) · 9110c896
  由 Chen Weihang 提交于 11月 24, 2019
```
* simplify C++ error stack by rewrite Place, test=develop

* polish assignment overload func, test=develop
```
  9110c896
21 11月, 2019 1 次提交

Cherry-pick error type support for release1.6 (#21294) · 974b8a83

由 Chen Weihang 提交于 11月 21, 2019

* delete paddle infershape enforce marco (#20832)

* Polish and arrange code in enforce.h (#20901)

* Enrich the type of error and declare the error type interfaces (#21024)

* Enrich the type of error and declare the error type interfaces, test=develop

* adjust tests to adapt new form, test=develop

* add inference deps with error_codes.pb.h, test=develop

* restore stack iter start pos, test=develop

* polish code based review comments, test=develop

* Add dependency for error_codes.proto (#21084)

* fix activation_functions deps, test=develop, test=document_fix

* add error_codes_proto deps, test=develop, test=document_fix

* try delete enforce.h, test=develop, test=document_fix

* change cuda enforce & add example (#21142)
test=release/1.6

974b8a83

07 11月, 2019 1 次提交

[cherry-pick] Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21072) · e8890031

由 Adam 提交于 11月 07, 2019

* Add asymetric padding support for mkldnn pooling
test=develop

* Add asymetric padding support for mkldnn conv
test=develop

* Add asymetric padding support for mkldnn conv_transpose
test=develop

e8890031

30 10月, 2019 1 次提交
- L
  [cherry-pick] Add support to gcc8, add docker env (#20892) · 6fb04e8a
  由 liu zhengxi 提交于 10月 30, 2019
```
* add support to gcc8, add docker env
* remove the warning issue
```
  6fb04e8a
25 10月, 2019 1 次提交
- C
  
  Make formatted ENFORCE stack adapt to more situations (#20826) (#20828) · 4841474e
  由 Chen Weihang 提交于 10月 25, 2019
  
  4841474e
22 10月, 2019 1 次提交
- A
  Minor MKL-DNN conv int8 performance fixes (#20768) · 4f6b43a0
  由 Adam 提交于 10月 22, 2019
```
test=develop
```
  4f6b43a0
21 10月, 2019 1 次提交
- W
  
  [Cherry-pick 1.6] Fix DGC test and DGC nan bug (#20708) · 2378aa8a
  由 WangXi 提交于 10月 21, 2019
  
  2378aa8a
20 10月, 2019 1 次提交
- 1
  test=develop, add communicator_is_sgd_optimizer flag (#20677) (#20734) · 5baf1b23
  由 123malin 提交于 10月 20, 2019
```
* test=develop, communicator_is_sgd_optimizer flags
```
  5baf1b23
18 10月, 2019 2 次提交

B

cherry-pick PR#20640 to release 1.6, test=release/1.6 (#20706) · 4cf499c0
由 bingyanghuang 提交于 10月 18, 2019

4cf499c0

MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241) (#20693) · 2099618d

由 Michał Gallus 提交于 10月 18, 2019

test=release/1.6

* - Flushing mkl-dnn cache

test=develop

- Disabled clearing cache for LoadModel

- Added clearing of mkl-dnn cache when Executor is created

test=develop

- Do not clear for GPU places

test=develop

- compilation fix

test=develop

* - Moved clearing of mkl-dnn cache in destructor of executor

test=develop

* - Compilation fix

test=develop

- Reverted conditional clearing of mkl-dnn cache in Executors's
  destructor

test=develop

- compilation fix

2099618d

16 10月, 2019 1 次提交
- Z
  
  make_conv_workspace_size_configurable, test=release/1.6 (#20664) · c054adf2
  由 Zeng Jinle 提交于 10月 16, 2019
  
  c054adf2
14 10月, 2019 1 次提交
- 6
  
  support convert tensor to cudf depends on dlpack test=release/1.6 (#20611) · 5da8db61
  由 633WHU 提交于 10月 14, 2019
  
  5da8db61
13 10月, 2019 1 次提交

fix cpu machine run paddle in gpu lib test=release/1.6 (#20548) · b0b0d628

由 Wilber 提交于 10月 13, 2019

cpu机器在gpu库上运行paddle出core，原因是由于缺失显卡driver，显卡driver与cuda driver不匹配

加上driver check解决该问题

b0b0d628

11 10月, 2019 1 次提交
- Z
  
  refine allocator_flag, test=release/1.6, test=document_fix (#20401) · dc206128
  由 Zeng Jinle 提交于 10月 11, 2019
  
  dc206128
01 10月, 2019 1 次提交
- D
  
  [cherry pick]Improve elementwise operators performance in same dimensions (#20134) · 43f11b5e
  由 danleifeng 提交于 10月 01, 2019
  
  43f11b5e
28 9月, 2019 2 次提交

Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02

由 qingqing01 提交于 9月 28, 2019

* How to write custom op needs to follow framework OP spec.
* Package fluid_framework.so and headers into whl.
* Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
* Export some C-APIs to merge OpInfo between core.so and custom_op.so.
* Add unit testing.
* Update API.spec.

1a3eef02

fix pool2d pool3d,support asymmetric padding and channel_last (#19739) · 24010472

由 liym27 提交于 9月 28, 2019

* fix pool2d pool3d:
1. support asymmetric padding;
2. support padding algorithm:"SAME" and "VALID";
3. support channel_last: data_format NHWC and NDHWC;
4. support inferring shape when input with negative dims in compile time;
5. change doc of python API and c++;
6. fix bug in cuda kernel when Attr(adaptive) is true.

test=develop,test=document_preview

* fix 'tensors' to 'Tensors'. test=develop,test=document_preview

* add test for converage ValueError.test=develop,test=document_preview

* resolve conflict in test_pool2d. test=develop

24010472

27 9月, 2019 1 次提交

Paddle error message stack shaping and optimization (#19895) · b9163350

由 Chen Weihang 提交于 9月 27, 2019

* shape and optimize paddle error message stack, test=develop

* limit exception type & add unittest, test=develop

* fix multi-platform problem, test=develop

* fix related unnitest failed, test=develop

* add doc & fix unittest errors, test=develop

* fix function name error, test=develop

* update tensor test exception msg compare, test=develop

* remove unittest on win32, the dir format is different, test=develop

* remove useless package, test=develop

* add paddle enforce handler unittest, test=develop

* add exception checkout, test=develop

* fix coverage failed, test=develop

* fix op registry test failed, test=develop

* refactor whole pr, test=develop

* remove test in CMakelist, test=develop

* fix coverage, test=develop

b9163350

26 9月, 2019 1 次提交

Fix test pool2d int8 mkldnn (#19976) · 1d32897c

由 joanna.wozna.intel 提交于 9月 26, 2019

* Fix conv2d+dequantize squash for residual fusion

test=develop

* Correct int8 input

test=develop

* Add if exclude or include padding in pool2d mkldnn

test=develop

1d32897c

24 9月, 2019 2 次提交
- Z
  
  fix cuda dev_ctx allocator cmake deps, test=develop (#19953) · 37f76407
  由 Zeng Jinle 提交于 9月 24, 2019
  
  37f76407
- J
  - ReImplemented pooling fwd mkldnn (#19911) · 5b07ca9c
  由 Jacek Czaja 提交于 9月 24, 2019
```
- First implementation of BWD and FWD of pooling mkl-dnn

- Compilation fix

- Fix

- Fix

 - Fix

- Fix to crash

- Compilation fix

- Combined AcquireBacward with Fwd

test=develop
```
  5b07ca9c
23 9月, 2019 1 次提交
- C
  Delete local execution scopes (#19749) · d7251a8e
  由 chengduo 提交于 9月 23, 2019
```
* Add RecordHistoryLocalExecScopes
test=develop
```
  d7251a8e
22 9月, 2019 1 次提交

Add lock to cudnn handle calls (#19845) · c7f36e7c

由 Zeng Jinle 提交于 9月 22, 2019

* refine reallocate of workspace size, test=develop

* add lock to cudnn handle calls, test=develop

c7f36e7c

20 9月, 2019 2 次提交

Z

remove enforce.h file written, test=develop (#19897) · b25d1e75
由 Zeng Jinle 提交于 9月 20, 2019

b25d1e75

[MKL-DNN] LRN refactoring (#19798) · 619c797a

由 Jacek Czaja 提交于 9月 20, 2019

- LRN mkl-dnn kernel refactor

test=develop

- compilation fix

- Another compilation fix

- Compilation fix

- another compilation fix

- compilation fix

- Crash fix

- optional LRN mkldnn workspace

- Added mid allocation

- Workaround for tests

- Removed gradient from is_test ut

- Removed mid for inference

- Reverted LRN mid removal for is_test

- PADDLE_ENFORCE adjusted

- Rebase to templatization commit

- Compilation fix

- compilation fix

test=develop

- lint

test=develop

- Fix to crash

- Rebase to recent codebase

 - lin

- lint

- compilation fix

619c797a

19 9月, 2019 2 次提交

Refactor conv computeINT8 (#19574) · 2c32c2d6

由 lidanqing 提交于 9月 19, 2019

* fix conflicts
test=develop

* change mask_bias_reorder
test=develop

* add ComputeMask function to make code clear
test=develop

* change according to reviews
test=develop

* change according to reviews
test=develop

2c32c2d6

Add template functions for Acquire primitive/primitive_desc (#19867) · c7e68892

由 Adam 提交于 9月 19, 2019

* Add template functions for Acquire primitive/primitive_desc
test=develop

* Move acquire primitive descriptor to protected section
test=develop

c7e68892

18 9月, 2019 2 次提交
- Z
  
  remove some flags and add comments to some flags, test=develop (#19813) · 13ca364c
  由 Zeng Jinle 提交于 9月 18, 2019
  
  13ca364c
- Z
  
  refine reallocate of workspace size, test=develop (#19843) · 5eb381a3
  由 Zeng Jinle 提交于 9月 18, 2019
  
  5eb381a3
17 9月, 2019 1 次提交
- A
  Add MKLDNNhandlerT templatized class (#19801) · dfdd73cb
  由 Adam 提交于 9月 17, 2019
```
test=develop
```
  dfdd73cb
16 9月, 2019 1 次提交
- Z
  
  reduce default value of cudnn workspace size, test=develop (#19780) · 32b1151f
  由 Zeng Jinle 提交于 9月 16, 2019
  
  32b1151f
14 9月, 2019 2 次提交
- A
  Add common CreateKey for mkldnn handlers (#19767) · d4413a54
  由 Adam 提交于 9月 14, 2019
```
test=develop
```
  d4413a54
- Y
  Fix the definition issue when used mkl_scsrmm and mkl_dcsrmm functions. (#19774) · 0d6ea529
  由 Yihua Xu 提交于 9月 13, 2019
```
test=develop
```
  0d6ea529
12 9月, 2019 1 次提交
- J
  Refactoring activation mkldnn op (#19748) · 9e4c9585
  由 Jacek Czaja 提交于 9月 12, 2019
```
test=develop

- fix to BWD

test=develop
```
  9e4c9585
11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致