提交 · e81f0228df7a15010230e2193460de0616e55bd3 · BaiXuePrincess / Paddle

10 12月, 2019 1 次提交

MKL-DNN 1.0 Update (#20162) · e81f0228

由 Adam 提交于 12月 10, 2019

* MKLDNN v1.0 rebase to Paddle 1.6
test=develop

* Add hacky paddle::string::to_string() implementation

* vectorize<int64-t>() -> vectorize() cleanup
test=develop

* PADDLE_ENFORCE and void_cast fixes
test=develop

* Rebase changes
test=develop

* Cosmetics
test=develop

* Delete MKL from mkldnn.cmake
test=develop

* CMake debug commands
test=develop

* Delete MKLDNN_VERBOSE and rebase fixes
test=develop

* Rebase fixes
test=develop

* Temporarily disable int8 resnet101 vgg16 and vgg19 tests
test=develop

* Add libmkldnn.so.1 to python setup
test=develop

* Add libmkldnn.so.1 to inference_lib cmake after rebase
test=develop

* Post rebase fixes + FC int8 changes
test=develop

* Fix LRN NHWC
test=develop

* Fix NHWC conv3d
test=develop

* Windows build fix + next conv3d fix
test=develop

* Fix conv2d on AVX2 machines
test=develop

e81f0228

06 12月, 2019 1 次提交
- Z
  
  refine dev_ctx.Wait() exception throw, test=develop (#21600) · 97e76cb9
  由 Zeng Jinle 提交于 12月 06, 2019
  
  97e76cb9
05 12月, 2019 2 次提交
- H
  Refine a Warning Which Can Occur Not Only During Init (#21546) · b241c732
  由 Huihuang Zheng 提交于 12月 05, 2019
```
As the title
```
  b241c732
- W
  Add Branch to avoid CPU profiler warning print (#21556) · 932aca16
  由 wangchaochaohu 提交于 12月 05, 2019
```
* fix profiler warning message in cpu profile mode test=develop
```
  932aca16
04 12月, 2019 1 次提交
- P
  make config option DisableGlogInfo() able to mute all inference logs (#21318) · 122b37ce
  由 Pei Yang 提交于 12月 04, 2019
```
* make DisableGlogInfo able to mute all logs in inference. 
```
  122b37ce
03 12月, 2019 2 次提交
- Z
  NV jetson(nano, tx2, xavier) inference compile support (#21393) · c5f0293c
  由 Zhaolong Xing 提交于 12月 03, 2019
```
* add jeston compile support
test=develop

* refine the cmake
test=develop
```
  c5f0293c
- H
  Add warning message when initialize GLOG failed. (#21487) · a71f53d7
  由 Huihuang Zheng 提交于 12月 03, 2019
```
Add warning message when initialize GLOG failed
```
  a71f53d7
02 12月, 2019 1 次提交

fix -Wno-error=sign-compare warning in gcc8 (#21434) · 01fa4ead

由 Tao Luo 提交于 12月 02, 2019

* fix -Wno-error=sign-compare warning in gcc8

test=develop

* fix warning in distributed codes

test=develop

01fa4ead

01 12月, 2019 1 次提交
- J
  
  nhwc optimization for batchnorm (#21090) · 5e813b53
  由 Jie Fang 提交于 12月 01, 2019
  
  5e813b53
29 11月, 2019 1 次提交
- J
  
  [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375) · cd43c444
  由 Jacek Czaja 提交于 11月 29, 2019
  
  cd43c444
28 11月, 2019 2 次提交
- W
  Profile refine (#21258) · 8293f21a
  由 wangchaochaohu 提交于 11月 28, 2019
```
* fix profile api high version test=develop
```
  8293f21a
- W
  
  fix the profiling bug test=develop (#21396) · e0e205ea
  由 wangchaochaohu 提交于 11月 28, 2019
  
  e0e205ea
25 11月, 2019 1 次提交
- Z
  
  remove warning LNK4006 and warning LNK4221 (#21226) · 345b67b5
  由 zhouwei25 提交于 11月 25, 2019
  
  345b67b5
24 11月, 2019 1 次提交
- G
  
  optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597) · ed2a1852
  由 gongweibao 提交于 11月 24, 2019
  
  ed2a1852
18 11月, 2019 2 次提交

Fix warn of gcc8 (#21205) · cdb3d279

由 Zeng Jinle 提交于 11月 18, 2019

* fix warnings oof gcc 8 compilation, test=develop

* fix boost::bad_get, test=develop

* refine PADDLE_ENFORCE, test=develop

cdb3d279

fix sporadically hang issue on windows(#21201) · d8b6cf2b

由 liuwei1031 提交于 11月 18, 2019

cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows

d8b6cf2b

14 11月, 2019 2 次提交

Improve topk performance. (#21087) · b93870e6

由 zhaoyuchen2018 提交于 11月 13, 2019

* Improve topk performance.

give 200000 data to compute topk,
before opt: cost 1s
after opt: cost 0.0028s.

* Refine return value.
* Add cuda util funtions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

b93870e6

C

change cuda enforce & add example (#21142) · b3a3e6f6
由 Chen Weihang 提交于 11月 14, 2019

b3a3e6f6

13 11月, 2019 1 次提交
- C
  
  add examples for resource exhausted error, test=develop (#21140) · 27fa9c10
  由 Chen Weihang 提交于 11月 13, 2019
  
  27fa9c10
12 11月, 2019 1 次提交
- C
  Further simplify the C++ error info stack (#21093) · edd6680a
  由 Chen Weihang 提交于 11月 12, 2019
```
* simplify C++ error stack by rewrite Place, test=develop

* polish assignment overload func, test=develop
```
  edd6680a
08 11月, 2019 2 次提交

Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835

由 joanna.wozna.intel 提交于 11月 08, 2019

* Add transpose2 INT8 for mkl-dnn

test=develop

* Fix test_transpose_int8_mkldnn

test=develop

* Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"

This reverts commit 34011bdb, reversing
changes made to 2ce6473f.

* Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""

This reverts commit 23754dd7.

* Add template to TransposeMKLDNNHandler

test=develop

* Resolve conflict

test=develop

* Restore get_size and refactor

test=develop

77c20835

Enrich the type of error and declare the error type interfaces (#21024) · 7ee25189

由 Chen Weihang 提交于 11月 08, 2019

* Enrich the type of error and declare the error type interfaces, test=develop

* adjust tests to adapt new form, test=develop

* add inference deps with error_codes.pb.h, test=develop

* restore stack iter start pos, test=develop

* polish code based review comments, test=develop

7ee25189

07 11月, 2019 1 次提交

Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062) · 3fda695b

由 Adam 提交于 11月 07, 2019

* Add asymetric padding support for mkldnn pooling
test=develop

* Add asymetric padding support for mkldnn conv
test=develop

* Add asymetric padding support for mkldnn conv_transpose
test=develop

3fda695b

06 11月, 2019 1 次提交
- Z
  
  refine error message of allocator again, test=develop (#21023) · a710ccc0
  由 Zeng Jinle 提交于 11月 06, 2019
  
  a710ccc0
01 11月, 2019 1 次提交
- W
  
  gpu info query refine test=develop (#20904) · 7695b713
  由 wangchaochaohu 提交于 11月 01, 2019
  
  7695b713
31 10月, 2019 1 次提交
- C
  
  Polish and arrange code in enforce.h (#20901) · 3358455c
  由 Chen Weihang 提交于 10月 31, 2019
  
  3358455c
28 10月, 2019 1 次提交
- C
  
  delete paddle infershape enforce marco (#20832) · 8b59ac3a
  由 Chen Weihang 提交于 10月 28, 2019
  
  8b59ac3a
25 10月, 2019 1 次提交

Make formatted ENFORCE stack adapt to more situations (#20826) · 1d1552d1

由 Chen Weihang 提交于 10月 25, 2019

* Make formatted ENFORCE stack adapt to more situations and polish details, test=develop

* restore template message position, test=develop

1d1552d1

22 10月, 2019 1 次提交
- A
  Minor MKL-DNN conv int8 performance fixes (#20753) · 67b59ddb
  由 Adam 提交于 10月 22, 2019
```
test=develop
```
  67b59ddb
20 10月, 2019 1 次提交
- 1
  test=develop, add communicator_is_sgd_optimizer flag (#20677) · 95e90aa1
  由 123malin 提交于 10月 20, 2019
```
* test=develop, communicator_is_sgd_optimizer flags
```
  95e90aa1
18 10月, 2019 3 次提交
- W
  add support to gcc8, add docker env test=develop (#19807) · 9e594823
  由 wopeizl 提交于 10月 18, 2019
```
* add support to gcc8, add docker env test=develop
```
  9e594823
- W
  
  Fix dgc nan by stripping nccl from sparseReduce. (#20630) · 507afa8a
  由 WangXi 提交于 10月 17, 2019
  
  507afa8a
- L
  Revert "Refactor conv computeINT8" (#20640) · 46e93f7c
  由 lidanqing 提交于 10月 18, 2019
```
* Revert "Refactor conv computeINT8 (#19574)"

This reverts commit 2c32c2d6.

test=develop

* replace PADDLE_ENFORCE
test=develop
```
  46e93f7c
17 10月, 2019 1 次提交

[MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241) · a1cd27f1

由 Jacek Czaja 提交于 10月 17, 2019

* - Flushing mkl-dnn cache

test=develop

- Disabled clearing cache for LoadModel

- Added clearing of mkl-dnn cache when Executor is created

test=develop

- Do not clear for GPU places

test=develop

- compilation fix

test=develop

* - Moved clearing of mkl-dnn cache in destructor of executor

test=develop

* - Compilation fix

test=develop

- Reverted conditional clearing of mkl-dnn cache in Executors's
  destructor

test=develop

- compilation fix

a1cd27f1

16 10月, 2019 1 次提交
- Z
  
  make_conv_workspace_size_configurable, test=develop (#20662) · 4922eb6d
  由 Zeng Jinle 提交于 10月 16, 2019
  
  4922eb6d
14 10月, 2019 1 次提交

Dlpack support (#20039) · 12e4be03

由 633WHU 提交于 10月 14, 2019

* support dlpack to tensor and implement python interface test=develop

* add unittest for _to_dlpack and from_dlpack test=develop

12e4be03

12 10月, 2019 1 次提交
- W
  enable cpu machine to run paddle in gpu lib · 751812a6
  由 Wilber 提交于 10月 12, 2019
```
enable cpu machine to run paddle model in gpu lib
```
  751812a6
11 10月, 2019 1 次提交
- Z
  
  refine allocator_flag, test=develop, test=document_fix (#20400) · 1d1d221f
  由 Zeng Jinle 提交于 10月 11, 2019
  
  1d1d221f
30 9月, 2019 1 次提交
- D
  Improve elementwise operators performance in same dimensions. (#19763) · 425279a5
  由 danleifeng 提交于 9月 30, 2019
```
Improve elementwise operators performance in same dimensions
```
  425279a5
28 9月, 2019 1 次提交

Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02

由 qingqing01 提交于 9月 28, 2019

* How to write custom op needs to follow framework OP spec.
* Package fluid_framework.so and headers into whl.
* Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
* Export some C-APIs to merge OpInfo between core.so and custom_op.so.
* Add unit testing.
* Update API.spec.

1a3eef02

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致