提交 · 1957192f05133c5c15e9c30fb55cffccc39a291d · BaiXuePrincess / Paddle

06 11月, 2019 1 次提交
- Z
  
  refine error message of allocator again, test=develop (#21023) · a710ccc0
  由 Zeng Jinle 提交于 11月 06, 2019
  
  a710ccc0
01 11月, 2019 1 次提交
- W
  
  gpu info query refine test=develop (#20904) · 7695b713
  由 wangchaochaohu 提交于 11月 01, 2019
  
  7695b713
31 10月, 2019 1 次提交
- C
  
  Polish and arrange code in enforce.h (#20901) · 3358455c
  由 Chen Weihang 提交于 10月 31, 2019
  
  3358455c
28 10月, 2019 1 次提交
- C
  
  delete paddle infershape enforce marco (#20832) · 8b59ac3a
  由 Chen Weihang 提交于 10月 28, 2019
  
  8b59ac3a
25 10月, 2019 1 次提交

Make formatted ENFORCE stack adapt to more situations (#20826) · 1d1552d1

由 Chen Weihang 提交于 10月 25, 2019

* Make formatted ENFORCE stack adapt to more situations and polish details, test=develop

* restore template message position, test=develop

1d1552d1

22 10月, 2019 1 次提交
- A
  Minor MKL-DNN conv int8 performance fixes (#20753) · 67b59ddb
  由 Adam 提交于 10月 22, 2019
```
test=develop
```
  67b59ddb
20 10月, 2019 1 次提交
- 1
  test=develop, add communicator_is_sgd_optimizer flag (#20677) · 95e90aa1
  由 123malin 提交于 10月 20, 2019
```
* test=develop, communicator_is_sgd_optimizer flags
```
  95e90aa1
18 10月, 2019 3 次提交
- W
  add support to gcc8, add docker env test=develop (#19807) · 9e594823
  由 wopeizl 提交于 10月 18, 2019
```
* add support to gcc8, add docker env test=develop
```
  9e594823
- W
  
  Fix dgc nan by stripping nccl from sparseReduce. (#20630) · 507afa8a
  由 WangXi 提交于 10月 17, 2019
  
  507afa8a
- L
  Revert "Refactor conv computeINT8" (#20640) · 46e93f7c
  由 lidanqing 提交于 10月 18, 2019
```
* Revert "Refactor conv computeINT8 (#19574)"

This reverts commit 2c32c2d6.

test=develop

* replace PADDLE_ENFORCE
test=develop
```
  46e93f7c
17 10月, 2019 1 次提交

[MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241) · a1cd27f1

由 Jacek Czaja 提交于 10月 17, 2019

* - Flushing mkl-dnn cache

test=develop

- Disabled clearing cache for LoadModel

- Added clearing of mkl-dnn cache when Executor is created

test=develop

- Do not clear for GPU places

test=develop

- compilation fix

test=develop

* - Moved clearing of mkl-dnn cache in destructor of executor

test=develop

* - Compilation fix

test=develop

- Reverted conditional clearing of mkl-dnn cache in Executors's
  destructor

test=develop

- compilation fix

a1cd27f1

16 10月, 2019 1 次提交
- Z
  
  make_conv_workspace_size_configurable, test=develop (#20662) · 4922eb6d
  由 Zeng Jinle 提交于 10月 16, 2019
  
  4922eb6d
14 10月, 2019 1 次提交

Dlpack support (#20039) · 12e4be03

由 633WHU 提交于 10月 14, 2019

* support dlpack to tensor and implement python interface test=develop

* add unittest for _to_dlpack and from_dlpack test=develop

12e4be03

12 10月, 2019 1 次提交
- W
  enable cpu machine to run paddle in gpu lib · 751812a6
  由 Wilber 提交于 10月 12, 2019
```
enable cpu machine to run paddle model in gpu lib
```
  751812a6
11 10月, 2019 1 次提交
- Z
  
  refine allocator_flag, test=develop, test=document_fix (#20400) · 1d1d221f
  由 Zeng Jinle 提交于 10月 11, 2019
  
  1d1d221f
30 9月, 2019 1 次提交
- D
  Improve elementwise operators performance in same dimensions. (#19763) · 425279a5
  由 danleifeng 提交于 9月 30, 2019
```
Improve elementwise operators performance in same dimensions
```
  425279a5
28 9月, 2019 2 次提交

Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02

由 qingqing01 提交于 9月 28, 2019

* How to write custom op needs to follow framework OP spec.
* Package fluid_framework.so and headers into whl.
* Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
* Export some C-APIs to merge OpInfo between core.so and custom_op.so.
* Add unit testing.
* Update API.spec.

1a3eef02

fix pool2d pool3d,support asymmetric padding and channel_last (#19739) · 24010472

由 liym27 提交于 9月 28, 2019

* fix pool2d pool3d:
1. support asymmetric padding;
2. support padding algorithm:"SAME" and "VALID";
3. support channel_last: data_format NHWC and NDHWC;
4. support inferring shape when input with negative dims in compile time;
5. change doc of python API and c++;
6. fix bug in cuda kernel when Attr(adaptive) is true.

test=develop,test=document_preview

* fix 'tensors' to 'Tensors'. test=develop,test=document_preview

* add test for converage ValueError.test=develop,test=document_preview

* resolve conflict in test_pool2d. test=develop

24010472

27 9月, 2019 1 次提交

Paddle error message stack shaping and optimization (#19895) · b9163350

由 Chen Weihang 提交于 9月 27, 2019

* shape and optimize paddle error message stack, test=develop

* limit exception type & add unittest, test=develop

* fix multi-platform problem, test=develop

* fix related unnitest failed, test=develop

* add doc & fix unittest errors, test=develop

* fix function name error, test=develop

* update tensor test exception msg compare, test=develop

* remove unittest on win32, the dir format is different, test=develop

* remove useless package, test=develop

* add paddle enforce handler unittest, test=develop

* add exception checkout, test=develop

* fix coverage failed, test=develop

* fix op registry test failed, test=develop

* refactor whole pr, test=develop

* remove test in CMakelist, test=develop

* fix coverage, test=develop

b9163350

26 9月, 2019 1 次提交

Fix test pool2d int8 mkldnn (#19976) · 1d32897c

由 joanna.wozna.intel 提交于 9月 26, 2019

* Fix conv2d+dequantize squash for residual fusion

test=develop

* Correct int8 input

test=develop

* Add if exclude or include padding in pool2d mkldnn

test=develop

1d32897c

24 9月, 2019 2 次提交
- Z
  
  fix cuda dev_ctx allocator cmake deps, test=develop (#19953) · 37f76407
  由 Zeng Jinle 提交于 9月 24, 2019
  
  37f76407
- J
  - ReImplemented pooling fwd mkldnn (#19911) · 5b07ca9c
  由 Jacek Czaja 提交于 9月 24, 2019
```
- First implementation of BWD and FWD of pooling mkl-dnn

- Compilation fix

- Fix

- Fix

 - Fix

- Fix to crash

- Compilation fix

- Combined AcquireBacward with Fwd

test=develop
```
  5b07ca9c
23 9月, 2019 1 次提交
- C
  Delete local execution scopes (#19749) · d7251a8e
  由 chengduo 提交于 9月 23, 2019
```
* Add RecordHistoryLocalExecScopes
test=develop
```
  d7251a8e
22 9月, 2019 1 次提交

Add lock to cudnn handle calls (#19845) · c7f36e7c

由 Zeng Jinle 提交于 9月 22, 2019

* refine reallocate of workspace size, test=develop

* add lock to cudnn handle calls, test=develop

c7f36e7c

20 9月, 2019 2 次提交

Z

remove enforce.h file written, test=develop (#19897) · b25d1e75
由 Zeng Jinle 提交于 9月 20, 2019

b25d1e75

[MKL-DNN] LRN refactoring (#19798) · 619c797a

由 Jacek Czaja 提交于 9月 20, 2019

- LRN mkl-dnn kernel refactor

test=develop

- compilation fix

- Another compilation fix

- Compilation fix

- another compilation fix

- compilation fix

- Crash fix

- optional LRN mkldnn workspace

- Added mid allocation

- Workaround for tests

- Removed gradient from is_test ut

- Removed mid for inference

- Reverted LRN mid removal for is_test

- PADDLE_ENFORCE adjusted

- Rebase to templatization commit

- Compilation fix

- compilation fix

test=develop

- lint

test=develop

- Fix to crash

- Rebase to recent codebase

 - lin

- lint

- compilation fix

619c797a

19 9月, 2019 2 次提交

Refactor conv computeINT8 (#19574) · 2c32c2d6

由 lidanqing 提交于 9月 19, 2019

* fix conflicts
test=develop

* change mask_bias_reorder
test=develop

* add ComputeMask function to make code clear
test=develop

* change according to reviews
test=develop

* change according to reviews
test=develop

2c32c2d6

Add template functions for Acquire primitive/primitive_desc (#19867) · c7e68892

由 Adam 提交于 9月 19, 2019

* Add template functions for Acquire primitive/primitive_desc
test=develop

* Move acquire primitive descriptor to protected section
test=develop

c7e68892

18 9月, 2019 2 次提交
- Z
  
  remove some flags and add comments to some flags, test=develop (#19813) · 13ca364c
  由 Zeng Jinle 提交于 9月 18, 2019
  
  13ca364c
- Z
  
  refine reallocate of workspace size, test=develop (#19843) · 5eb381a3
  由 Zeng Jinle 提交于 9月 18, 2019
  
  5eb381a3
17 9月, 2019 1 次提交
- A
  Add MKLDNNhandlerT templatized class (#19801) · dfdd73cb
  由 Adam 提交于 9月 17, 2019
```
test=develop
```
  dfdd73cb
16 9月, 2019 1 次提交
- Z
  
  reduce default value of cudnn workspace size, test=develop (#19780) · 32b1151f
  由 Zeng Jinle 提交于 9月 16, 2019
  
  32b1151f
14 9月, 2019 2 次提交
- A
  Add common CreateKey for mkldnn handlers (#19767) · d4413a54
  由 Adam 提交于 9月 14, 2019
```
test=develop
```
  d4413a54
- Y
  Fix the definition issue when used mkl_scsrmm and mkl_dcsrmm functions. (#19774) · 0d6ea529
  由 Yihua Xu 提交于 9月 13, 2019
```
test=develop
```
  0d6ea529
12 9月, 2019 1 次提交
- J
  Refactoring activation mkldnn op (#19748) · 9e4c9585
  由 Jacek Czaja 提交于 9月 12, 2019
```
test=develop

- fix to BWD

test=develop
```
  9e4c9585
11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

10 9月, 2019 2 次提交

A
MKLDNN handler cleanup (#19713) · 428b2b9e
由 Adam 提交于 9月 10, 2019
```
* MKLDNN handler cleanup

* MKLDNN handler cleanup
test=develop
```
428b2b9e

Add document annotations for FLAGS that need to be open to external developers... · 27235cf2

由 XiaoguangHu 提交于 9月 10, 2019

Add document annotations for FLAGS that need to be open to external developers test=develop (#19692)

Add document annotations for FLAGS that need to be open to external developers

27235cf2

09 9月, 2019 1 次提交

paddle::framework::vectorize() templatization [PART3] (#19643) · f05d2c51

由 Tao Luo 提交于 9月 09, 2019

* paddle::framework::vectorize() templatization

test=develop

* update pybind/imperative.cc

test=develop

* revert update on unsqueeze_op.cc and warpctc_cudnn_op.cu.cc

test=develop

f05d2c51

05 9月, 2019 1 次提交

Integrate NVRTC to support compiling CUDA kernel at runtime (#19422) · 42b5bec6

由 Yiqun Liu 提交于 9月 05, 2019

* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
test=develop

* Call CUDA driver api to launch the kernel compiled by nvrtc.
test=develop

* Disable for mac and windows.
test=develop

* Refine the codes to support manually specified num_threads and workload_per_thread.
test=develop

* Refine the CUDA kernel to support large dims.
test=develop

42b5bec6

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致