Commits · 5b71eefc760f4f99be84056257c314ad1bd8b857 · PaddlePaddle / Paddle

13 Mar, 2018 1 commit

Committed by QI JUN on Mar 13, 2018

* fix nccl op unit test

* fix build error

* format code

* refine nccl related unit test

* fix build error

* add setGPUData

* clean up

* follow comments

* rm test_nccl.cu

* follow comment

* rm wait

7287630e

12 Mar, 2018 6 commits
- Extract Prepare from Executor · 43d09a1c
  Committed by Yu Yang on Mar 12, 2018
- [Memory]More memory optimization policy (#8690) · f7e9fe57
  Committed by QI JUN on Mar 12, 2018
```
* add memopt level

* add opt level for image classification demo

* clean code

* add delete op

* clean code

* test machine translation demo

* clean code

* clean code

* skip fill constant with force cpu

* clean code

* clean code

* refine code

* clean code

* fix bug
```
- Fix dist compile error (#8987) · b5ef315c
  Committed by Yancey on Mar 12, 2018
- Fix bug in detection_output and mAP calculation in SSD. (#8985) · b3d26cd3
  Committed by qingqing01 on Mar 12, 2018
```
* Clipping bbox in the mAP evaluator calculation.

* Fix bug in detection_output and mAP calculation in SSD.

* Fix bug in detection.py.

* Fix bug in test_detection_map_op.py.
```
- add comment · c88f58db
  Committed by Kexin Zhao on Mar 11, 2018
- address comments · 3b44b849
  Committed by Kexin Zhao on Mar 11, 2018
10 Mar, 2018 5 commits
- MKLDNN pool2d OP kernel added (#8879) · 4730a4be
  Committed by pzelazko-intel on Mar 10, 2018
```
* MKLDNN pool2d OP kernel added

* conv2d and pool2d MKLDNN kernels renamed

* MKLDNN conv2d kernel refactoring
```
- fix bug · 95de7617
  Committed by Kexin Zhao on Mar 09, 2018
- add gpu info func to get compute cap · 1998d5af
  Committed by Kexin Zhao on Mar 09, 2018
- fix math function arch mismatch for older GPU · d400b419
  Committed by Kexin Zhao on Mar 09, 2018
- fix a potential bug in the c++ reader · 614c33fb
  Committed by fengjiayi on Mar 10, 2018
09 Mar, 2018 10 commits
- Refine cast op (#8923) · b341bac7
  Committed by QI JUN on Mar 09, 2018
```
* fix mac build error

* override GetExpectedKernelType for cast op

* fix typo

* add cuda unittest
```
- Fix sparse update memory error for distributed training (#8837) · 84680379
  Committed by Yancey on Mar 09, 2018
```
Fix sparse update memory error for distributed training
```
- uses channel to replace the traditional buffer · 35e1e0d5
  Committed by fengjiayi on Mar 09, 2018
- fix a compile error · 6e5736e2
  Committed by fengjiayi on Mar 09, 2018
- remove HasNext · 4e517881
  Committed by fengjiayi on Mar 09, 2018
- Refine the profile codes for inference. · a8e85077
  Committed by Liu Yiqun on Mar 09, 2018
- update unpushed commits for zerocopy grpc (#8900) · 9dd34e41
  Committed by 武毅 on Mar 09, 2018
- Add float16 GEMM math function on GPU (#8695) · 90215b78
  Committed by kexinzhao on Mar 08, 2018
```
* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* initial commit

* fix error

* small fix

* add more gemm fp16 tests

* fix error

* add utility function
```
- Performance/zero copy variable seriralization (#8839) · 45af8c1e
  Committed by 武毅 on Mar 09, 2018
- Print exception message from threads · 9a27d3af
  Committed by Xin Pan on Mar 08, 2018
08 Mar, 2018 11 commits
- Add ElementwiseOpInferVarType · 53d19f5b
  Committed by chengduoZH on Mar 08, 2018
- Clipping bbox in the mAP evaluator calculation. (#8872) · ffda2c41
  Committed by qingqing01 on Mar 08, 2018
- Add test for nested RecordEvent. (#8773) · fecc9a38
  Committed by Yiqun Liu on Mar 08, 2018
```
* Add test for nested RecordEvent.

* Remove the debug information.

* Add log information for the 3 usages and reduce the loop counts of nested case.
```
- Use vlog instead. · 30e556d6
  Committed by Xin Pan on Mar 07, 2018
- fix mac build error (#8856) · 47ca1814
  Committed by QI JUN on Mar 08, 2018
- Add log before op Run · f7c71356
  Committed by chengduoZH on Mar 08, 2018
- Add warning · eb468453
  Committed by Xin Pan on Mar 07, 2018
- Add profiling information for inference example (#8748) · a032f56f
  Committed by Yiqun Liu on Mar 08, 2018
```
* Add profiling information for inference example, recognize digits.

* Refine the profiling method.

* Correct the use of RecordEvent and simplify recognize_digits.
```
- Fix detection_map_op for multi-device. (#8845) · ded34b2c
  Committed by qingqing01 on Mar 08, 2018
- Add context wait in type_transform (#8850) · 7f00716c
  Committed by kexinzhao on Mar 07, 2018
- compile and install the static library of fluid inference (#7827) · 6f50dee4
  Committed by Tao Luo on Mar 08, 2018
```
* compile and install the static library of fluid inference

* fix dynload_cuda not in CPU mode

* update shared library and adjust the deploy of openblas

* adjust the deploy of openblas

* * auto add all fluid modules for static library
* use libprotobuf.a instead of libprotobuf-lite.a for profiler

* use set_property to set the global varible instead of ENV

* add gpu depends of fluid modules, auto add inference_lib_dist depends

* change the condition of openblas_lib, and fix a typo
```
07 Mar, 2018 7 commits
- fix errors · b1f647fd
  Committed by fengjiayi on Mar 07, 2018
- fix an error · e8d21b63
  Committed by fengjiayi on Mar 07, 2018
- Add basic double buffer reader · 4fb7b967
  Committed by fengjiayi on Mar 07, 2018
- add back framework_proto depends · 49f3f1db
  Committed by Luo Tao on Mar 07, 2018
- rename concat_functor to concat, refine CMakeLists based on comments · 3ddc9971
  Committed by Luo Tao on Mar 07, 2018
- MKLDNN conv2d kernel added (#8451) · 8c71adaa
  Committed by pzelazko-intel on Mar 07, 2018
```
* MKLDNN conv2 OP kernel added

* TODOs added

* mkldnn conv2d OP refactor

* CanCUDNNBeUsed and CanMKLDNNBeUsed moved
```
- add inplace to reshape (#8747) · 049383c6
  Committed by Yan Chunwei on Mar 07, 2018

PaddlePaddle / Paddle synced successfully about 1 year ago