提交 · 0c0ff2828ccedb51db23290d6df9e4c83839d6af · PaddlePaddle / PaddleDetection

24 11月, 2017 1 次提交

由 Qiao Longfei 提交于 11月 24, 2017

* make enforce a target and dependent on nccl when gpu is enabled

* add some more dependency

c9172c1c

11 11月, 2017 2 次提交

D

Use G++ to compile some cu operators. · f5e36765
由 dangqingqing 提交于 11月 11, 2017

f5e36765

Fix a dead lock bug for dyload/nccl.h when nccl lib cannot be loaded (#5533) · 2378679a

由 emailweixu 提交于 11月 10, 2017

It caused by a bug of std::call_once described in https://stackoverflow.com/questions/41717579/stdcall-once-hangs-on-second-call-after-callable-threw-on-first-call. It is likely caused by a deeper bug of pthread_once, which is discussed in https://patchwork.ozlabs.org/patch/482350/

2378679a

26 10月, 2017 1 次提交

Cudnn batch norm op (#5067) · 56b723c4

由 Qiao Longfei 提交于 10月 25, 2017

* init cudnn batch norm op

* rename batch_norm_cudnn_op.cc batch_norm_op.cu

* correct name style

* add ExtractNCWHD, simplify code

* fix ExtractNCWHD

* use CUDNN_ENFORCE instead of PADDLE_ENFORCE

56b723c4

24 10月, 2017 2 次提交
- Y
  
  Use external project for NCCL (#5028) · 94e741d6
  由 Yu Yang 提交于 10月 23, 2017
  
  94e741d6
- Y
  Feature/nccl dso (#5001) · 43c6ff21
  由 Yu Yang 提交于 10月 23, 2017
```
* "add nccl enforce"

* Dev

* Update comment

* Add nccl test

* Follow comments
```
  43c6ff21
18 10月, 2017 1 次提交

MatMul operator (#4856) · 16489827

由 Markus Kliegl 提交于 10月 17, 2017

* initial matmul operator

Similar to np.matmul, but also has transpose_X and transpose_Y flags,
and only supports tensors from rank 1 to 3 inclusive.

For GPU, uses cublas?gemmStridedBatched. For CPU, uses
cblas_?gemm_batch if available via MKL; otherwise a simple serial
implementation that loops over the batch dimension is employed for now.

16489827

16 10月, 2017 1 次提交
- D
  
  "fix enforce error" · d8aebaf5
  由 Dong Zhihong 提交于 10月 15, 2017
  
  d8aebaf5
15 10月, 2017 1 次提交
- D
  
  "add enforce check" · 54d3dbd8
  由 Dong Zhihong 提交于 10月 14, 2017
  
  54d3dbd8
31 8月, 2017 1 次提交
- D
  
  Add unit testing for cuDNN wrapper. · 20713222
  由 dangqingqing 提交于 8月 31, 2017
  
  20713222
10 8月, 2017 4 次提交
- Y
  
  Add curandGenerateNormal to curand.h · d2995288
  由 Yu Yang 提交于 8月 10, 2017
  
  d2995288
- Q
  
  format code · 688c43b1
  由 qijun 提交于 8月 10, 2017
  
  688c43b1
- Y
  Fix gaussian_random_op compile error · 45911102
  由 Yu Yang 提交于 8月 10, 2017
```
* Should always use `dynload::` for cuda function.
* Fix cublas.h without DSO load.
```
  45911102
- Q
  
  fix bug in dynload · 5f1081d8
  由 qijun 提交于 8月 10, 2017
  
  5f1081d8
04 8月, 2017 1 次提交
- L
  
  Add cpplint for *.h and cuda *.cu · b58725bd
  由 liaogang 提交于 8月 04, 2017
  
  b58725bd
15 7月, 2017 1 次提交
- L
  
  ENH: unify PADDLE_ENFORCE · f812de2c
  由 liaogang 提交于 7月 15, 2017
  
  f812de2c
13 7月, 2017 1 次提交
- Q
  
  fix bug in dynload · 4e918377
  由 qijun 提交于 7月 13, 2017
  
  4e918377
12 7月, 2017 1 次提交
- Q
  
  split device_context · 14d2c399
  由 qijun 提交于 7月 12, 2017
  
  14d2c399
11 7月, 2017 2 次提交
- Q
  
  fix cublas dynload bug · 69d76812
  由 qijun 提交于 7月 11, 2017
  
  69d76812
- Y
  
  Refine CUDA Related libraries · a0466053
  由 Yu Yang 提交于 7月 11, 2017
  
  a0466053
04 7月, 2017 3 次提交
- Q
  
  fix wrong including header-file in files in paddle/platform/dynload dir · e6fcdd47
  由 qijun 提交于 7月 04, 2017
  
  e6fcdd47
- L
  
  Delete cmake in dynload · 379434b2
  由 liaogang 提交于 7月 04, 2017
  
  379434b2
- Q
  
  move to dynload directory · 3567ea6d
  由 qijun 提交于 7月 04, 2017
  
  3567ea6d

PaddlePaddle / PaddleDetection 大约 1 年 前同步成功

PaddlePaddle / PaddleDetection
大约 1 年前同步成功