Commits · 00b9e9a1357bb3fa6e6adceb4e650d9f6424aa2a · BaiXuePrincess / Paddle

22 Nov, 2018 · 2 commits

Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) · 00b9e9a1

Committed by chengduo on Nov 22, 2018

* refine cublas
test=develop

* code refine

* refine cublas

* add GEMME_EX

* add enable_cublas_tensor_op_math doc and add cublasCall
test=develop

* fix CublasCall for cuda version
test=develop

* fix error
test=develop

* fix GEMM_EX to be compatible with gcc 4.8
test=develop

* add GEMM_EX
test=develop

* to be compatible with gcc4.8
test=develop

00b9e9a1

Windows/online (#14474) · d9a1f3e5

Committed by wopeizl on Nov 22, 2018

* add recordio support

* disable the openblas multi-thread on windows since no support
adjust the python script

* code style

* code style
test=develop

* add create_recordio_file_reader back

* fix code style
test=develop

* fix the gtest.cmake on windows

* fix cc_test on windows

* fix the win build
test=develop

* remove fused compile support on windows
test=develop

* add the jit support
test=develop

* add the jit support, test=develop

* add the jit support, test=develop

* add the jit back
fix compile error on windows

* rollback test=develop

* test case fix

* disable DSO by default on windows

* exclude warpctc_op on windows

* exclude the dynload_warpctc out on windows
test=develop

* fix the scripts error
test=develop

* disable avx on windows by default
test=develop

* re-organize the cmake file

* disable mkl on windows by default

* add warp_ctc back

* fix the dependency

* fix the dependency

* fix the build issue on windows

* remove unsupported flag on windows

* code style

* code style
test=develop

* fix issue

* add profiler, parallel_executor back

* clean up the pre-definitions on windows

* fix build issue

* test=develop

d9a1f3e5

19 Nov, 2018 · 1 commit
- Convolution fusion operator. (#14449) · fd7e6431
  Committed by qingqing01 on Nov 19, 2018
```
* Convolution fusion operator.
* Clean code
test=develop
```
  fd7e6431
16 Nov, 2018 · 1 commit

Add cudnn ctc loss (#12366) · b32c13dc

Committed by Wu Yi on Nov 16, 2018

* add cudnn ctc loss

* wip add test test=develop

* wip

* wip

* done test=develop

* move include cudnn test=develop

* test test=develop

* fix build test=develop

* fix build test=develop

* fix build on cudnn5 test=develop

* fix cudnn5 build test=develop

* fix cudnn5 build test=develop

* merge develop softmax functor change test=develop

b32c13dc

13 Nov, 2018 · 1 commit
- add mkl vsqr and vpow · 1be85d01
  Committed by tensor-tang on Nov 13, 2018
  
  1be85d01
09 Nov, 2018 · 1 commit

Exhaustive search for cuDNN conv. (#14286) · abe20923

Committed by qingqing01 on Nov 09, 2018

* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
* Fix compiling test=develop

abe20923

08 Nov, 2018 · 1 commit
- Change the origin VLOG level to 10 times · 0c3227a5
  Committed by minqiyang on Nov 08, 2018
```
Fix code to support cpplint syntax check

test=develop
```
  0c3227a5
07 Nov, 2018 · 2 commits
- Revert " Exhaustive search for cuDNN conv. (#14043)" · db8c52da
  Committed by qingqing01 on Nov 07, 2018
```
This reverts commit ce7d9b07.
```
  db8c52da
- Exhaustive search for cuDNN conv. (#14043) · ce7d9b07
  Committed by qingqing01 on Nov 07, 2018
```
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
```
  ce7d9b07
02 Nov, 2018 · 1 commit

Add affine grid generator op (#12238) · 0c319e0b

Committed by whs on Nov 02, 2018

* Add affine grid generator.

* fix affine grid.

* Add unitest.

* Add CPU kernel and fix unitest.

* Fix CPU kernel.

* Refine code.
test=develop

* Fix python api.
test=develop

* Update python api.
test=develop

* Fix comment.
test=develop

* Rename affine_grid_generator to affine_grid and enhance unitest.
test=develop

* Fix unitest.
test=develop

0c319e0b

29 Oct, 2018 · 2 commits
- fix some inappropriate expressions in api doc for grid_sampler. test=develop · ff6329bd
  Committed by dengkaipeng on Oct 29, 2018
  
  ff6329bd
- add Grid Sampler Operator for STN. · 0bb0e0c1
  Committed by dengkaipeng on Oct 19, 2018
  
  0bb0e0c1
28 Sep, 2018 · 1 commit
- namespace issue (#13543) · 2d00e658
  Committed by dzhwinter on Sep 28, 2018
```
* flags

* "follow comment"
```
  2d00e658
15 Sep, 2018 · 1 commit
- debug version · 85f8dd1c
  Committed by dzhwinter on Sep 15, 2018
  
  85f8dd1c
05 Sep, 2018 · 1 commit
- add error info for nccl not found · e322fc4e
  Committed by JiabinYang on Sep 05, 2018
  
  e322fc4e
29 Aug, 2018 · 1 commit
- done · b78394ea
  Committed by dzhwinter on Aug 29, 2018
  
  b78394ea
27 Aug, 2018 · 4 commits
- operator module is done · cd8f3e9e
  Committed by dzhwinter on Aug 27, 2018
  
  cd8f3e9e
- add unstack_op · 0153c21d
  Committed by dzhwinter on Aug 27, 2018
  
  0153c21d
- fix concat synchronization bug · 6cc78705
  Committed by dzhwinter on Aug 27, 2018
  
  6cc78705
- platform module (#12932) · d361624c
  Committed by dzhwinter on Aug 27, 2018
```
* platform module

* Update profiler.h
```
  d361624c
26 Aug, 2018 · 1 commit
- check some operators · 7dceb8a0
  Committed by dzhwinter on Aug 26, 2018
  
  7dceb8a0
24 Aug, 2018 · 1 commit
- windows port · 34f8c9b6
  Committed by dzhwinter on Aug 24, 2018
  
  34f8c9b6
22 Aug, 2018 · 3 commits
- add blas vexp · 3dd66390
  Committed by tensor-tang on Aug 22, 2018
  
  3dd66390
- fix blas dot and add cblas scal · 0ec1f65c
  Committed by tensor-tang on Aug 22, 2018
  
  0ec1f65c
- add cblas dot · a2203d04
  Committed by tensor-tang on Aug 22, 2018
  
  a2203d04
21 Aug, 2018 · 1 commit
- status (#12764) · e23ddf6a
  Committed by dzhwinter on Aug 21, 2018
  
  e23ddf6a
20 Aug, 2018 · 1 commit
- cudnn windows support (#12757) · 00463fdf
  Committed by dzhwinter on Aug 20, 2018
```
* cudnn windows

* "add comment"

* "windows support"

* "fix cmake error"
```
  00463fdf
17 Aug, 2018 · 3 commits
- "windows support" · 64ce1210
  Committed by dzhwinter on Aug 17, 2018
  
  64ce1210
- "windows support" · 59160e8d
  Committed by dzhwinter on Aug 17, 2018
  
  59160e8d
- dlfnh · 335398f1
  Committed by dzhwinter on Aug 17, 2018
  
  335398f1
16 Aug, 2018 · 1 commit
- add mklml vmul · 6644ce79
  Committed by tensor-tang on Aug 16, 2018
  
  6644ce79
03 Aug, 2018 · 1 commit
- add mkl packed gemm · 43cee33a
  Committed by tensor-tang on Aug 02, 2018
  
  43cee33a
05 Jul, 2018 · 1 commit
- "remove lapack" (#11966) · 99a99ec7
  Committed by dzhwinter on Jul 05, 2018
  
  99a99ec7
23 Jun, 2018 · 1 commit
- No NCCL on macOS (#11652) · 2625178a
  Committed by Yi Wang on Jun 22, 2018
```
* Make paddle no longer depend on boost

* Update enforce.h
```
  2625178a
21 Jun, 2018 · 1 commit
- remove usr local lib when dynamic load lib · 28a0ef95
  Committed by tensor-tang on Jun 21, 2018
  
  28a0ef95
20 Jun, 2018 · 2 commits
- add usr local lib to dynamic search path · 3e73a7a9
  Committed by tensor-tang on Jun 20, 2018
  
  3e73a7a9
- enable dynamic load mklml lib on fluid · f503f129
  Committed by tensor-tang on Jun 20, 2018
  
  f503f129
14 Jun, 2018 · 1 commit

Remove cuptiFinalize. · d2afd210

Committed by Xin Pan on Jun 14, 2018

In the CUPTI samples, only cuptiFlush is used.
I can't find any place that calls cuptiFinalize, and
this API can error out as not_implemented in some
CUDA installations.

d2afd210

01 Jun, 2018 · 2 commits
- Static DSO handle · 53dab95b
  Committed by yuyang18 on Jun 01, 2018
  
  53dab95b
- Use static for dlsym · c5115950
  Committed by yuyang18 on Jun 01, 2018
  
  c5115950
