- 18 Dec 2017, 1 commit

Committed by QI JUN

* add more place_test and rename Cudnn to CUDNN
* fix ci
- 14 Dec 2017, 1 commit

Committed by dzhwinter

* "derived cudnnDevice context"
* "leave remove cudnn handle from CUDADeviceContext"
* "fix math function error"
- 12 Dec 2017, 1 commit

Committed by QI JUN

The main fixes are:
* take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
* remove the `eigen_device` interface from the base class `DeviceContext`
* remove the `GetEigenDevice` interface from `ExecutionContext` and the base class `DeviceContext`
* remove the unused `platform::EigenDeviceConverter`
* rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
* rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
- 27 Nov 2017, 1 commit

Committed by Yancey1989
- 23 Nov 2017, 1 commit

Committed by dangqingqing
- 16 Nov 2017, 1 commit

Committed by Yang Yang(Tony)

* first commit
* Python API for while op
* Python unittest for simple while_op forward
* fix out to be list
* Fix UT
* VarType
* Fix several bugs
* Fix bug
* Fix bug
* Fix bug
* Fix bug
* Fix unittest
* Remove debug log
* Add comments
* add PADDLE_ENFORCE
* while_grad_op first commit
* Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs
* not sure how to set dim of while outputs
* push for test
* add executor vlog
* fix bug of while_op cond
* Several enhancements to the code:
  1. Backward always infers shape & var type, since RENAME variables are created when building the backward operator but their shapes & var types were not inferred.
  2. Never use `SomePtr->` directly, since any pointer returned by a function could be nullptr. Add `detail::Ref` to cast a pointer to a reference safely.
  3. Enhance error messages for backward.
  4. Infer the data type of variables in `sum` and `tensor_write`.
* Fix bugs of while_op gradient
* Fix several bugs of while_op grad
* fix fill zeros like
* fix 3 >= 3
* fix placeholder shouldn't be null
* fail on sum op
* Fix SumOp of TensorList
* clean up
* pass while test
* fix test_array_write_read
* pass sum op
* Support int/int64 for fill_constant_batch_size_like
* Fix compile
- 14 Nov 2017, 1 commit

Committed by dangqingqing
- 13 Nov 2017, 1 commit

Committed by dangqingqing
- 11 Nov 2017, 2 commits

Committed by dangqingqing
Committed by emailweixu

The `TensorSetConstant` struct is defined in both math_function.cc and math_function.cu. The release build happens to handle this correctly, but in the debug build `set_constant_with_place()` in math_function.cu picks up the `TensorSetConstant` from math_function.cc and crashes.
- 08 Nov 2017, 2 commits
- 26 Oct 2017, 1 commit

Committed by dangqingqing
- 18 Oct 2017, 1 commit

Committed by Markus Kliegl

* initial matmul operator

  Similar to np.matmul, but also has transpose_X and transpose_Y flags, and only supports tensors of rank 1 to 3 inclusive. On GPU it uses cublas?gemmStridedBatched; on CPU it uses cblas_?gemm_batch when available via MKL, and otherwise falls back for now to a simple serial implementation that loops over the batch dimension.
- 16 Oct 2017, 1 commit

Committed by qijun
- 15 Oct 2017, 4 commits
- 21 Sep 2017, 1 commit

Committed by guosheng
- 19 Sep 2017, 1 commit

Committed by Yu Yang

* Also use `const DeviceContext&` everywhere, to prevent `const_cast`. Fix #4169. Fix #3468. Fix #3475.
- 22 Aug 2017, 2 commits
- 21 Aug 2017, 4 commits
- 14 Aug 2017, 2 commits
- 11 Aug 2017, 2 commits
- 10 Aug 2017, 4 commits
- 09 Aug 2017, 1 commit

Committed by qijun

- 07 Aug 2017, 2 commits
- 03 Aug 2017, 2 commits