Commits · d20c6eb6de9bbce1de800156217a1f459bea8990 · s920243400 / PaddleDetection

16 Mar, 2018 · 1 commit
- add math_function to selected_rows_functor dependency list · d20c6eb6
  Committed by Xi Chen on Mar 15, 2018
15 Mar, 2018 · 1 commit

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

Committed by dzhwinter on Mar 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision'

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"

12 Mar, 2018 · 1 commit
- address comments · 3b44b849
  Committed by Kexin Zhao on Mar 11, 2018
10 Mar, 2018 · 3 commits
- fix bug · 95de7617
  Committed by Kexin Zhao on Mar 09, 2018
- add gpu info func to get compute cap · 1998d5af
  Committed by Kexin Zhao on Mar 09, 2018
- fix math function arch mismatch for older GPU · d400b419
  Committed by Kexin Zhao on Mar 09, 2018
09 Mar, 2018 · 1 commit

Add float16 GEMM math function on GPU (#8695) · 90215b78

Committed by kexinzhao on Mar 08, 2018

* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* initial commit

* fix error

* small fix

* add more gemm fp16 tests

* fix error

* add utility function

07 Mar, 2018 · 3 commits
- add back framework_proto depends · 49f3f1db
  Committed by Luo Tao on Mar 07, 2018
- rename concat_functor to concat, refine CMakeLists based on comments · 3ddc9971
  Committed by Luo Tao on Mar 07, 2018
- Integrate float16 into data_type_transform (#8619) · 266ccaa8
  Committed by kexinzhao on Mar 06, 2018
```
* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* add context wait
```
05 Mar, 2018 · 2 commits
- fix bug for big number; float->double and code refine · 131ec276
  Committed by chengduoZH on Mar 05, 2018
- follow comments and refine code · 82bd82c1
  Committed by chengduoZH on Mar 05, 2018
03 Mar, 2018 · 1 commit
- get max threads of GPU · 00e596ed
  Committed by chengduoZH on Mar 02, 2018
02 Mar, 2018 · 2 commits
- refine operator/math/CMakeLists.txt, seperate im2col from math_function · f67275a9
  Committed by Luo Tao on Mar 01, 2018
- refine concat_op · 60e7ee06
  Committed by chengduoZH on Feb 28, 2018
15 Feb, 2018 · 1 commit

Update tensor_util.h (#8422) · cfffb1a3

Committed by Yi Wang on Feb 14, 2018

* Update tensor_util.h

* Update with moved TensorDesc

* Fix tensur_utils.cu

* Update

* Update

* Update

* Update

* Make tensor_util.cu a symbolic link

12 Feb, 2018 · 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
10 Feb, 2018 · 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
