提交 · 750aff10cebd03c3a52bec28508cc5a6195ef937 · PaddlePaddle / PaddleDetection

23 3月, 2018 2 次提交
- C
  
  code refine · 750aff10
  由 chengduoZH 提交于 3月 23, 2018
  
  750aff10
- C
  
  fix concat op · 043f47b2
  由 chengduoZH 提交于 3月 23, 2018
  
  043f47b2
21 3月, 2018 1 次提交
- K
  
  initial commit · 70e71227
  由 Kexin Zhao 提交于 3月 20, 2018
  
  70e71227
20 3月, 2018 2 次提交

CMake refine for HIP support. · e50205e7

由 sabreshao 提交于 3月 20, 2018

1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.

e50205e7

X

add math_function to softmax's dep list · 9eae086e
由 Xi Chen 提交于 3月 19, 2018

9eae086e

17 3月, 2018 2 次提交

K

initial commit · 39c676e2
由 Kexin Zhao 提交于 3月 16, 2018

39c676e2

Fix compilation for gcc5.4 · ab3543e3

由 xuwei06 提交于 3月 16, 2018

The error is:

paddle/fluid/operators/math/concat.cc:47:72: error: invalid initialization of non-const reference of type 'paddle::platform::CPUPlace&' from an rvalue of type 'paddle::platform::CPUPlace'
auto& cpu_place = boost::get<platform::CPUPlace>(context.GetPlace());

Should not use reference for cpu_place.

ab3543e3

16 3月, 2018 4 次提交
- Y
  
  Finish adaption for backward. · bf3f56e8
  由 yangyaming 提交于 3月 15, 2018
  
  bf3f56e8
- S
  Demostration of cmake refine for HIP support. · 45c988d8
  由 sabreshao 提交于 3月 16, 2018
```
1. Add option WITH_AMD_GPU.
2. Add cmake/hip.cmake for HIP toolchain.
3. Some external module such as eigen may need HIP port.
4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
```
  45c988d8
- Q
  
  Delete the detection_output_op, which had been split into several operators. (#9121) · 7c1a0b77
  由 qingqing01 提交于 3月 16, 2018
  
  7c1a0b77
- X
  
  add math_function to selected_rows_functor dependency list · d20c6eb6
  由 Xi Chen 提交于 3月 15, 2018
  
  d20c6eb6
15 3月, 2018 1 次提交

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

由 dzhwinter 提交于 3月 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision'

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"

128adf53

12 3月, 2018 1 次提交
- K
  
  address comments · 3b44b849
  由 Kexin Zhao 提交于 3月 11, 2018
  
  3b44b849
10 3月, 2018 3 次提交
- K
  
  fix bug · 95de7617
  由 Kexin Zhao 提交于 3月 09, 2018
  
  95de7617
- K
  
  add gpu info func to get compute cap · 1998d5af
  由 Kexin Zhao 提交于 3月 09, 2018
  
  1998d5af
- K
  
  fix math function arch mismatch for older GPU · d400b419
  由 Kexin Zhao 提交于 3月 09, 2018
  
  d400b419
09 3月, 2018 1 次提交

Add float16 GEMM math function on GPU (#8695) · 90215b78

由 kexinzhao 提交于 3月 08, 2018

* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* initial commit

* fix error

* small fix

* add more gemm fp16 tests

* fix error

* add utility function

90215b78

07 3月, 2018 3 次提交
- L
  
  add back framework_proto depends · 49f3f1db
  由 Luo Tao 提交于 3月 07, 2018
  
  49f3f1db
- L
  
  rename concat_functor to concat, refine CMakeLists based on comments · 3ddc9971
  由 Luo Tao 提交于 3月 07, 2018
  
  3ddc9971
- K
  Integrate float16 into data_type_transform (#8619) · 266ccaa8
  由 kexinzhao 提交于 3月 06, 2018
```
* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* add context wait
```
  266ccaa8
05 3月, 2018 2 次提交
- C
  
  fix bug for big number; float->double and code refine · 131ec276
  由 chengduoZH 提交于 3月 05, 2018
  
  131ec276
- C
  
  follow comments and refine code · 82bd82c1
  由 chengduoZH 提交于 3月 05, 2018
  
  82bd82c1
03 3月, 2018 1 次提交
- C
  
  get max threads of GPU · 00e596ed
  由 chengduoZH 提交于 3月 02, 2018
  
  00e596ed
02 3月, 2018 2 次提交
- L
  
  refine operator/math/CMakeLists.txt, seperate im2col from math_function · f67275a9
  由 Luo Tao 提交于 3月 01, 2018
  
  f67275a9
- C
  
  refine concat_op · 60e7ee06
  由 chengduoZH 提交于 2月 28, 2018
  
  60e7ee06
15 2月, 2018 1 次提交

Update tensor_util.h (#8422) · cfffb1a3

由 Yi Wang 提交于 2月 14, 2018

* Update tensor_util.h

* Update with moved TensorDesc

* Fix tensur_utils.cu

* Update

* Update

* Update

* Update

* Make tensor_util.cu a symbolic link

cfffb1a3

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33

PaddlePaddle / PaddleDetection 大约 1 年 前同步成功

PaddlePaddle / PaddleDetection
大约 1 年前同步成功