提交 · 5b71eefc760f4f99be84056257c314ad1bd8b857 · PaddlePaddle / Paddle

12 3月, 2018 1 次提交
- K
  
  address comments · 3b44b849
  由 Kexin Zhao 提交于 3月 11, 2018
  
  3b44b849
10 3月, 2018 3 次提交
- K
  
  fix bug · 95de7617
  由 Kexin Zhao 提交于 3月 09, 2018
  
  95de7617
- K
  
  add gpu info func to get compute cap · 1998d5af
  由 Kexin Zhao 提交于 3月 09, 2018
  
  1998d5af
- K
  
  fix math function arch mismatch for older GPU · d400b419
  由 Kexin Zhao 提交于 3月 09, 2018
  
  d400b419
09 3月, 2018 1 次提交

Add float16 GEMM math function on GPU (#8695) · 90215b78

由 kexinzhao 提交于 3月 08, 2018

* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* initial commit

* fix error

* small fix

* add more gemm fp16 tests

* fix error

* add utility function

90215b78

15 2月, 2018 1 次提交

Update tensor_util.h (#8422) · cfffb1a3

由 Yi Wang 提交于 2月 14, 2018

* Update tensor_util.h

* Update with moved TensorDesc

* Fix tensur_utils.cu

* Update

* Update

* Update

* Update

* Make tensor_util.cu a symbolic link

cfffb1a3

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
15 1月, 2018 1 次提交

Feature/hooks (#7513) · b9b75377

由 dzhwinter 提交于 1月 15, 2018

* add copyright hook

* add copyright hook

* refine copyright hook

* "test copyright hook"

* fix check style

* fix ci

b9b75377

09 1月, 2018 1 次提交
- Y
  Rename CopyFrom to Copy for tensors (#7292) · ce6dad3b
  由 Yu Yang 提交于 1月 09, 2018
```
* Rename Tensor::CopyFrom to Tensor::Copy

* Fix CI

* Fix compile
```
  ce6dad3b
25 12月, 2017 1 次提交
- D
  
  GPUPlace to CUDAPlace (#6960) · 0d2235aa
  由 dzhwinter 提交于 12月 25, 2017
  
  0d2235aa
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

26 11月, 2017 1 次提交

Feature/copytensor (#5455) · 45062fe5

由 dzhwinter 提交于 11月 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

26 10月, 2017 1 次提交
- D
  
  Add unit testing for gemv and fix the gradien check for bais. · ac3370a4
  由 dangqingqing 提交于 10月 26, 2017
  
  ac3370a4
20 10月, 2017 1 次提交

Remove template parameter for Tensor methods (#4937) · c532b967

由 Yu Yang 提交于 10月 19, 2017

* Remove template parameter for Tensor methods

* Also check the type is correct when data()
* Simplize holder_

* Fix accuracy_op

* Register Code

c532b967

16 10月, 2017 1 次提交
- Q
  
  remove SelectedRows functors to selected_rows_functor.h · ab5dc9fe
  由 qijun 提交于 10月 15, 2017
  
  ab5dc9fe
15 10月, 2017 2 次提交
- Q
  
  fix gpu unittest error · 7ef568e8
  由 qijun 提交于 10月 14, 2017
  
  7ef568e8
- Q
  
  add gpu functor for SelectedRows · f59a7c1d
  由 qijun 提交于 10月 14, 2017
  
  f59a7c1d
14 10月, 2017 3 次提交
- Q
  
  remove unused method · 4741266d
  由 qijun 提交于 10月 13, 2017
  
  4741266d
- Q
  
  SelectedRowsAddTensor method · 931572e2
  由 qijun 提交于 10月 13, 2017
  
  931572e2
- Q
  
  add selected_rows add cpu functor · 5be10872
  由 qijun 提交于 10月 13, 2017
  
  5be10872
12 10月, 2017 1 次提交

Unify CUDA stream in Tensor CopyFrom interface (#4692) · 2603cb7e

由 QI JUN 提交于 10月 11, 2017

* init

* unify CopyFrom interface

* fix gpu build error

* fix bug in tensor_py.h

* refine code comments and add TODO list

* fix conflicts in FeedOp and FetchOp

2603cb7e

05 10月, 2017 2 次提交

Y

Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c
由 Yi Wang 提交于 10月 04, 2017

4558807c

Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU` · 84500f94

由 Yu Yang 提交于 10月 04, 2017

By shell command

```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```

84500f94

29 9月, 2017 1 次提交
- Q
  
  add SetConstant method in math_function.h · c634a848
  由 qijun 提交于 9月 28, 2017
  
  c634a848
21 9月, 2017 1 次提交
- G
  
  Add gemm with stride · 9ffa79cd
  由 guosheng 提交于 9月 20, 2017
  
  9ffa79cd
19 9月, 2017 1 次提交

Remove lazy-initialization in device_context · 81d56ca8

由 Yu Yang 提交于 9月 18, 2017

* Also use `const DeviceContext&` all the time, to prevent `const_cast`

Fix #4169
Fix #3468
Fix #3475

81d56ca8

11 8月, 2017 1 次提交
- Q
  
  add unittest · c2631ebf
  由 qijun 提交于 8月 11, 2017
  
  c2631ebf
10 8月, 2017 3 次提交
- Q
  
  format code · 688c43b1
  由 qijun 提交于 8月 10, 2017
  
  688c43b1
- Q
  
  fix bug in dynload · 5f1081d8
  由 qijun 提交于 8月 10, 2017
  
  5f1081d8
- Q
  
  add math_function_test · c5a7471e
  由 qijun 提交于 8月 10, 2017
  
  c5a7471e

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功