提交 · 01fbcb0bbb6509b50a4f0082e29e0c075c85b5ce · 兽拳 / Paddle

25 6月, 2018 1 次提交
- S
  
  Add Python array reader op · 697ba4b1
  由 sneaxiy 提交于 6月 25, 2018
  
  697ba4b1
20 6月, 2018 1 次提交
- F
  
  Add unit tests · 47c02b5c
  由 fengjiayi 提交于 6月 20, 2018
  
  47c02b5c
02 5月, 2018 1 次提交
- F
  
  fix · d11b8e56
  由 fengjiayi 提交于 4月 28, 2018
  
  d11b8e56
25 4月, 2018 1 次提交
- Y
  
  Clean memcpy async · 0c24b3f9
  由 Yu Yang 提交于 4月 25, 2018
  
  0c24b3f9
17 4月, 2018 4 次提交
- Y
  
  Add comments and clean code · 2ab12ca2
  由 Yu Yang 提交于 4月 17, 2018
  
  2ab12ca2
- Y
  
  Add wait · a822f8dd
  由 Yu Yang 提交于 4月 16, 2018
  
  a822f8dd
- Y
  
  Revert · e9e27e0f
  由 Yu Yang 提交于 4月 16, 2018
  
  e9e27e0f
- Y
  
  Sync Copy · 0ca28b85
  由 Yu Yang 提交于 4月 16, 2018
  
  0ca28b85
08 4月, 2018 1 次提交
- L
  
  fix compiler error on `tensor_py.h` · 50e036a4
  由 Luo Tao 提交于 4月 08, 2018
  
  50e036a4
07 4月, 2018 1 次提交
- Y
  Fix cpplint errors of paddle/fluid/pybind and add some tests (#9694) · 1543c4cf
  由 Yi Wang 提交于 4月 06, 2018
```
* cpplint test and add tesnor_py_test.cc

* Update

* Update
```
  1543c4cf
06 4月, 2018 1 次提交
- C
  
  follow comments · 4ff237f9
  由 chengduoZH 提交于 4月 06, 2018
  
  4ff237f9
04 4月, 2018 1 次提交
- C
  
  add PyCUDAPinnedTensorSetFromArray · 8e4e155c
  由 chengduoZH 提交于 4月 04, 2018
  
  8e4e155c
15 3月, 2018 2 次提交

Q

Always synchronize when copy data on GPU from C++ to Numpy array. (#9110) · 45073b7c
由 qingqing01 提交于 3月 15, 2018

45073b7c

Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123

由 Kexin Zhao 提交于 3月 14, 2018

* add fp16 mul op support

* small fix

* fix bug

* small fix

* fix PADDLE_WITH_CUDA compiling issue

* reorg code

* test for pybind

* treate as float16 as uint16_t in pybind

* bind np.float16 to paddle float16

* small fix

* clean code

* remove redundancy

* fix mul_op test

* address comments

* small fix

* add is_float16_supported func

e26f1123

15 2月, 2018 1 次提交

Update tensor_util.h (#8422) · cfffb1a3

由 Yi Wang 提交于 2月 14, 2018

* Update tensor_util.h

* Update with moved TensorDesc

* Fix tensur_utils.cu

* Update

* Update

* Update

* Update

* Make tensor_util.cu a symbolic link

cfffb1a3

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
14 1月, 2018 1 次提交

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

由 dzhwinter 提交于 1月 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv tranpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* ""fix pooling

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

28 12月, 2017 1 次提交
- Y
  
  Fix ALL RNN error · d2cb2841
  由 Yang Yu 提交于 12月 28, 2017
  
  d2cb2841
27 12月, 2017 3 次提交
- Y
  Rename API of DeviceContext (#7055) · 15e8c80e
  由 Yu Yang 提交于 12月 27, 2017
```
* Rename API of DeviceContext

Make them as usual names.

* Rename API of DeviceContext

Make them as usual names.

* Fix compile

* Fix compile

* Fix compile

* Fix compile

* Fix compile
```
  15e8c80e
- Y
  
  Fix compile · 16a84328
  由 Yang Yu 提交于 12月 27, 2017
  
  16a84328
- Y
  
  Fix compile · a5291f9c
  由 Yang Yu 提交于 12月 27, 2017
  
  a5291f9c
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
25 12月, 2017 1 次提交
- D
  
  GPUPlace to CUDAPlace (#6960) · 0d2235aa
  由 dzhwinter 提交于 12月 25, 2017
  
  0d2235aa
24 12月, 2017 1 次提交

Feature/operator run place (#6783) · 735eba29

由 dzhwinter 提交于 12月 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

735eba29

22 12月, 2017 1 次提交

"remove GPU Sync Interface" (#6793) · abde3130

由 dzhwinter 提交于 12月 22, 2017

* "remove GPU Sync Interface"

* "fix typo"

* "fix type cast error"

* "fix related Copy with stream"

* "fix failed tests with DevicePool"

* "fix stupid removed position error"

abde3130

08 11月, 2017 1 次提交
- Y
  Compare Operator (#5325) · f74fb790
  由 Yu Yang 提交于 11月 07, 2017
```
* Compare Operator

* Follow comments
```
  f74fb790
29 10月, 2017 1 次提交

support sparse output for lookup table grad op (#5145) · 008f40ce

由 QI JUN 提交于 10月 28, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

* support sparse output for lookup table grad op

* refine codes

* fix gpu build error

* fix lookup table grad gpu kernel

* fix ci

* fix ci

* fix ci

* fix bug in lookup_table_grad op

* fix bug in test_word2vec

* register double kernel for some operators

* set is_sparse=True in test_word2vec

* fix lookup table grad op CUDA kernel bug

* disable test_modified_huber_loss_op temporarily

* disable test_lstm_unit_op temporarily

008f40ce

12 10月, 2017 1 次提交

Unify CUDA stream in Tensor CopyFrom interface (#4692) · 2603cb7e

由 QI JUN 提交于 10月 11, 2017

* init

* unify CopyFrom interface

* fix gpu build error

* fix bug in tensor_py.h

* refine code comments and add TODO list

* fix conflicts in FeedOp and FetchOp

2603cb7e

05 10月, 2017 2 次提交

Y

Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c
由 Yi Wang 提交于 10月 04, 2017

4558807c

Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU` · 84500f94

由 Yu Yang 提交于 10月 04, 2017

By shell command

```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```

84500f94

29 9月, 2017 2 次提交
- Y
  
  Stablize elementwise_mul by using double precision · 61cc3ae4
  由 Yu Yang 提交于 9月 28, 2017
  
  61cc3ae4
- Y
  
  Stablize elementwise_mul by using double precision · fd479631
  由 Yu Yang 提交于 9月 28, 2017
  
  fd479631
28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
27 9月, 2017 1 次提交
- Y
  
  Unify clang-format and add some missing clang-format · 60857f49
  由 Yu Yang 提交于 9月 26, 2017
  
  60857f49
23 9月, 2017 1 次提交
- Y
  
  Change namespace of pybind.cc to pybind · f0cd5142
  由 Yu Yang 提交于 9月 22, 2017
  
  f0cd5142
13 9月, 2017 2 次提交
- D
  
  Using LoDTensor instead of Tensor in every operator. · f2992063
  由 dangqingqing 提交于 9月 13, 2017
  
  f2992063
- D
  
  Use the inheritance in the definition of LoDTensor. · d11430e0
  由 dangqingqing 提交于 9月 13, 2017
  
  d11430e0
06 9月, 2017 1 次提交
- Q
  
  make dim int to int64_t · 11163dfc
  由 qijun 提交于 9月 06, 2017
  
  11163dfc

兽拳 / Paddle 与 Fork 源项目一致

兽拳 / Paddle
与 Fork 源项目一致