Commits · 4ff237f93c85521fbd69ac618735de3acdd822e2 · s920243400 / PaddleDetection

06 Apr, 2018 1 commit
- follow comments · 4ff237f9
  Committed by chengduoZH on Apr 06, 2018
04 Apr, 2018 1 commit
- add PyCUDAPinnedTensorSetFromArray · 8e4e155c
  Committed by chengduoZH on Apr 04, 2018
15 Mar, 2018 2 commits

Always synchronize when copy data on GPU from C++ to Numpy array. (#9110) · 45073b7c
Committed by qingqing01 on Mar 15, 2018

Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123

Committed by Kexin Zhao on Mar 14, 2018

* add fp16 mul op support

* small fix

* fix bug

* small fix

* fix PADDLE_WITH_CUDA compiling issue

* reorg code

* test for pybind

* treate as float16 as uint16_t in pybind

* bind np.float16 to paddle float16

* small fix

* clean code

* remove redundancy

* fix mul_op test

* address comments

* small fix

* add is_float16_supported func


15 Feb, 2018 1 commit

Update tensor_util.h (#8422) · cfffb1a3

Committed by Yi Wang on Feb 14, 2018

* Update tensor_util.h

* Update with moved TensorDesc

* Fix tensur_utils.cu

* Update

* Update

* Update

* Update

* Make tensor_util.cu a symbolic link


12 Feb, 2018 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
10 Feb, 2018 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
14 Jan, 2018 1 commit

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

Committed by dzhwinter on Jan 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv tranpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* ""fix pooling

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"


28 Dec, 2017 1 commit
- Fix ALL RNN error · d2cb2841
  Committed by Yang Yu on Dec 28, 2017
27 Dec, 2017 3 commits
- Rename API of DeviceContext (#7055) · 15e8c80e
  Committed by Yu Yang on Dec 27, 2017
```
* Rename API of DeviceContext

Make them as usual names.

* Rename API of DeviceContext

Make them as usual names.

* Fix compile

* Fix compile

* Fix compile

* Fix compile

* Fix compile
```
- Fix compile · 16a84328
  Committed by Yang Yu on Dec 27, 2017
- Fix compile · a5291f9c
  Committed by Yang Yu on Dec 27, 2017
26 Dec, 2017 1 commit
- unify the indentation of license · 761b3297
  Committed by Luo Tao on Dec 26, 2017
25 Dec, 2017 1 commit
- GPUPlace to CUDAPlace (#6960) · 0d2235aa
  Committed by dzhwinter on Dec 25, 2017
24 Dec, 2017 1 commit

Feature/operator run place (#6783) · 735eba29

Committed by dzhwinter on Dec 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"


22 Dec, 2017 1 commit

"remove GPU Sync Interface" (#6793) · abde3130

Committed by dzhwinter on Dec 22, 2017

* "remove GPU Sync Interface"

* "fix typo"

* "fix type cast error"

* "fix related Copy with stream"

* "fix failed tests with DevicePool"

* "fix stupid removed position error"


08 Nov, 2017 1 commit
- Compare Operator (#5325) · f74fb790
  Committed by Yu Yang on Nov 07, 2017
```
* Compare Operator

* Follow comments
```
29 Oct, 2017 1 commit

support sparse output for lookup table grad op (#5145) · 008f40ce

Committed by QI JUN on Oct 28, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

* support sparse output for lookup table grad op

* refine codes

* fix gpu build error

* fix lookup table grad gpu kernel

* fix ci

* fix ci

* fix ci

* fix bug in lookup_table_grad op

* fix bug in test_word2vec

* register double kernel for some operators

* set is_sparse=True in test_word2vec

* fix lookup table grad op CUDA kernel bug

* disable test_modified_huber_loss_op temporarily

* disable test_lstm_unit_op temporarily


12 Oct, 2017 1 commit

Unify CUDA stream in Tensor CopyFrom interface (#4692) · 2603cb7e

Committed by QI JUN on Oct 11, 2017

* init

* unify CopyFrom interface

* fix gpu build error

* fix bug in tensor_py.h

* refine code comments and add TODO list

* fix conflicts in FeedOp and FetchOp


05 Oct, 2017 2 commits

Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c
Committed by Yi Wang on Oct 04, 2017


Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU` · 84500f94

Committed by Yu Yang on Oct 04, 2017

By shell command

```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```


29 Sep, 2017 2 commits
- Stablize elementwise_mul by using double precision · 61cc3ae4
  Committed by Yu Yang on Sep 28, 2017
- Stablize elementwise_mul by using double precision · fd479631
  Committed by Yu Yang on Sep 28, 2017
28 Sep, 2017 1 commit
- Add Skeleton of Double support · 3a5693e0
  Committed by Yu Yang on Sep 27, 2017
27 Sep, 2017 1 commit
- Unify clang-format and add some missing clang-format · 60857f49
  Committed by Yu Yang on Sep 26, 2017
23 Sep, 2017 1 commit
- Change namespace of pybind.cc to pybind · f0cd5142
  Committed by Yu Yang on Sep 22, 2017
13 Sep, 2017 2 commits
- Using LoDTensor instead of Tensor in every operator. · f2992063
  Committed by dangqingqing on Sep 13, 2017
- Use the inheritance in the definition of LoDTensor. · d11430e0
  Committed by dangqingqing on Sep 13, 2017
06 Sep, 2017 1 commit
- make dim int to int64_t · 11163dfc
  Committed by qijun on Sep 06, 2017
23 Aug, 2017 1 commit
- Move pybind from package paddle/framework into paddle/pybind. · bfcaf880
  Committed by dangqingqing on Aug 23, 2017
03 Aug, 2017 1 commit
- Change `tensor_bind.h` -> `tensor_py.h` · fe5bca49
  Committed by Yu Yang on Aug 03, 2017
02 Aug, 2017 1 commit
- Move pybind.cc/tensor_bind.h to paddle::framework · 3fc68f6f
  Committed by Yu Yang on Aug 02, 2017
```
Fix #3171
```
25 Jul, 2017 6 commits
- fix bug in register gpu OpKernel · 4ecf68e0
  Committed by qijun on Jul 25, 2017
- fix gpu build error · 358261f0
  Committed by qijun on Jul 25, 2017
- fix gpu build error · a71a9e63
  Committed by qijun on Jul 25, 2017
- fix build error · aa5ca8a9
  Committed by qijun on Jul 25, 2017
- set default cpu place for tensor alloc · d5109130
  Committed by qijun on Jul 25, 2017
- enable operator gpu unittest · e2ba1337
  Committed by qijun on Jul 25, 2017
19 Jul, 2017 1 commit

Simplify Tensor implimentation · 55d30172

Committed by fengjiayi on Jul 19, 2017

ATTENTION: some interfaces changed:
1. void Tensor::set_dims(const DDim& dims) ==> void Tensor::Resize(const DDim& dims).
2. void Tensor::ShareDataFrom(const Tensor& src) ==> void Tensor::ShareDataWith(const Tensor& src)
3. DDim Tensor::dims() const ==> const DDim& Tensor::dims() const
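
For readers migrating call sites, the three changes listed above can be sketched roughly as follows. This is only an illustration: `DDim` and `Tensor` here are hypothetical stand-ins written for this example, not Paddle's real classes, and `Allocate`, `data`, and `MigratedCaller` are helper names invented for the sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-ins, NOT Paddle's real types: just enough structure to
// show the three signatures named in the commit message.
struct DDim {
  std::vector<int64_t> sizes;
};

class Tensor {
 public:
  // 1. Previously: void set_dims(const DDim& dims)
  void Resize(const DDim& dims) { dims_ = dims; }

  // 2. Previously: void ShareDataFrom(const Tensor& src)
  void ShareDataWith(const Tensor& src) {
    holder_ = src.holder_;  // share the buffer instead of copying it
    dims_ = src.dims_;
  }

  // 3. Previously: DDim dims() const, which returned a copy
  const DDim& dims() const { return dims_; }

  // Illustrative helpers for this sketch only.
  void Allocate() {
    std::size_t n = 1;
    for (int64_t s : dims_.sizes) n *= static_cast<std::size_t>(s);
    holder_ = std::make_shared<std::vector<float>>(n);
  }
  float* data() const { return holder_ ? holder_->data() : nullptr; }

 private:
  DDim dims_;
  std::shared_ptr<std::vector<float>> holder_;
};

// Example caller written against the new names.
inline void MigratedCaller() {
  Tensor a;
  a.Resize(DDim{{2, 3}});  // was: a.set_dims(...)
  a.Allocate();

  Tensor b;
  b.ShareDataWith(a);      // was: b.ShareDataFrom(a)
  assert(b.data() == a.data());

  // dims() now hands back a reference, so in this stand-in a held
  // reference observes a later Resize; copy it if a snapshot is needed.
  const DDim& view = a.dims();
  a.Resize(DDim{{4}});
  assert(view.sizes.size() == 1);
}
```

The first two changes are plain renames. The third is the one most likely to matter in practice: code that relied on `dims()` returning an independent copy would need to take that copy explicitly, at least under the assumptions of this sketch.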

