Commits · 735eba29760d8b6f58e0374401a78b64a76c3158 · BaiXuePrincess / Paddle

24 Dec, 2017 1 commit

Feature/operator run place (#6783) · 735eba29

Committed by dzhwinter on Dec 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

22 Dec, 2017 1 commit

"remove GPU Sync Interface" (#6793) · abde3130

Committed by dzhwinter on Dec 22, 2017

* "remove GPU Sync Interface"

* "fix typo"

* "fix type cast error"

* "fix related Copy with stream"

* "fix failed tests with DevicePool"

* "fix stupid removed position error"

08 Nov, 2017 1 commit

Compare Operator (#5325) · f74fb790

Committed by Yu Yang on Nov 07, 2017

* Compare Operator

* Follow comments

29 Oct, 2017 1 commit

support sparse output for lookup table grad op (#5145) · 008f40ce

Committed by QI JUN on Oct 28, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

* support sparse output for lookup table grad op

* refine codes

* fix gpu build error

* fix lookup table grad gpu kernel

* fix ci

* fix ci

* fix ci

* fix bug in lookup_table_grad op

* fix bug in test_word2vec

* register double kernel for some operators

* set is_sparse=True in test_word2vec

* fix lookup table grad op CUDA kernel bug

* disable test_modified_huber_loss_op temporarily

* disable test_lstm_unit_op temporarily

12 Oct, 2017 1 commit

Unify CUDA stream in Tensor CopyFrom interface (#4692) · 2603cb7e

Committed by QI JUN on Oct 11, 2017

* init

* unify CopyFrom interface

* fix gpu build error

* fix bug in tensor_py.h

* refine code comments and add TODO list

* fix conflicts in FeedOp and FetchOp

05 Oct, 2017 2 commits

Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c

Committed by Yi Wang on Oct 04, 2017

Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU` · 84500f94

Committed by Yu Yang on Oct 04, 2017

Done with the following shell commands:

```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
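The two substitutions invert the sense of each guard: code that was compiled when `PADDLE_ONLY_CPU` was defined is now compiled when `PADDLE_WITH_GPU` is not defined, and vice versa. A minimal sketch of the effect on a single file follows. The file `sample.h` and its contents are hypothetical, invented only for illustration, and the sketch writes to new files rather than using `-i` so that it does not depend on a particular `sed` implementation.

```bash
#!/usr/bin/env bash
# Hypothetical demo: sample.h and its declarations are made up for this sketch.
set -eu

workdir="$(mktemp -d)"
trap 'rm -rf "$workdir"' EXIT

# A tiny header that uses both guard styles.
cat > "$workdir/sample.h" <<'EOF'
#ifdef PADDLE_ONLY_CPU
void cpu_only();
#endif
#ifndef PADDLE_ONLY_CPU
void with_gpu();
#endif
EOF

# The same two substitutions as in the commit message, applied in the same
# order, but written to new files instead of editing in place.
sed 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' "$workdir/sample.h" > "$workdir/pass1.h"
sed 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' "$workdir/pass1.h" > "$workdir/pass2.h"

result="$(cat "$workdir/pass2.h")"
printf '%s\n' "$result"
```

Neither pass can undo the other: the text `ifndef` does not contain `ifdef` as a substring, and after the first pass the rewritten lines no longer mention `PADDLE_ONLY_CPU`.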

29 Sep, 2017 2 commits

Stabilize elementwise_mul by using double precision · 61cc3ae4

Committed by Yu Yang on Sep 28, 2017

Stabilize elementwise_mul by using double precision · fd479631

Committed by Yu Yang on Sep 28, 2017

28 Sep, 2017 1 commit

Add Skeleton of Double support · 3a5693e0

Committed by Yu Yang on Sep 27, 2017

27 Sep, 2017 1 commit

Unify clang-format and add some missing clang-format · 60857f49

Committed by Yu Yang on Sep 26, 2017

23 Sep, 2017 1 commit

Change namespace of pybind.cc to pybind · f0cd5142

Committed by Yu Yang on Sep 22, 2017

13 Sep, 2017 2 commits

Using LoDTensor instead of Tensor in every operator. · f2992063

Committed by dangqingqing on Sep 13, 2017

Use the inheritance in the definition of LoDTensor. · d11430e0

Committed by dangqingqing on Sep 13, 2017

06 Sep, 2017 1 commit

make dim int to int64_t · 11163dfc

Committed by qijun on Sep 06, 2017

23 Aug, 2017 1 commit

Move pybind from package paddle/framework into paddle/pybind. · bfcaf880

Committed by dangqingqing on Aug 23, 2017

03 Aug, 2017 1 commit

Change `tensor_bind.h` -> `tensor_py.h` · fe5bca49

Committed by Yu Yang on Aug 03, 2017

02 Aug, 2017 1 commit

Move pybind.cc/tensor_bind.h to paddle::framework · 3fc68f6f

Committed by Yu Yang on Aug 02, 2017

Fix #3171

25 Jul, 2017 6 commits

fix bug in register gpu OpKernel · 4ecf68e0

Committed by qijun on Jul 25, 2017

fix gpu build error · 358261f0

Committed by qijun on Jul 25, 2017

fix gpu build error · a71a9e63

Committed by qijun on Jul 25, 2017

fix build error · aa5ca8a9

Committed by qijun on Jul 25, 2017

set default cpu place for tensor alloc · d5109130

Committed by qijun on Jul 25, 2017

enable operator gpu unittest · e2ba1337

Committed by qijun on Jul 25, 2017

19 Jul, 2017 1 commit

Simplify Tensor implementation · 55d30172

Committed by fengjiayi on Jul 19, 2017

ATTENTION: some interfaces changed:
1. `void Tensor::set_dims(const DDim& dims)` ==> `void Tensor::Resize(const DDim& dims)`
2. `void Tensor::ShareDataFrom(const Tensor& src)` ==> `void Tensor::ShareDataWith(const Tensor& src)`
3. `DDim Tensor::dims() const` ==> `const DDim& Tensor::dims() const`

18 Jul, 2017 2 commits

Use friend not to expose tensor's `type/place` · 1dc53a28

Committed by Yu Yang on Jul 18, 2017

Make Tensor <--> Numpy interactive in tensor.h · a89c7ffa

Committed by Yu Yang on Jul 18, 2017

* Follow review comments to separate the Tensor <--> Numpy interaction methods in
  tensor.h.
* Simplify the logic of `CastToPyBufferImpl`: make it a single struct and put it in the
  details namespace.
* Stop exposing `Scope` to Python, since it is currently unused there.
* Remove some debug functions.
