Commits · 7cd24b13182bcdcbdb455a430d54d70172e73a59 · PaddlePaddle / PaddleDetection

18 Dec, 2018 1 commit

add ir memory optimize. (#14530) · 7cd24b13

Authored by dzhwinter on Dec 18, 2018

* follow comments. test=develop

* Fix typo

* fix compile error. test=develop

* merge develop branch. test=develop

* Remove set_equal

* Polish code

* Delete unused functions

test=develop

* polish code. test=develop

* follow comment

* polish code.

* fix windows compile error. test=develop

* fix op handle.

* rerun ci. test=develop

* rerun ci. test=develop

* rerun macci. test=develop

* polish code. test=develop

* rewrite sort code. test=develop

* remove unused code. test=develop

* fix tests. test=develop

* fix conflict. test=develop

* follow comment. test=develop

* merge develop branch. test=develop

* fix tests. test=develop

* remove ToTypeIndex. test=develop

* rerun ci. test=develop

7cd24b13

30 Oct, 2018 1 commit
- add a small test to verify tensor type · e2db0b9b
  Authored by Xin Pan on Oct 30, 2018
```
test=develop
```
  e2db0b9b
01 Aug, 2018 1 commit

"cherry picked cpp tests" (#12182) · 0c8fde7d

Authored by dzhwinter on Aug 01, 2018

* "cherry picked cpp tests"

* "cherry picked"

* "cherry picked tests"

* "merge develop branch"

0c8fde7d

07 Jun, 2018 1 commit

Mkldnn layout (#11040) · 3ff9ba0e

Authored by mozga-intel on Jun 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hardcoded in all MKLDNN OP kernels. As a result, a
non-optimized execution path was selected in the MKLDNN primitive,
which degraded performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout: conv, pool2d, and batch_norm.
Other MKLDNN OP kernels also need to be updated in a similar way to
achieve the best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inheritance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

12 Feb, 2018 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Authored by qingqing01 on Feb 12, 2018
  24509f4a
10 Feb, 2018 2 commits
- Correct #include path · fc374821
  Authored by Yi Wang on Feb 09, 2018
  fc374821
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Authored by Yi Wang on Feb 09, 2018
  90648f33
21 Jan, 2018 1 commit

"fix decode bug" (#7711) · e983cc90

Authored by dzhwinter on Jan 21, 2018

* "fix decode bug"

* "follow comment"

* "fix error"

* "fix hook bug"

* fix based comment

* fix copyright

* fix based on comment

e983cc90

15 Jan, 2018 1 commit

Feature/hooks (#7513) · b9b75377

Authored by dzhwinter on Jan 15, 2018

* add copyright hook

* add copyright hook

* refine copyright hook

* "test copyright hook"

* fix check style

* fix ci

b9b75377

28 Dec, 2017 1 commit

Implement selectedrows serialize and deserialize (#7042) · 2cdef424

Authored by Yancey on Dec 28, 2017

* implement selectedrows serialize and deserialize

* make serialize/deserialize as global function

* recover send_imp.cc

* delete unused brackets

* fix compile error

* serialize version in LodTensor and SelectedRows

* fix ci

* fix ci

2cdef424

25 Dec, 2017 2 commits
- "add data layout" (#6955) · 7777c811
  Authored by dzhwinter on Dec 25, 2017
```
* "add data layout"

* "need kernel registry support"

* "fix data layout"

* "reorder include headers"

* "change enum to enum class"

* "fix CI"
```
  7777c811
- GPUPlace to CUDAPlace (#6960) · 0d2235aa
  Authored by dzhwinter on Dec 25, 2017
  0d2235aa
26 Nov, 2017 1 commit

Feature/copytensor (#5455) · 45062fe5

Authored by dzhwinter on Nov 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

20 Oct, 2017 1 commit

Remove template parameter for Tensor methods (#4937) · c532b967

Authored by Yu Yang on Oct 19, 2017

* Remove template parameter for Tensor methods

* Also check the type is correct when data()
* Simplify holder_

* Fix accuracy_op

* Register Code

c532b967

12 Oct, 2017 1 commit

Unify CUDA stream in Tensor CopyFrom interface (#4692) · 2603cb7e

Authored by QI JUN on Oct 11, 2017

* init

* unify CopyFrom interface

* fix gpu build error

* fix bug in tensor_py.h

* refine code comments and add TODO list

* fix conflicts in FeedOp and FetchOp

2603cb7e

10 Oct, 2017 1 commit
- Adding implementation for copying a vector to a tensor (#4635) · 383faaf7
  Authored by Abhinav Arora on Oct 09, 2017
```
* Adding implementation for copying a vector to tensor
* Changing Tensor test to access gpu memory indirectly
```
  383faaf7
05 Oct, 2017 2 commits

Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c

Authored by Yi Wang on Oct 04, 2017

4558807c

Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU` · 84500f94

Authored by Yu Yang on Oct 04, 2017

Applied with the following shell commands:

```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
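To illustrate what the two `sed` substitutions do to a guarded block, here is a minimal sketch. `BuildFlavor()` is a hypothetical helper invented for this example and is not part of the repository. The point is only that the condition is inverted along with the macro name, so each branch keeps its meaning.

```cpp
#include <cassert>
#include <string>

// Before the rename, CPU-only code was guarded by:  #ifdef PADDLE_ONLY_CPU
// After the rename, the same branch is guarded by:  #ifndef PADDLE_WITH_GPU
// BuildFlavor() is a hypothetical helper used only for illustration.
inline std::string BuildFlavor() {
#ifndef PADDLE_WITH_GPU
  return "cpu-only";  // formerly the `#ifdef PADDLE_ONLY_CPU` branch
#else
  return "gpu";       // formerly the `#ifndef PADDLE_ONLY_CPU` branch
#endif
}
```

Compiled without `-DPADDLE_WITH_GPU`, the helper returns `"cpu-only"`. Defining the macro selects the other branch.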

84500f94

15 Sep, 2017 1 commit
- modified codes · 39d79e64
  Authored by zchen0211 on Sep 14, 2017
  39d79e64
08 Sep, 2017 1 commit
- Fix CI test · d8921e9d
  Authored by Yu Yang on Sep 07, 2017
  d8921e9d
07 Sep, 2017 2 commits
- Follow comments · 5aacd64b
  Authored by fengjiayi on Sep 06, 2017
  5aacd64b
- Follow comments · f2a66ffa
  Authored by fengjiayi on Sep 06, 2017
  f2a66ffa
06 Sep, 2017 1 commit
- tensor element size support · adfef243
  Authored by Zhuoyuan on Sep 05, 2017
  adfef243
05 Sep, 2017 1 commit
- WIP · e76fa85c
  Authored by fengjiayi on Sep 04, 2017
  e76fa85c
09 Aug, 2017 1 commit

LODTensor (Level of details, or Level of sequences Tensor). (#3109) · ede02d7d

Authored by Yan Chunwei on Aug 09, 2017

* add lodtensor

* add reshape of lod

* add details

* rename Elements/Levels

* size_t and vector reserve

* add details

* add const& std::shared_ptr

* add lod_tensor_impl.h

* remove a shared_ptr

ede02d7d

08 Aug, 2017 1 commit

fix some enforce (#3301) · 2af35002

Authored by Yan Chunwei on Aug 08, 2017

* fix some enforce

* remove compatible_type to avoid compile error

* remove shared_ptr

* fix tensor error msg

2af35002

28 Jul, 2017 1 commit
- use cuda default stream · 5364b394
  Authored by qijun on Jul 28, 2017
  5364b394
26 Jul, 2017 1 commit
- ENH: Add GPU CopyFrom Unit Test · 1c68f119
  Authored by liaogang on Jul 26, 2017
  1c68f119
25 Jul, 2017 2 commits
- Fix unittest · bc09551e
  Authored by Yu Yang on Jul 25, 2017
  bc09551e
- ENH: Refine Tensor and CopyFrom · de8a8fee
  Authored by liaogang on Jul 25, 2017
  de8a8fee
19 Jul, 2017 1 commit

Simplify Tensor implementation · 55d30172

Authored by fengjiayi on Jul 19, 2017

ATTENTION: some interfaces changed:
1. `void Tensor::set_dims(const DDim& dims)` ==> `void Tensor::Resize(const DDim& dims)`
2. `void Tensor::ShareDataFrom(const Tensor& src)` ==> `void Tensor::ShareDataWith(const Tensor& src)`
3. `DDim Tensor::dims() const` ==> `const DDim& Tensor::dims() const`
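
The three renamed interfaces can be pictured with a toy class. This is only a sketch under assumptions: `ToyTensor`, the `std::vector`-based `DDim` alias, and the float-only `mutable_data()` helper are hypothetical stand-ins, not the framework's actual types. It shows the shape of each new signature: `Resize` records dimensions, `ShareDataWith` aliases the source buffer, and `dims()` returns a reference rather than a copy.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for the framework's DDim type.
using DDim = std::vector<int64_t>;

class ToyTensor {
 public:
  // Formerly set_dims(): records the new shape and allocates nothing.
  void Resize(const DDim& dims) { dims_ = dims; }

  // Formerly ShareDataFrom(): both tensors reference the same buffer.
  void ShareDataWith(const ToyTensor& src) {
    holder_ = src.holder_;
    dims_ = src.dims_;
  }

  // Returns a const reference instead of a copy of the dimensions.
  const DDim& dims() const { return dims_; }

  // Toy allocation helper (assumption: float elements only).
  float* mutable_data() {
    std::size_t n = 1;
    for (int64_t d : dims_) n *= static_cast<std::size_t>(d);
    holder_ = std::make_shared<std::vector<float>>(n);
    return holder_->data();
  }

  const float* data() const { return holder_ ? holder_->data() : nullptr; }

 private:
  DDim dims_;
  std::shared_ptr<std::vector<float>> holder_;
};
```

Returning `const DDim&` avoids copying the dimensions on every call, which matters when `dims()` is queried in hot paths.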

55d30172

15 Jul, 2017 4 commits
- add conditional compilation for tensor · afa2a88d
  Authored by fengjiayi on Jul 15, 2017
  afa2a88d
- fix compile error · 66cf21c8
  Authored by fengjiayi on Jul 15, 2017
  66cf21c8
- enable tensor memory test · 68adb954
  Authored by fengjiayi on Jul 15, 2017
  68adb954
- ENH: unify PADDLE_ENFORCE · f812de2c
  Authored by liaogang on Jul 15, 2017
  f812de2c
14 Jul, 2017 2 commits

fix several compile errors · 1f97388a

Authored by fengjiayi on Jul 14, 2017

1f97388a

Refactor Tensor::CopyFrom() · dcfcf687

Authored by fengjiayi on Jul 14, 2017

1. Add a template parameter T, which indicates the data type, to the `CopyFrom()`, `Slice()`
and `ShareData()` functions. This makes the `CopyData()` code much clearer.

2. Add `set_dim()`.

3. `product(DDim)` first transforms the `DDim` into a `vector<int>` and then calculates
its product, which might be quite slow. Because `product(dims_)` is frequently
used in Tensor, we add a member variable `numel_` as a cache of the
product result.
TODO: refactor `product()` to make it more efficient.

4. Disable `Tensor::operator=`.

5. Remove the POD type restriction, because `float16` and `int8` are not POD types.
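
The `numel_` cache described in item 3 can be sketched as follows. This is a minimal illustration under assumptions, not the commit's code: `CachedShape` and the `std::vector`-based `DDim` alias are hypothetical, and `product()` here is a simplified stand-in for the framework function of the same name. The idea is to compute the product once when the shape changes and return the stored value on every later query.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the framework's DDim type.
using DDim = std::vector<int64_t>;

// Simplified stand-in for product(DDim): multiplies all dimensions.
inline int64_t product(const DDim& dims) {
  int64_t result = 1;
  for (int64_t d : dims) result *= d;
  return result;
}

class CachedShape {
 public:
  // The product is recomputed only when the shape changes.
  void set_dim(const DDim& dims) {
    dims_ = dims;
    numel_ = product(dims_);
  }

  // Constant-time lookup of the cached element count.
  int64_t numel() const { return numel_; }

  const DDim& dims() const { return dims_; }

 private:
  DDim dims_;
  int64_t numel_ = 0;  // cache of product(dims_); 0 until a shape is set
};
```

The cache stays valid only if every code path that modifies `dims_` also refreshes `numel_`, which is why both updates live in the same setter here.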

dcfcf687

12 Jul, 2017 1 commit

Add Tensor::CopyFrom and Tensor::mutable_data(Place place) · 69d99d48

Authored by fengjiayi on Jul 12, 2017

1. Add `Tensor::CopyFrom`. The current version only supports CPU memory
copy. GPU support will be provided later by `paddle::memory`.
The current implementation of `Tensor::CopyFrom` is a little inefficient:
every time `CopyFrom` is called, the tensor re-allocates its memory. However, if
we try to check and reuse `placeholder_`, we have to provide a template
parameter for `CopyFrom` to indicate the data type, which seems strange for
a simple copy function.

2. Add `Tensor::mutable_data(Place place)`, which directly uses the member
variable `dims_` as its dim parameter. This interface is required by
`Op::InferShape`.
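
The trade-off in item 1 and the `dims_`-driven allocation in item 2 can be sketched with a toy class. Everything here is an assumption made for illustration: `ToyTensor`, the empty `CPUPlace` tag, the `std::vector`-based `DDim` alias, and the float-only buffer are hypothetical, and the real `mutable_data` is not claimed to look like this. The sketch shows a `CopyFrom` that always allocates a fresh destination buffer, and a `mutable_data` that sizes its allocation from the stored dimensions.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <memory>
#include <vector>

using DDim = std::vector<int64_t>;  // hypothetical stand-in for DDim
struct CPUPlace {};                 // hypothetical placement tag

class ToyTensor {
 public:
  void set_dims(const DDim& dims) { dims_ = dims; }

  // Allocates a buffer whose size comes from the stored dims_.
  float* mutable_data(CPUPlace /*place*/) {
    placeholder_ = std::make_shared<std::vector<float>>(numel());
    return placeholder_->data();
  }

  // CPU-only copy that re-allocates the destination on every call.
  void CopyFrom(const ToyTensor& src) {
    if (this == &src) return;  // nothing to do for a self-copy
    assert(src.placeholder_ != nullptr && "source tensor holds no data");
    dims_ = src.dims_;
    float* dst = mutable_data(CPUPlace{});
    std::memcpy(dst, src.placeholder_->data(), numel() * sizeof(float));
  }

  const float* data() const {
    return placeholder_ ? placeholder_->data() : nullptr;
  }
  const DDim& dims() const { return dims_; }

 private:
  std::size_t numel() const {
    std::size_t n = 1;
    for (int64_t d : dims_) n *= static_cast<std::size_t>(d);
    return n;
  }

  DDim dims_;
  std::shared_ptr<std::vector<float>> placeholder_;
};
```

Reusing the existing buffer would require knowing the element type in order to compare sizes, which is the template parameter the commit message is reluctant to add.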

69d99d48

11 Jul, 2017 1 commit
- add more test · 0665dc97
  Authored by fengjiayi on Jul 11, 2017
  0665dc97
03 Jul, 2017 1 commit
- re-submit · d054a5ee
  Authored by fengjiayi on Jul 03, 2017
  d054a5ee

PaddlePaddle / PaddleDetection: synced successfully about 1 year ago
