Commits · 028f3dc4e5fcb558041ff168e233a89b41aeaed9 · Crayon鑫 / Paddle

19 Jul, 2017 1 commit
- Add memcpy · 028f3dc4
  Committed by liaogang on Jul 19, 2017
  
  028f3dc4
14 Jul, 2017 16 commits

Follow comments · 03b3d0d8
Committed by liaogang on Jul 14, 2017

03b3d0d8

update PADDLE_ENFORCE message · 57a22db3
Committed by fengjiayi on Jul 14, 2017

57a22db3

update tensor.h · 34beec0f
Committed by fengjiayi on Jul 14, 2017

34beec0f

change int numel_ to size_t numel · 8594d5c3
Committed by fengjiayi on Jul 14, 2017

8594d5c3

fix several compile error · 1f97388a
Committed by fengjiayi on Jul 14, 2017

1f97388a

Committed by fengjiayi on Jul 14, 2017

1. Add a template parameter `T`, which indicates the data type, to the `CopyFrom()`, `Slice()`
and `ShareData()` functions. This makes the `CopyData()` code much clearer.

2. Add `set_dim()`.

3. `product(DDim)` first converts the `DDim` to a `vector<int>` and then calculates
its product, which might be quite slow. Because `product(dims_)` is used frequently
in Tensor, we add a member variable `numel_` as a cache of the
product result.
TODO: refactor `product()` to make it more efficient.

4. Disable `Tensor::operator=`.

5. Remove the POD-type restriction, because `float16` and `int8` are not POD types.

dcfcf687
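
Item 3 of the message above describes caching the element count so that the shape product is not recomputed on every query. The following is a minimal sketch of that idea, not the code from the commit: `Dims`, `Product`, `TensorSketch` and `set_dims` are hypothetical stand-ins for `DDim`, `product()`, `Tensor` and `set_dim()`.

```cpp
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical stand-in for Paddle's DDim: just a list of extents.
using Dims = std::vector<int>;

// Multiply all extents together. An empty shape yields 1.
inline std::size_t Product(const Dims& dims) {
  return std::accumulate(dims.begin(), dims.end(), std::size_t{1},
                         [](std::size_t acc, int d) {
                           assert(d >= 0);  // extents must be non-negative
                           return acc * static_cast<std::size_t>(d);
                         });
}

// Illustrative tensor shell that caches its element count.
class TensorSketch {
 public:
  // The cache is refreshed only when the shape changes.
  void set_dims(const Dims& dims) {
    dims_ = dims;
    numel_ = Product(dims_);
  }

  // Constant-time query; no product is computed here.
  std::size_t numel() const { return numel_; }
  const Dims& dims() const { return dims_; }

 private:
  Dims dims_;
  std::size_t numel_ = 0;  // cached Product(dims_); 0 until a shape is set
};
```

With this layout, every code path that changes `dims_` has to go through `set_dims`, otherwise the cached value becomes stale.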

Refactor `Tensor::CopyFrom()` · a1dc4311
Committed by fengjiayi on Jul 14, 2017

a1dc4311

Optimize ptr (#2851) · 58f3de95

Committed by Qiao Longfei on Jul 14, 2017

* use OperatorPtr = std::shared_ptr<OperatorBase>;
* use ScopePtr = std::shared_ptr<Scope>;

58f3de95
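
The two aliases named in this commit can be illustrated with a small sketch. The class bodies below are hypothetical placeholders (the real `OperatorBase` and `Scope` are much larger), and `AddOpSketch` and `MakeAddOp` exist only for this example.

```cpp
#include <cassert>
#include <memory>
#include <string>

// Hypothetical minimal stand-ins for the real classes.
class Scope {};

class OperatorBase {
 public:
  virtual ~OperatorBase() = default;
  virtual std::string Type() const = 0;
};

// The aliases from the commit message.
using OperatorPtr = std::shared_ptr<OperatorBase>;
using ScopePtr = std::shared_ptr<Scope>;

// Example operator used only in this sketch.
class AddOpSketch : public OperatorBase {
 public:
  std::string Type() const override { return "add"; }
};

// Factory returning the alias type instead of spelling out std::shared_ptr.
inline OperatorPtr MakeAddOp() { return std::make_shared<AddOpSketch>(); }
```

Signatures written against the aliases stay short, and ownership is shared among all copies of the pointer.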

Let OpProto support multiple and temporary (#2860) · 2462d0c5

Committed by Yu Yang on Jul 14, 2017

* Let OpProto support multiple and temporary

* Each input/output of a Paddle Op can be a list. Add a multiple mark to
  OpProto. Also add an `input_format`/`output_format` attribute if the
  Op has multiple inputs or outputs. For the format of that attribute,
  refer to the comments in `op_proto.proto`.
* Add a temporary mark, because some outputs of an Op are not used by the user
  but are used by other ops for faster computation. Explicitly marking which
  outputs are temporary enables future memory/computation optimization.
* Add a generated field to AttrProto.

* Add `AddInputs`/`AddOutputs` functions

* It is more readable to invoke `AddInputs` than
  `AddInput(multiple=true)`.

2462d0c5

Remove useless empty pointer check. · 010adb99
Committed by hedaoyuan on Jul 14, 2017

010adb99

update · 033523ea
Committed by liaogang on Jul 14, 2017

033523ea

Fix: alignment metric · ea916c84
Committed by liaogang on Jul 14, 2017

ea916c84

Fix condition compile · 21b7915d
Committed by liaogang on Jul 14, 2017

21b7915d

ENH: memory test: check alignment and memory size · ab5fe1e9
Committed by liaogang on Jul 14, 2017

ab5fe1e9

change op to operators · e5887301
Committed by qiaolongfei on Jul 14, 2017

e5887301

Fix optimizer parameter buffer allocation size. · 11660eab
Committed by Helin Wang on Jul 13, 2017
```
The buffer allocation size should be number of bytes, not number of
floats.
```
11660eab
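
The distinction this commit message draws, bytes versus number of floats, can be sketched as follows. The diff itself is not shown on this page, so the helper names `FloatBufferBytes` and `CopyParameters` are assumptions made for illustration only.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>

// Number of bytes needed to hold `count` floats.
inline std::size_t FloatBufferBytes(std::size_t count) {
  return count * sizeof(float);
}

// Copy `count` floats into a freshly allocated buffer.
// The caller owns the result and releases it with std::free.
inline float* CopyParameters(const float* src, std::size_t count) {
  // Passing `count` instead of `bytes` here would under-allocate
  // by a factor of sizeof(float).
  const std::size_t bytes = FloatBufferBytes(count);
  float* dst = static_cast<float*>(std::malloc(bytes));
  if (dst == nullptr) return nullptr;
  std::memcpy(dst, src, bytes);
  return dst;
}
```

Both `std::malloc` and `std::memcpy` take sizes in bytes, which is why the element count has to be scaled by `sizeof(float)` before either call.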

13 Jul, 2017 9 commits
- Move the download of ndk to build_android.sh script file. · 62908dcc
  Committed by Liu Yiqun on Jul 13, 2017
  
  62908dcc
- Follow comments · 79b70c2d
  Committed by Yu Yang on Jul 13, 2017
```
* Convert `op` --> `operators`
* Remove AddType in OpProtoMaker, because type is part of registry.
* Rename CPU_OR_GPU --> DEVICE_TYPE in registry macro.
```
  79b70c2d
- Add memory alignment test · 00572aa4
  Committed by liaogang on Jul 13, 2017
  
  00572aa4
- Add a sample op, `add_op` · a0aaafe9
  Committed by Yu Yang on Jul 13, 2017
```
* Refine the register methods so that Ops no longer need whole-archive linking
* Call `USE_OP` before an op is used.
* Add unittest for add_op.
```
  a0aaafe9
- Add build_android task on Travis CI. · 95897fd1
  Committed by Liu Yiqun on Jul 13, 2017
  
  95897fd1
- Add Init to OperatorBase (#2838) · 728665d7
  Committed by Qiao Longfei on Jul 13, 2017
  
  728665d7
- ENH: Remove comments · ff98e3c1
  Committed by liaogang on Jul 13, 2017
  
  ff98e3c1
- fix bug in dynload · 4e918377
  Committed by qijun on Jul 13, 2017
  
  4e918377
- Add go testing into cmake and fix libpaddle_go_optimizer.a link path · e4be077f
  Committed by Helin Wang on Jul 11, 2017
  
  e4be077f
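
The `add_op` commit (a0aaafe9) in the group above mentions registering ops without whole-archive linking and calling `USE_OP` before an op is used. One common way to get that effect is a static registrar object plus a "use" macro that references a symbol from the registering file, so the linker keeps that object file. The sketch below shows the mechanism in a single file, where nothing would be dropped anyway. All names (`OpSketch`, `Registry`, `REGISTER_OP_SKETCH`, `USE_OP_SKETCH`) are hypothetical and are not Paddle's actual macros.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <string>
#include <utility>

// Hypothetical operator interface for this sketch.
struct OpSketch {
  virtual ~OpSketch() = default;
  virtual int Run(int a, int b) const = 0;
};

using OpCreator = std::function<std::unique_ptr<OpSketch>()>;

// A function-local static avoids static-initialization-order problems.
inline std::map<std::string, OpCreator>& Registry() {
  static std::map<std::string, OpCreator> registry;
  return registry;
}

// Constructing a Registrar adds a creator to the registry.
struct Registrar {
  Registrar(const std::string& type, OpCreator creator) {
    Registry()[type] = std::move(creator);
  }
  int Touch() const { return 0; }
};

// Defines a static registrar and a function that refers to it.
#define REGISTER_OP_SKETCH(name, cls)                                  \
  Registrar registrar_##name(                                          \
      #name, [] { return std::unique_ptr<OpSketch>(new cls()); });     \
  int TouchOpRegistrar_##name() { return registrar_##name.Touch(); }

// References the function above, which forces the registering
// object file to be linked when it lives in a static library.
#define USE_OP_SKETCH(name)             \
  extern int TouchOpRegistrar_##name(); \
  static int use_op_##name = TouchOpRegistrar_##name()

struct AddSketch : OpSketch {
  int Run(int a, int b) const override { return a + b; }
};

REGISTER_OP_SKETCH(add, AddSketch)
USE_OP_SKETCH(add);

// Look up a creator by type name; returns nullptr if none is registered.
inline std::unique_ptr<OpSketch> CreateOp(const std::string& type) {
  auto it = Registry().find(type);
  if (it == Registry().end()) return nullptr;
  return it->second();
}
```

In a real build the `REGISTER_OP_SKETCH` line would sit in the op's own source file inside a static library, and `USE_OP_SKETCH` would sit in the code that needs the op.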
12 Jul, 2017 14 commits
- test OpKernel (#2820) · be441f7d
  Committed by Qiao Longfei on Jul 12, 2017
```
Add unit test for OpKernel
```
  be441f7d
- add memory header file · 70d937c5
  Committed by qijun on Jul 12, 2017
  
  70d937c5
- Add Tensor::CopyFrom and Tensor::mutable_data(Place place) · 69d99d48
  Committed by fengjiayi on Jul 12, 2017
```
1. Add `Tensor::CopyFrom`. The current version supports only CPU memory
copies. GPU support will be provided later by `paddle::memory`.
The current implementation of `Tensor::CopyFrom` is a little inefficient:
every time `CopyFrom` is called, the tensor re-allocates its memory. However, if
we tried to check and reuse `placeholder_`, we would have to provide a template
parameter for `CopyFrom` to indicate the data type, which seems strange for
a simple copy function.

2. Add `Tensor::mutable_data(Place place)`, which directly uses the member
variable `dims_` as its dim parameter. This interface is required by
`Op::InferShape`.
```
  69d99d48
- follow comments · be2c1a3b
  Committed by qijun on Jul 12, 2017
  
  be2c1a3b
- follow comments · a07deac9
  Committed by qijun on Jul 12, 2017
  
  a07deac9
- follow comments · 85806e75
  Committed by qijun on Jul 12, 2017
  
  85806e75
- fix gpu build error · 8ee50a35
  Committed by qijun on Jul 12, 2017
  
  8ee50a35
- fix pybind compile question · e0ea87c9
  Committed by Luo Tao on Jul 12, 2017
  
  e0ea87c9
- follow comments · 4d336d90
  Committed by qijun on Jul 12, 2017
  
  4d336d90
- Add OperatorWithKernel class · 0ff81920
  Committed by Yu Yang on Jul 12, 2017
```
* Users can register OpKernels to their Ops. The OpKernelMap is stored in
  OperatorWithKernel. Each Op that inherits from OperatorWithKernel will
  use `OpKernel::Compute` instead of Run.
```
  0ff81920
- refine device_context · ef5f9deb
  Committed by qijun on Jul 12, 2017
  
  ef5f9deb
- fix code style · 6bbc2944
  Committed by qijun on Jul 12, 2017
  
  6bbc2944
- remove unused deps · b5a8d5b4
  Committed by qijun on Jul 12, 2017
  
  b5a8d5b4
- fix gpu build error · 8f5a9fd9
  Committed by qijun on Jul 12, 2017
  
  8f5a9fd9
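
The `OperatorWithKernel` commit (0ff81920) in the group above describes ops that keep a map of registered kernels and dispatch to `OpKernel::Compute` instead of implementing `Run` themselves. A minimal sketch of that dispatch pattern follows. The key type (`DeviceSketch`), the simplified `Compute` signature and all class names are assumptions for illustration. The real classes take execution contexts and use their own kernel key.

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <stdexcept>
#include <utility>

// Hypothetical kernel key: one kernel per device type.
enum class DeviceSketch { kCPU, kGPU };

// Hypothetical kernel interface with a simplified Compute.
struct OpKernelSketch {
  virtual ~OpKernelSketch() = default;
  virtual float Compute(float x) const = 0;
};

// An operator that owns a kernel map and forwards Run to a kernel.
class OperatorWithKernelSketch {
 public:
  void RegisterKernel(DeviceSketch device,
                      std::unique_ptr<OpKernelSketch> kernel) {
    kernels_[device] = std::move(kernel);
  }

  // Run looks up the kernel for the device and calls Compute on it.
  float Run(DeviceSketch device, float x) const {
    auto it = kernels_.find(device);
    if (it == kernels_.end()) {
      throw std::runtime_error("no kernel registered for this device");
    }
    return it->second->Compute(x);
  }

 private:
  std::map<DeviceSketch, std::unique_ptr<OpKernelSketch>> kernels_;
};

// Example CPU kernel that doubles its input.
struct ScaleCpuKernel : OpKernelSketch {
  float Compute(float x) const override { return 2.0f * x; }
};
```

Separating the kernel from the operator lets one op definition serve several devices, with only the kernel implementations differing.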

Crayon鑫 / Paddle is in sync with the fork source project