- 15 7月, 2017 1 次提交
-
-
由 liaogang 提交于
-
- 14 7月, 2017 28 次提交
-
-
由 gangliao 提交于
[Done] Memory Management: Buddy Allocator
-
由 fengjiayi 提交于
Add Tensor::CopyFrom and Tensor::mutable_data(Place place)
-
由 liaogang 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 hedaoyuan 提交于
Remove useless empty pointer check.
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
1. Add template T which indicates data type to `CopyFrom()`, `Slice()` and `ShareData()` functions. This makes `CopyData()` code much clearer. 2. Add `set_dim()`. 3. `product(DDim)` transforms `DDim` to `vector<int>` first and then calculate its product. That might be quite slow. For `product(dims_)` is frequently used in Tensor, we add a mumber variable `numel_` as a cache of the product result. TODO: refactor `product()` to make it more efficient. 4. Unable Tensor::operator= 5. Remove the limit of POD type, because `float16` and `int8` are not POD type.
-
由 Cao Ying 提交于
add configuration helper for the gated unit.
-
由 fengjiayi 提交于
-
由 caoying03 提交于
-
由 caoying03 提交于
-
由 Qiao Longfei 提交于
* use OperatorPtr = std::shared_ptr<OperatorBase>; * use ScopePtr = std::share_ptr<Scope>;
-
由 Yu Yang 提交于
* Let OpProto support multiple and temporary * Each input/output of Paddle's Op could be a list. Add multiple mark to OpProto. Also add a `input_format`/`output_format` attribute if that Op has multiple input or output. The format of that attribute please reference the comments in `op_proto.proto` * Add temporary mark, because some output of an Op is not used by user but used by other op for faster computation. Explicitly mark which output is temporary could let future memory/computation optimization. * Add generated field to AttrProto. * Add `AddInputs`/`AddOutputs` function * It is more readable to invoke `AddInputs` not `AddInput(multiple=true)`.
-
由 hedaoyuan 提交于
-
由 liaogang 提交于
-
由 liaogang 提交于
-
由 liaogang 提交于
-
由 liaogang 提交于
-
由 liaogang 提交于
-
由 helinwang 提交于
Fix optimizer parameter buffer allocation size.
-
由 Yiqun Liu 提交于
Add build_android task on Travis CI.
-
-
由 Yu Yang 提交于
Add a sample op, `add_op`
-
由 qiaolongfei 提交于
-
由 Helin Wang 提交于
-
由 Helin Wang 提交于
The buffer allocation size should be number of bytes, not number of floats.
-
- 13 7月, 2017 11 次提交
-
-
由 Liu Yiqun 提交于
-
由 Yu Yang 提交于
* Convert `op` --> `operators` * Remove AddType in OpProtoMaker, because type is part of registry. * Rename CPU_OR_GPU --> DEVICE_TYPE in registry macro.
-
由 Yu Yang 提交于
Default scope function
-
由 liaogang 提交于
-
由 Yu Yang 提交于
* Refine register methods, make Op can get rid of whole-archieve * `USE_OP` before a op is used. * Add unittest for add_op.
-
由 Liu Yiqun 提交于
-
由 QI JUN 提交于
fix bug in dynload
-
由 qijun 提交于
-
由 Qiao Longfei 提交于
-
由 hedaoyuan 提交于
Compile for armv8.
-
由 liaogang 提交于
-