提交 · 310fd5144028720344fc93748f259847f2c17327 · 机器未来 / Paddle

22 5月, 2019 1 次提交

[Lite] enable cross compile and run on mobile of lite (#17541) · 310fd514

由 tensor-tang 提交于 5月 22, 2019

* add cmake

* update

* fix proto pd

* fix compile

* tmp save

* fix protobuf device version

* fix protobuf and host compile

* fix std c++11 support on android

* change array to vector to fix ndk c++_static

* fix rt and add dockerfile

* fix android compile issue with latest merge

* init arm kernels

* enable run on arm

* update format

* update format

* update format

310fd514

16 5月, 2019 1 次提交
- Y
  
  Add some ops for training (#17442) · e9f33320
  由 Yan Chunwei 提交于 5月 16, 2019
  
  e9f33320
14 5月, 2019 4 次提交
- S
  rename CxxPredictor · 72fb4adb
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  72fb4adb
- S
  fix cpplint format · b798c9b9
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  b798c9b9
- S
  reformat for cpplint · fc442ec6
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  fc442ec6
- S
  update cpplint · 397d0567
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  397d0567
13 5月, 2019 3 次提交
- S
  make cxx_api_lite work on ARM · 4e0b25e3
  由 superjomn 提交于 5月 13, 2019
```
test=develop
```
  4e0b25e3
- S
  disable lite by default · 65d2179a
  由 superjomn 提交于 5月 13, 2019
```
test=develop
```
  65d2179a
- S
  fix ARM compile error · 0a20a618
  由 superjomn 提交于 5月 13, 2019
```
test=develop
```
  0a20a618
12 5月, 2019 3 次提交
- S
  refactor TypeSystem · 9174652b
  由 Superjomn 提交于 5月 12, 2019
```
make typesystem simpler
```
  9174652b
- S
  
  refactor type system · 123c79fa
  由 Superjomn 提交于 5月 12, 2019
  
  123c79fa
- C
  Add DropLocalExeScopes in ParallelExecutor (#17297) · bc833945
  由 chengduo 提交于 5月 12, 2019
```
* reset drop local scope counter
test=develop
```
  bc833945
10 5月, 2019 6 次提交
- Z
  
  Add Where Op(#16793) · d4b67e16
  由 zhoukunsheng 提交于 5月 10, 2019
  
  d4b67e16
- Z
  
  Add Diag Op(#17027) · 1bfff020
  由 zhoukunsheng 提交于 5月 10, 2019
  
  1bfff020
- Z
  improve gru unit performance. (#16338) · 8a2caacd
  由 zhaoyuchen2018 提交于 5月 10, 2019
```
refine code

fuse cublas  calling and kernels into one cuda kernel.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  8a2caacd
- S
  
  test=develop (#17322) · ddb24d48
  由 SunGaofeng 提交于 5月 10, 2019
  
  ddb24d48
- Q
  Double backward of conv2d. (#17211) · e32c9888
  由 qingqing01 提交于 5月 10, 2019
```
* Add conv2d_grad_grad_op
* Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
    - Now use it in conv2d_grad_grad.
    - Will simply the searching code in conv2d and conv2d_grad in next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support to fetch empty variables，return None in Python.
```
  e32c9888
- Z
  fix data_type error message (#17312) · 5e5e7b33
  由 Zeng Jinle 提交于 5月 10, 2019
```
test=develop
```
  5e5e7b33
09 5月, 2019 9 次提交
- S
  
  add missing files · 9d6a0c88
  由 superjomn 提交于 5月 09, 2019
  
  9d6a0c88
- S
  
  fix utils_lite compile error · 66e56102
  由 superjomn 提交于 5月 09, 2019
  
  66e56102
- S
  
  remove glog and gtest dependency for light framework · cb44cff2
  由 superjomn 提交于 5月 09, 2019
  
  cb44cff2
- Z
  
  follow comments,test=develop (#17273) · fff270ea
  由 Zeng Jinle 提交于 5月 09, 2019
  
  fff270ea
- Z
  fix: (#17279) · 7a3bb061
  由 Zhaolong Xing 提交于 5月 09, 2019
```
1. infernce multi card occupy
2. facebox model inference occupy too much
test=develop
```
  7a3bb061
- X
  
  add import, test=develop (#17229) · 50ad9046
  由 xiaoting 提交于 5月 09, 2019
  
  50ad9046
- Z
  Mod floordiv (#17251) · 4292bd86
  由 zhoukunsheng 提交于 5月 09, 2019
```
* test=develop
add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
```
  4292bd86
- G
  fix infer_from_dataset and train_from_dataset (#17243) · 5d6a1fcf
  由 guru4elephant 提交于 5月 09, 2019
```
* fix train_from_dataset and infer_from_dataset example

* add inductive dim for data_reader, example: shape=[-1, 1], then -1 will be inducted through run-time reading of number of elements
```
  5d6a1fcf
- C
  use sync copy (#17291) · 516317cf
  由 chengduo 提交于 5月 09, 2019
```
test=develop
```
  516317cf
08 5月, 2019 13 次提交

Fix API example code of save_inference_model (#17274) · 2c446271

由 Huihuang Zheng 提交于 5月 08, 2019

* Fix API example code of save_inference_model

test=develop

* Add "import" in exmaple of save_inference_model

* Fix typo "exsample" -> "example"

test=develop

2c446271

X
modified formula for Lrn (#17281) · 9ed4aaad
由 xiaoting 提交于 5月 08, 2019
```
* modified formula for lrn

test=develop

* modified api.spec

test=develop
```
9ed4aaad

Refine elementwise kernel. (#16952) · 792443ef

由 zhaoyuchen2018 提交于 5月 08, 2019

* Refine elementwise kernel.

Add a simple cuda kernel if grad x and y both exist
Use 2D block cuda kernel to do broadcast.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

792443ef

Repair api example (#17221) · e388a1fb

由 lujun 提交于 5月 08, 2019

Fix the following API examples:

paddle.fluid.scope_guard
paddle.fluid.backward.append_backward
paddle.fluid.cpu_places
paddle.fluid.cuda_pinned_places
paddle.fluid.cuda_places
paddle.fluid.in_dygraph_mode
paddle.fluid.CUDAPlace
paddle.fluid.CPUPlace
paddle.fluid.CUDAPinnedPlace

e388a1fb

Optimize the cuda implementation of sum_op (#17283) · 6b84688b

由 Yiqun Liu 提交于 5月 08, 2019

* Optimize the cuda implementation of sum_op, which add two lod_tensors inplace.
test=develop

* Use eigen to add to tensors.
test=develop

6b84688b

C
update assert (#17282) · db5e74ab
由 chengduo 提交于 5月 08, 2019
```
test=develop
```
db5e74ab

Fix concat shape check (#17247) · c3195de5

由 Hongyu Liu 提交于 5月 08, 2019

* fix shape_check; test=develop

* fix format; test=develop

* fix format; test=develop

* fix ddim bug; test=develop

* fix c++ format; test=develop

* change function name; test=develop

c3195de5

L
Fix api example (#17231) · dab71e8d
由 lvmengsi 提交于 5月 08, 2019
```
* fix API examples, test=develop
```
dab71e8d
W

Fix bp of roi perspective transform op. (#17216) · 7d7e2995
由 whs 提交于 5月 08, 2019

7d7e2995

Adding lrn op for ngraph engine (#17189) · 7bd1d03e

由 baojun 提交于 5月 07, 2019

* added lrn op test=develop

* Added CreateConstant method test=develop

* avoid duplicates test=develop

7bd1d03e

W
improved unit test output (#17266) · 984aa905
由 Wojciech Uss 提交于 5月 08, 2019
```
added printing data type to differentiate int8 and fp32 latency results

test=develop
```
984aa905

Polish Executor and Compiler doc (#17262) · 8f534696

由 chengduo 提交于 5月 08, 2019

* polish doc
test=develop

* updata parallel executor doc
test=develop

* update API.spec
test=develop

* polish code
test=develop

8f534696

G

Fix code in document. (#17237) · 91784f8e
由 gongweibao 提交于 5月 08, 2019

91784f8e

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致