提交 · defd562c26bb52292a1cfd9ed12fe39cf4d1b66e · PaddlePaddle / Paddle

29 5月, 2019 4 次提交
- T
  
  [lite] fix arm deps (#17716) · defd562c
  由 tensor-tang 提交于 5月 29, 2019
  
  defd562c
- H
  enable softmax op and add unit test (#17703) · e242f781
  由 hong19860320 提交于 5月 29, 2019
```
* enable softmax op and add unit test

* move softmax sub-functions to softmax.cc, and move basic math functions to funcs.h
```
  e242f781
- L
  migrate several ops: (#17606) · ca45ed53
  由 liuwei1031 提交于 5月 29, 2019
```
* migrate several ops:
  mean,
  mean_grad
  fill_constant
  square_grad
  elementwise_sub_grad
  mul_grad

* add sdg_op

* fix kernel platform registration issue

* code cleanup

* fix platform typo
```
  ca45ed53
- T
  
  [lite] fix fc bias and enable armv7 fc (#17695) · b16ae4e2
  由 tensor-tang 提交于 5月 29, 2019
  
  b16ae4e2
28 5月, 2019 1 次提交

[Lite] enable fc kernel (#17674) · 4b253569

由 tensor-tang 提交于 5月 28, 2019

* add fc unit test

* refine eigen fc
add cpu info, arm context
init packed sgemm

* enable packed sgemm

* add arm math

* pass fc ut

* follow comments

4b253569

24 5月, 2019 2 次提交
- T
  
  fix cmake deps and update dockerfile (#17635) · e170ea03
  由 tensor-tang 提交于 5月 24, 2019
  
  e170ea03
- Y
  
  lite/init test trigger (#17627) · 4adb6195
  由 Yan Chunwei 提交于 5月 24, 2019
  
  4adb6195
23 5月, 2019 1 次提交

code clean - refine ARM compile (#17590) · 59122809

由 Yan Chunwei 提交于 5月 23, 2019

* code clean - refine ARM

cmake enhancement:

- add lite_cc_library and lite_cc_test

code clean:

- remove ARM feed and fetch kernels, reuse the Host's

remove unnecessary comments

59122809

22 5月, 2019 1 次提交

[Lite] enable cross compile and run on mobile of lite (#17541) · 310fd514

由 tensor-tang 提交于 5月 22, 2019

* add cmake

* update

* fix proto pd

* fix compile

* tmp save

* fix protobuf device version

* fix protobuf and host compile

* fix std c++11 support on android

* change array to vector to fix ndk c++_static

* fix rt and add dockerfile

* fix android compile issue with latest merge

* init arm kernels

* enable run on arm

* update format

* update format

* update format

310fd514

16 5月, 2019 1 次提交
- Y
  
  Add some ops for training (#17442) · e9f33320
  由 Yan Chunwei 提交于 5月 16, 2019
  
  e9f33320
14 5月, 2019 4 次提交
- S
  rename CxxPredictor · 72fb4adb
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  72fb4adb
- S
  fix cpplint format · b798c9b9
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  b798c9b9
- S
  reformat for cpplint · fc442ec6
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  fc442ec6
- S
  update cpplint · 397d0567
  由 superjomn 提交于 5月 14, 2019
```
test=develop
```
  397d0567
13 5月, 2019 3 次提交
- S
  make cxx_api_lite work on ARM · 4e0b25e3
  由 superjomn 提交于 5月 13, 2019
```
test=develop
```
  4e0b25e3
- S
  disable lite by default · 65d2179a
  由 superjomn 提交于 5月 13, 2019
```
test=develop
```
  65d2179a
- S
  fix ARM compile error · 0a20a618
  由 superjomn 提交于 5月 13, 2019
```
test=develop
```
  0a20a618
12 5月, 2019 3 次提交
- S
  refactor TypeSystem · 9174652b
  由 Superjomn 提交于 5月 12, 2019
```
make typesystem simpler
```
  9174652b
- S
  
  refactor type system · 123c79fa
  由 Superjomn 提交于 5月 12, 2019
  
  123c79fa
- C
  Add DropLocalExeScopes in ParallelExecutor (#17297) · bc833945
  由 chengduo 提交于 5月 12, 2019
```
* reset drop local scope counter
test=develop
```
  bc833945
10 5月, 2019 6 次提交
- Z
  
  Add Where Op(#16793) · d4b67e16
  由 zhoukunsheng 提交于 5月 10, 2019
  
  d4b67e16
- Z
  
  Add Diag Op(#17027) · 1bfff020
  由 zhoukunsheng 提交于 5月 10, 2019
  
  1bfff020
- Z
  improve gru unit performance. (#16338) · 8a2caacd
  由 zhaoyuchen2018 提交于 5月 10, 2019
```
refine code

fuse cublas  calling and kernels into one cuda kernel.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  8a2caacd
- S
  
  test=develop (#17322) · ddb24d48
  由 SunGaofeng 提交于 5月 10, 2019
  
  ddb24d48
- Q
  Double backward of conv2d. (#17211) · e32c9888
  由 qingqing01 提交于 5月 10, 2019
```
* Add conv2d_grad_grad_op
* Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
    - Now use it in conv2d_grad_grad.
    - Will simply the searching code in conv2d and conv2d_grad in next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support to fetch empty variables，return None in Python.
```
  e32c9888
- Z
  fix data_type error message (#17312) · 5e5e7b33
  由 Zeng Jinle 提交于 5月 10, 2019
```
test=develop
```
  5e5e7b33
09 5月, 2019 9 次提交
- S
  
  add missing files · 9d6a0c88
  由 superjomn 提交于 5月 09, 2019
  
  9d6a0c88
- S
  
  fix utils_lite compile error · 66e56102
  由 superjomn 提交于 5月 09, 2019
  
  66e56102
- S
  
  remove glog and gtest dependency for light framework · cb44cff2
  由 superjomn 提交于 5月 09, 2019
  
  cb44cff2
- Z
  
  follow comments,test=develop (#17273) · fff270ea
  由 Zeng Jinle 提交于 5月 09, 2019
  
  fff270ea
- Z
  fix: (#17279) · 7a3bb061
  由 Zhaolong Xing 提交于 5月 09, 2019
```
1. infernce multi card occupy
2. facebox model inference occupy too much
test=develop
```
  7a3bb061
- X
  
  add import, test=develop (#17229) · 50ad9046
  由 xiaoting 提交于 5月 09, 2019
  
  50ad9046
- Z
  Mod floordiv (#17251) · 4292bd86
  由 zhoukunsheng 提交于 5月 09, 2019
```
* test=develop
add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
```
  4292bd86
- G
  fix infer_from_dataset and train_from_dataset (#17243) · 5d6a1fcf
  由 guru4elephant 提交于 5月 09, 2019
```
* fix train_from_dataset and infer_from_dataset example

* add inductive dim for data_reader, example: shape=[-1, 1], then -1 will be inducted through run-time reading of number of elements
```
  5d6a1fcf
- C
  use sync copy (#17291) · 516317cf
  由 chengduo 提交于 5月 09, 2019
```
test=develop
```
  516317cf
08 5月, 2019 5 次提交

Fix API example code of save_inference_model (#17274) · 2c446271

由 Huihuang Zheng 提交于 5月 08, 2019

* Fix API example code of save_inference_model

test=develop

* Add "import" in exmaple of save_inference_model

* Fix typo "exsample" -> "example"

test=develop

2c446271

X
modified formula for Lrn (#17281) · 9ed4aaad
由 xiaoting 提交于 5月 08, 2019
```
* modified formula for lrn

test=develop

* modified api.spec

test=develop
```
9ed4aaad

Refine elementwise kernel. (#16952) · 792443ef

由 zhaoyuchen2018 提交于 5月 08, 2019

* Refine elementwise kernel.

Add a simple cuda kernel if grad x and y both exist
Use 2D block cuda kernel to do broadcast.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

792443ef

Repair api example (#17221) · e388a1fb

由 lujun 提交于 5月 08, 2019

Fix the following API examples:

paddle.fluid.scope_guard
paddle.fluid.backward.append_backward
paddle.fluid.cpu_places
paddle.fluid.cuda_pinned_places
paddle.fluid.cuda_places
paddle.fluid.in_dygraph_mode
paddle.fluid.CUDAPlace
paddle.fluid.CPUPlace
paddle.fluid.CUDAPinnedPlace

e388a1fb

Optimize the cuda implementation of sum_op (#17283) · 6b84688b

由 Yiqun Liu 提交于 5月 08, 2019

* Optimize the cuda implementation of sum_op, which add two lod_tensors inplace.
test=develop

* Use eigen to add to tensors.
test=develop

6b84688b

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功