提交 · 524f6e9b36bc348b2e428b05b50fc6d60f173279 · BaiXuePrincess / Paddle

29 9月, 2018 2 次提交
- Y
  
  Refine code · 524f6e9b
  由 Yu Yang 提交于 9月 29, 2018
  
  524f6e9b
- Y
  
  Fix bug in uts · 5cf395be
  由 Yu Yang 提交于 9月 28, 2018
  
  5cf395be
28 9月, 2018 2 次提交

refactor(op): polish generate_proposals_op · 593ad763

由 Yu Yang 提交于 9月 28, 2018

Polish styles in generate_proposals_op.

1. inline lambda functions rathar than use std::function to save var.
2. add `static inline` to template functions .cc
   * Make them static to prevent generating symbols.
   * Make them inline to give compiler a hit inline them as possible.
   * Not if the function is not static, they cannot be inlined since the
     symbols should be exported.
3. add `static` to global functions in .cc
   * Make them static to prevent generating symbols.
4. Use Vector<uint64> instead manually manange storage between devices.
5. Prefer to use platform::ForRange, so we can optimize `ForRange` by
   just changing `for_range.h` if it is needed.
6. Do not change shape of inputs

test=develop

593ad763

Y
refactor(memory): rewrite memory allocation and make it extentable · 58ed412f
由 Yu Yang 提交于 9月 28, 2018
```
Use OO style to rewrite memory allocation.
```
58ed412f

27 9月, 2018 5 次提交

C
refine sgd_op (#13626) · 43a3af86
由 chengduo 提交于 9月 27, 2018
```
test=develop
```
43a3af86
Q
Cuda speed for generate_proposals_op. (#13596) · fd4c4df9
由 qingqing01 提交于 9月 27, 2018
```
* Add CUDA implementation for generate_proposals_op.
* Clean code.
* Update code.
```
fd4c4df9

Add distributed unit tests about text_classification/simnet-bow/ctr (#12812) · 97cf1eb6

由 tangwei12 提交于 9月 27, 2018

* add dist ut for text_classification

* add dist ut for text_classification

* add simnet bow unittest

* add dist ut for simnet bow

* add trainning data url for simnet bow

* add trainning data url for simnet bow

* modify simnet test_reader to train reader

* add test_dist_ctr

* test_dist_ctr can run now

* dense update is good

* add unit test for selected rows

* debug unit test

* fix dist sparse update problem

* Constant args at init

* optimize code

* simnet optimize

* fix DebugStringEx

* optimize sum_op.h

* add ScaleOpVarTypeInference

* clean code

* fix test_dist_transpiler.py

* code optimize

* modify delta

* fix sparse update bug

* dist test use one cpu

* update some data

* remove unused code

* add use cuda config

* unit test fix

* unit test fix

* unit test fix

* unit test fix

* dist_word2vec use CPU

* unit test fix

* unit test fix

* code clean

* code clean

* merge develop

* api spec update

* Revert: api spec update

* replace simnet data with fake

* replace simnet data with fake

* update dim

* add batch auc

* code clean

* code clean

* modify print to stderr

* update simnet delta -> 1e-5

* update RUN_STEP

* add use_reader_alloc

* add use_reader_alloc

* add use_reader_alloc

* modify delta

* add use_reader_alloc

* fix stderr write

* python3 compatibility

test=develop

* python3 compatibility, test=develop

* Update dist_text_classification.py

* test=develop

97cf1eb6

T
Revert "Some trivial optimization (#13530)" · a4f7696a
由 typhoonzero 提交于 9月 27, 2018
```
This reverts commit 1d91a49d.
```
a4f7696a

Batch AUC (#13567) · 85362e98

由 tangwei12 提交于 9月 27, 2018

* add distributed auc

* add attr "is distributed" and config it

* add distributed auc

* add batch auc and code format

* code format

* auc optimize

* metric_op optimize

* code clean

* bug fix and code clean

* bug fix and code clean

* code optimize

* code optimize

* api spec update

* Comments optimized

* add mutex

* Revert: add mutex

* remove distribute metric

* remove distribute metric

* spec modifyed

* add annotation, test=develop

* keep API compatibility
test=develop

85362e98

26 9月, 2018 3 次提交
- T
  refine peephole · 209e9c3d
  由 tensor-tang 提交于 9月 26, 2018
```
test=develop
```
  209e9c3d
- C
  Some trivial optimization (#13530) · 1d91a49d
  由 chengduo 提交于 9月 26, 2018
```
* some trivial opt

* remove the fix of lod_tensor and shrink_rnn_memory_op

* refine ShrinkRNNMemoryOp

test=develop
```
  1d91a49d
- K
  
  Fix bug in sequence_slice_op · 5093afce
  由 ktlichkid 提交于 9月 25, 2018
  
  5093afce
25 9月, 2018 3 次提交
- D
  
  flags (#13542) · cc20867d
  由 dzhwinter 提交于 9月 25, 2018
  
  cc20867d
- M
  
  MKLDNN Pooling: inline functions handling ceiled mode · 0e6b303f
  由 Michal Gallus 提交于 9月 25, 2018
  
  0e6b303f
- M
  Enable MKLDNN in Analysis Predictor · f465b03e
  由 Michal Gallus 提交于 9月 20, 2018
```
Also fix MKL-DNN pooling integration for ceil mode
```
  f465b03e
21 9月, 2018 9 次提交
- N
  
  fix comments · 27633216
  由 nhzlx 提交于 9月 21, 2018
  
  27633216
- C
  Fix concat_op InferShape (#13513) · cdf3a4c2
  由 chengduo 提交于 9月 21, 2018
```
* add ShareLoDs

* refine

* add Is EmptyVarName

* refine Sharedlod
```
  cdf3a4c2
- S
  
  remove kwargs in python api · 3ee0a648
  由 sneaxiy 提交于 9月 21, 2018
  
  3ee0a648
- G
  
  fix · dda9c355
  由 gongweibao 提交于 9月 21, 2018
  
  dda9c355
- J
  
  fix roi_perspective_transform_op.cc unused variable caused error on macos · c324cdef
  由 JiabinYang 提交于 9月 21, 2018
  
  c324cdef
- G
  
  fix · ff478417
  由 gongweibao 提交于 9月 21, 2018
  
  ff478417
- W
  [Feature] dist op role and lr op role, to support memory optimize with dist training (#13220) · 29c63d18
  由 Wu Yi 提交于 9月 21, 2018
```
* wip

* clean up

* should fix running with memopt

* add ut

* mark lr schedule op role

* hide lr_schedule_guard

* use op_role_var instead of ufind

* unify dist test name

* wip for py3 support

* fix var deref

* fix python3 mem_opt order

* remove comments
```
  29c63d18
- Y
  
  Fix MixedVector · e1913bc5
  由 Yu Yang 提交于 9月 21, 2018
  
  e1913bc5
- W
  Add roi perspective transform op. (#13176) · fc44087d
  由 whs 提交于 9月 21, 2018
```
* Add roi perspective transform.

* Add roi_perspective_transform_op.

* Fix code style.

* Add python api and fix doc.

* Fix API.spec

* Fix python api.

* Fix API.spec

* Move src to detection.
```
  fc44087d
20 9月, 2018 8 次提交

S

modification · 192c49cb
由 sneaxiy 提交于 9月 20, 2018

192c49cb
S

enhance eager deletion · 0a36ef3c
由 sneaxiy 提交于 9月 20, 2018

0a36ef3c
Y
Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod"" · 6d2c6f96
由 Yu Yang 提交于 9月 20, 2018
```
This reverts commit a6c8d6b9.
```
6d2c6f96
Y
Revert "Merge pull request #13431 from chengduoZH/refine_lod" · a6c8d6b9
由 Yu Yang 提交于 9月 20, 2018
```
This reverts commit bd79e046, reversing
changes made to 6b4d290c.
```
a6c8d6b9
Y

Fix unstable selected_rows_functor_test.cu · b5996fa1
由 Yu Yang 提交于 9月 20, 2018

b5996fa1
S

fix sparse gradient clip · a29b4227
由 sneaxiy 提交于 9月 20, 2018

a29b4227

Refine activation for GRU operator (#13275) · 87086b13

由 Yihua Xu 提交于 9月 20, 2018

* Optimize GRU with AVX instruction

* Clean code

* Add the Unitest and fix the align issue

* Remove the remanent part of the unitest part

* Code clean

* Fix the parameters length issue for fusion_gru to pass CI

* Change the default type as float32

87086b13

Feature/op_fuse_pass (#12440) · d402234b

由 chengduo 提交于 9月 20, 2018

* Add Preface

* Add demo code

* Save file

* Refine code

* seems can work

* use elementwise strategy

* Use ElementwiseComputeEx

* Add comments

* extract functions from operator

* Refine code

* Follow comment

* code refine

* add op_fuse  pass

* add backward

* code refine

* use TopologySortOperations

* follow comments

* refine IsFusible

* code enhance

* fix op_fusion_pass

* refine code

* refine fuse_elemwise_act_op

* adjust the input and output

* refine logic

* add intermediate_edge

* disable inplace

* follow comments

* refine logic

* follow comments

* Remove the removable IntermediateOut

* change strategy

* code refine

* enable fuse backward

* code refine

* code refine

* rename unit test

* follow comments

d402234b

19 9月, 2018 7 次提交
- Q
  [WIP]Sequence Scatter Op (#12625) · 21ec93aa
  由 Qingsheng Li 提交于 9月 19, 2018
```
Sequence Scatter Op
```
  21ec93aa
- N
  
  fix ut error · 4c52be07
  由 nhzlx 提交于 9月 19, 2018
  
  4c52be07
- N
  
  add trt config to arguments · 94a57f1d
  由 nhzlx 提交于 9月 19, 2018
  
  94a57f1d
- C
  Fix the nested dyn_rnn (#13417) · fd8d83e6
  由 chengduo 提交于 9月 19, 2018
```
* add unit test for nested drnn

* add nested dyn_rnn

* refine while_op

* fix bug
```
  fd8d83e6
- W
  Add truncated gaussian initializer. (#13000) · cf128231
  由 whs 提交于 9月 19, 2018
```
* Add truncated gaussian initializer.

* Fix unitest.

* Update API.spec

* Fix code style and fix bug.

* Fix code style.

* Small fix.
```
  cf128231
- J
  
  fix mac compile error · 9d2d3096
  由 JiabinYang 提交于 9月 19, 2018
  
  9d2d3096
- D
  loosen the restriction of output_size in conv2d_transpose (#12292) · 253f618a
  由 Dun 提交于 9月 19, 2018
```
* loosen the restriction of output_size in conv2d_transpose

* test and docs

* fix code style

* fix ci test error

* bug fix

* fix python3 issue
```
  253f618a
18 9月, 2018 1 次提交
- C
  [Accelerate] Refine seq_softmax_op (#13421) · 6757a315
  由 chengduo 提交于 9月 18, 2018
```
* refine seq_softmax_op

* fix seq_softmax

* use cub in seq_softmax
```
  6757a315

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致