提交 · a83b792ada0e6833277f8300ede05c533677062a · PaddlePaddle / PaddleDetection

13 6月, 2018 1 次提交
- Q
  
  add row_size for selected rows in DebugStringEx · 7ebef493
  由 qiaolongfei 提交于 6月 13, 2018
  
  7ebef493
07 6月, 2018 3 次提交

Big data op_test benchmark, for checking output consistent in different runs. (#10646) · f7c96f07

由 dzhwinter 提交于 6月 07, 2018

* "init benchmark ops"

* "untrack outputs"

* "delete some usused code"

* "benchmark"

* "fix ci"

* "fix op test"

* "fix uint16 missing"

* "fix ci"

* "follow comments"

* "fix ci"

* "follow comments"

* "conficts. merge develop branch"

* repick

* "merge develop branch"

f7c96f07

F

fix bugs in the implementation of 'HasInput' and 'HasOutput' · dc8e0b49
由 fengjiayi 提交于 6月 07, 2018

dc8e0b49

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

30 5月, 2018 1 次提交
- F
  
  fix a bug · 3bce3dbc
  由 fengjiayi 提交于 5月 30, 2018
  
  3bce3dbc
29 5月, 2018 1 次提交
- C
  
  move check_nan_inf to operator · cb1c657c
  由 chengduoZH 提交于 5月 29, 2018
  
  cb1c657c
03 5月, 2018 1 次提交

Fix the bug when a input variable of op is dispensable. (#10268) · 6084af47

由 Yiqun Liu 提交于 5月 03, 2018

* Fix the bug when a input variable of op is dispensable.

* Add HasInputs/Outputs interfaces to OperatorBase.

* Remove the unreferenced header file.

6084af47

25 4月, 2018 2 次提交

Clean up unused code in operator class (#10035) · 81dfc0cf

由 Yang Yang(Tony) 提交于 4月 24, 2018

* delete unused IsNetOp() and Rename()

* rm OperatorBase::Rename implementation

* delete Operator::InputVars()

* remove unused OperatorBase::ShareLoD; ShareLoD has been implemented in infershape

* organize operatorbase; remove unused set_type

* add comments

* fix comment

81dfc0cf

A
Fix CPPLint issues in framework/data_transform framework/prune.cc (#10178) · f09aed04
由 Abhinav Arora 提交于 4月 24, 2018
```
* Fic CPPLint issues with data_transform

* Fic CPPLint issues with prune.cc
```
f09aed04

12 4月, 2018 1 次提交

Dist transpiler support prefetch (#9714) · 4c55a602

由 Qiao Longfei 提交于 4月 12, 2018

* init

* add some check

* add dist transpile logic

* add insert op for block

* init change get_pserver_program

* optimize code

* fix a bug

* can run now

* start to do table split

* start to process table gradient

* complete pserver part

* can send_vars now

* revert cpplint

* fix a bug

* optimize code

* move dist test to models

* revert the interface of distribute_transpiler.transpile

* fix prefetch_block

* optimize trainspiler code

* add comment to sum_op

* add warning log

* fix comment

* fix test_send_recv

* fix test_send_recv

* fix train with no distributed table

* optimize GetDims

4c55a602

04 4月, 2018 1 次提交
- Q
  
  add GetDataTypeOfVar · e66bd4cb
  由 qiaolongfei 提交于 4月 04, 2018
  
  e66bd4cb
30 3月, 2018 1 次提交

Fix data transform when inplace (#9450) · 23bab34c

由 Qiao Longfei 提交于 3月 30, 2018

* fix data transform when op have inplace in/out

* add log

* should not delete scope because Compute maybe async

* optimize code

23bab34c

14 3月, 2018 1 次提交
- Y
  
  Move back operator's event to RunImpl() · 90afbd28
  由 Yibing Liu 提交于 3月 14, 2018
  
  90afbd28
12 3月, 2018 1 次提交
- Y
  
  Remove dims in base class · 225efa67
  由 Yu Yang 提交于 3月 12, 2018
  
  225efa67
09 3月, 2018 1 次提交
- L
  
  Refine the profile codes for inference. · a8e85077
  由 Liu Yiqun 提交于 3月 09, 2018
  
  a8e85077
27 2月, 2018 1 次提交
- Y
  
  Fix the profiler's bug in multi-gpu mode · ee88855d
  由 Yibing Liu 提交于 2月 27, 2018
  
  ee88855d
16 2月, 2018 1 次提交

[WIP] Move DataType enum inside VarType (#8447) · c7ad26d6

由 Abhinav Arora 提交于 2月 15, 2018

* Move Pod Types from DataType enum to Type enum

* Fixed data_type.h

* Fix type in TensorDesc

* Add comment to framework.proto

* Fixed type in data_type.h

* Updated format of type in data_type.h

* Fix var_desc.h

* Fix op_kernel_type.h

* Fixed data_type_transform_test.cc

* Fix operator.h

* Fixed data_type_transform.cc

* Fixed op_kernel_type_test.cc

* Fix operator.cc

* Fixed data_layout_transform_test.cc

* Fix var_desc.cc

* Fixed assign_value_op.cc

* Fixed assign_value_op.h

* fixed protobuf.cc

* Fix data_layout_transform_test.cc and op_kernel_type_test.cc

* Fixed rnn_memory_helper_op.cc

* Fix progrma_desc_test.cc

* Fixed fill_constant_batch_size_like_op.cc

* Fix operator_test.cc

* Fixed fill_constant_op.cc

* Fixed gaussian_random_op.cc

* Fixed uniform_random_op.cc

* Fixed edit_distance_op.cc

* Fixed fill_constant_batch_size_like_op.cc

* Fixed rnn_memory_helper_op.cc

* Fixed chunk_eval_op.cc

* Fixed assign_value_op.cc

* Fixed assign_value_op.h

* Fixed cast_op.h

* Fixed cast_op.h

* Fix fill constant op

* Fixed clang for assign_value_op.cc

* Fix one_hot_op.h

* Fix one_hot_op.cc

* Fix fill_op.cc

* Fixed sum_op.cc

* Fixed sum_op clang

* Fix uniform_random_op.cc

* Fix gaussian_random_op.cc

* Fix backward.cc

* Fix protobuf.cc

* Fixed prune_test.cc

* Fixed op_registry_test.cc

* Fix data_device_transform_test.cu

* Fix travis error

* Fixed one_hot_op.cu

* Fixed op_registry_test.cc

* Fixed nccl_op.cc

* Fixing python tests

* Revert "Fixing python tests"

This reverts commit fccaa4c5818ed9f379ea1ce4315066cc78076c64.

* Fixing Pybind to remove data type

* Fixing tensor.py

* Updated the new files:

* Resolve error in merge conflict of fill_constant_batch_size_like_op.cc

c7ad26d6

13 2月, 2018 1 次提交

Separate VarType from VarDesc in framework.proto and fix all related compiler errors (#8414) · fcadb452

由 Abhinav Arora 提交于 2月 12, 2018

* Refine Type system

* Fixing type inference

* Fixed create_reader_op.cc

* Fix var_desc.h

* Fixed executor.cc

* Fix shape_inference.h

* Fixed create_reader_op.cc

* Fix tensor_util.h

* Fixed var_type_inference_test.cc

* Fix shape_inference.cc

* Fixed sum_op.c

* Fixed read_op.cc

* Fix var_type.h

* Fixed beam_search_decode_op.cc

* sendrecvop_utils.cc

* Fix operator.cc

* Fixed lookup_table_op.cc

* Fixed op_desc.cc

* Fixed get_places_op.cc

* Fixed lod_rank_table_op.cc

* Fixed beam_search_op.cc

* Fix var_desc.cc

* Fixed lod_tensor_to_array_op.cc

* Fixed while_op.cc

* Fix program_desc_test.cc

* tensor_array_read_write_op.cc

* Fix assign_op.cc

* Fix executor.cc

* Fix protobuf.cc

* Fix protobuf.cc

fcadb452

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
09 2月, 2018 1 次提交
- Y
  
  use op run as wrapper of run_impl; make run_impl as private virtual function · 98c94373
  由 Yang Yang 提交于 2月 09, 2018
  
  98c94373
06 2月, 2018 2 次提交
- F
  
  refine code and add unit tests · 0bb9c80e
  由 fengjiayi 提交于 2月 06, 2018
  
  0bb9c80e
- F
  
  Add ReadOp · 1010e39b
  由 fengjiayi 提交于 2月 06, 2018
  
  1010e39b
02 2月, 2018 1 次提交
- F
  
  simplify shape inference code · 0575fd46
  由 fengjiayi 提交于 2月 02, 2018
  
  0575fd46
31 1月, 2018 1 次提交
- D
  "unify flags" (#7973) · 80eff266
  由 dzhwinter 提交于 1月 31, 2018
```
* "unify flags"

* "fix init"
```
  80eff266
19 1月, 2018 1 次提交
- Q
  Bugfix/check if kernel for type exist (#7657) · 50ac67fc
  由 Qiao Longfei 提交于 1月 19, 2018
```
* check if kernel if found for kernel type

* do kernel check before data transform
```
  50ac67fc
14 1月, 2018 1 次提交

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

由 dzhwinter 提交于 1月 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv tranpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* ""fix pooling

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

12 1月, 2018 1 次提交

Add get lod for debug (#7375) · 23df6c44

由 Qiao Longfei 提交于 1月 12, 2018

* add GetLoD for debug

* add LoDToString

* optimize if

* typo

* add lod_tensor to operator's dependency

23df6c44

10 1月, 2018 3 次提交
- Q
  reorganize data transform related code (#7391) · 377424bf
  由 Qiao Longfei 提交于 1月 10, 2018
```
* init data_type_transform

* split data_layout_transform

* tmp rm data_transform_test

* change device_data_transform to data_device_transform

* clean code

* clean code
```
  377424bf
- D
  
  "fix CI" · a6edc038
  由 dzhwinter 提交于 1月 09, 2018
  
  a6edc038
- D
  
  "add flags" · f0316bdb
  由 dzhwinter 提交于 1月 09, 2018
  
  f0316bdb
09 1月, 2018 1 次提交
- Q
  
  fix GetDims bug · 8b1a81a9
  由 qiaolongfei 提交于 1月 09, 2018
  
  8b1a81a9
08 1月, 2018 5 次提交

Q

fix priority · 0b52cc88
由 qiaolongfei 提交于 1月 08, 2018

0b52cc88
Q

add back priority · ca90356b
由 qiaolongfei 提交于 1月 08, 2018

ca90356b
D
Feature/add shared layout (#7233) · e94db381
由 dzhwinter 提交于 1月 08, 2018
```
* "reuse ShareLoD with no regret"

* "removed base class shareLayout"

* "fix CI"
```
e94db381

cpu gpu transform function (#7191) · 0f353ab4

由 Qiao Longfei 提交于 1月 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean

0f353ab4

E
Show argument dimensions with operator::DebugStringEx (#7268) · 8814bec0
由 emailweixu 提交于 1月 07, 2018
```
This can make it easier to locate error.
```
8814bec0

05 1月, 2018 1 次提交

Feature/use cudnn (#7141) · 5593858d

由 dzhwinter 提交于 1月 05, 2018

* "add c++ side kernel selection"

* "add multiple kernel op test"

* "kernel selection only support cudnn"

* "better formatter"

* "small fix with UseCPU"

* "depends on change interface Get(Place, Library)"

* "fix CI"

* "fix python cudnn test"

* "leave the register cudnn op to another PR"

* "fix CI"

* "use all kernel by default"

* "fix CI"

5593858d

04 1月, 2018 1 次提交
- Y
  
  clean up · 97dc451f
  由 Yang Yang 提交于 1月 04, 2018
  
  97dc451f

PaddlePaddle / PaddleDetection 大约 1 年 前同步成功

PaddlePaddle / PaddleDetection
大约 1 年前同步成功