提交 · 664f958a02f4c778b0758c574ad03c03bc5b58bc · BaiXuePrincess / Paddle

29 11月, 2019 1 次提交
- J
  
  [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375) · cd43c444
  由 Jacek Czaja 提交于 11月 29, 2019
  
  cd43c444
26 11月, 2019 1 次提交
- J
  
  [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207) · f4cf028a
  由 Jacek Czaja 提交于 11月 26, 2019
  
  f4cf028a
04 11月, 2019 1 次提交
- Z
  
  lrn supports channel_last input, test=develop (#20954) · de9bec60
  由 Zhang Ting 提交于 11月 04, 2019
  
  de9bec60
31 10月, 2019 1 次提交

GradMaker for dygraph (#19706) · 8c4573a3

由 hong 提交于 10月 31, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* optimize grad maker; test=develop

* optimize grad maker

* test

* grad make optim; test=develop

* fix unittest bugs; test=develop

* add dygraph grad op maker and split_op

* grad op maker refactor; test=develop

* add dygraph grad maker; test=develop

* fix op deformable_conv_v1_op bug; test=develop

* fix deformable_conv prroi pool bugs;

* fix new op grad op maker bug; test=develop

* fix split by ref bug; test=develop

* fix dygraph auto prune bug; test=develop

* fix test_trace bug; test=develop

* fix fused emb seq pool bug; test=develop

* remove useless code in op_desc file; test=develop

* remove useless code, StrVarBaseNode; test=develop

* fix review issues; test=develop

* fix rank_loss grad maker; test=develop

* remove flag in VarBase; test=develop

* fix distributed_notify_op compile bug ; test=develop

* fix reshape op double grad; test=develop

* fix expand as op; test=develop

* add impertive type_defs.h for demo_train; test=develop

* fix inference lib cmake; test=develop

* fix inference lib; test=develop

* fix infernce_lib; test=develop

* fix inference cmake; test=develop

* fix inference lib; test=develop

* fix inference lib; test=develop

* remove condition dygraph grad maker, modify local name; test=develop

* fix split grad maker bug; test=develop

* fix pyramid_op bug; test=develop

* change travis time out limit; test=develop

* restore travis; test=develop

* change timeout limit; test=develop

8c4573a3

28 10月, 2019 1 次提交

Replace risky GetInputType method with secure IndicateVarDataType interface (#20668) · 26cc1fe5

由 Chen Weihang 提交于 10月 28, 2019

* replace part of the old implementation, test=develop

* restore concat op, test=develop

* update all ops implemention & delete GetDataTypeOfVar func, test=develop

26cc1fe5

08 5月, 2019 1 次提交
- X
  modified formula for Lrn (#17281) · 9ed4aaad
  由 xiaoting 提交于 5月 08, 2019
```
* modified formula for lrn

test=develop

* modified api.spec

test=develop
```
  9ed4aaad
12 12月, 2018 1 次提交
- Y
  Change tensor uses proto::VarType::type · 9bd70a1e
  由 Yu Yang 提交于 12月 11, 2018
```
test=develop
```
  9bd70a1e
16 11月, 2018 1 次提交
- T
  fix lrn on mac (#14426) · 64f7516a
  由 tensor-tang 提交于 11月 16, 2018
```
* rename and fix blas vsqr

test=develop

* update
```
  64f7516a
15 11月, 2018 1 次提交

add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278) · 8a1eeec5

由 Sylwester Fraczek 提交于 11月 15, 2018

* add is_test to pooling and activations

add prop_kind support for layers activation. conv and pooling

add a pass that sets is_test to true

add transpiler version of is_test pass

test=develop

* patch test and pass

test=develop

* add pass to analyzer.h

test=develop

* add is_test attr description & pass only on mkldnn

in:
activation_op.cc
batch_norm_op.cc
conv_op.cc
dropout_op.cc
lrn_op.cc
pool_op.cc
sequence_pool_op.cc
softmax_op.cc

* fix is_test handling for activation pool and conv

* change description of is_test for all layers again

* remove GetAttr(use_mkldnn) from pass

* rename correct_mkldnn_test_phase to is_test

and remove dependency on MKLDNN
test=develop

* review fix magic number

* two if(..)s into one

* Check is_test once and pass mkldnn forward prop kind

* dereference shared_ptr with * (without get())

test=develop

* add is_test_pass back

test=develop

8a1eeec5

13 11月, 2018 1 次提交
- T
  refine lrn_op cpu forward and speedup · b4dfba17
  由 tensor-tang 提交于 11月 13, 2018
```
test=develop
```
  b4dfba17
07 6月, 2018 1 次提交

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

08 5月, 2018 1 次提交

Clean OpProtoAndCheckerMaker · 0e78cb69

由 Yu Yang 提交于 5月 08, 2018

Do not use ctor

* Reduce line of codes.
* We can use virtual function for Maker now.
* The implementation does not care what maker holds, it is easier to
refactor later.

0e78cb69

19 4月, 2018 1 次提交
- Y
  add semicolon to op registry (#10034) · e04c43d5
  由 Yang Yang(Tony) 提交于 4月 18, 2018
```
* script to add semicolon

* fix typo
```
  e04c43d5
17 4月, 2018 1 次提交
- Y
  
  script to fix all · ce7c2e86
  由 Yang Yang 提交于 4月 16, 2018
  
  ce7c2e86
12 4月, 2018 1 次提交
- S
  Fix cpplint errors for a set of operators (#9837) · 8d3ce01f
  由 Siddharth Goyal 提交于 4月 11, 2018
```
* Fix cpplint errors, round2

* Fix pointer issue
```
  8d3ce01f
30 3月, 2018 1 次提交
- T
  
  Plain LRN op throws an exception when is_test is set in backward pass · b9874251
  由 Tomasz Patejko 提交于 3月 30, 2018
  
  b9874251
22 3月, 2018 1 次提交
- T
  
  Function for running MKLDNN primitive added. Unittest added for is_test attribute · 14ba67c0
  由 Tomasz Patejko 提交于 3月 22, 2018
  
  14ba67c0
21 3月, 2018 1 次提交
- T
  
  Device blobs are created only in training. Added testing attribute · 72cc64e4
  由 Tomasz Patejko 提交于 3月 21, 2018
  
  72cc64e4
19 3月, 2018 3 次提交
- T
  
  Removing WITHIN_CHANNEL algorithm for lrn. CPU lrn operator works only with ACROSS_CHANNELS · 2d955275
  由 Tomasz Patejko 提交于 3月 19, 2018
  
  2d955275
- T
  
  Content of GetExpectedKernelType moved to standalone function · c51c4462
  由 Tomasz Patejko 提交于 3月 16, 2018
  
  c51c4462
- T
  
  Implementation of MKLDNN LRN · 192cc5dd
  由 Tomasz Patejko 提交于 3月 13, 2018
  
  192cc5dd
15 3月, 2018 1 次提交
- Q
  
  Fix bug in LRN operator. (#9124) · 1cd700d8
  由 qingqing01 提交于 3月 15, 2018
  
  1cd700d8
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

06 12月, 2017 1 次提交
- G
  Add LRN efficient GPU implement. (#5894) · c7e739f5
  由 gongweibao 提交于 12月 06, 2017
```
Add LRN efficient GPU implement
```
  c7e739f5
04 11月, 2017 1 次提交
- K
  
  polish_g_to_l (#5367) · c0d2ca54
  由 kexinzhao 提交于 11月 03, 2017
  
  c0d2ca54
26 10月, 2017 1 次提交
- G
  Local response normalize. (#4426) · 9d142d50
  由 gongweibao 提交于 10月 26, 2017
```
Add local response normalize
```
  9d142d50

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致