提交 · 7d2740db83f08683f710ec01caf46e2e478458ed · PaddlePaddle / PaddleDetection

19 3月, 2019 2 次提交
- T
  
  Revert "cache runtime_context" · 7d2740db
  由 Tao Luo 提交于 3月 19, 2019
  
  7d2740db
- W
  Add cpu_quantize_placement_pass for C-API quantization (#16265) · af030088
  由 Wojciech Uss 提交于 3月 19, 2019
```
* Add cpu_quantize_placement_pass for C-API quantization

test=develop

* added a comment on required pass attributes

test=develop
```
  af030088
18 3月, 2019 1 次提交

Add cpu_quantize_pass for C-API quantization (#16127) · 2579ade4

由 Wojciech Uss 提交于 3月 18, 2019

* Add cpu_quantize_pass for C-API quantization

test=develop

* add cpu_quantize_pass test

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* fuse_relu 1

test=develop

* tuned 2 without squash

* fixes

test=develop

* remove unused vars

test=develop

* refactored

test=develop

* fix lint c-style cast -> C++ style cast

test=develop

* remove QuantMax and c style casts

test=develop

* last usage of QuantMax removed

test=develop

* Fix Analysis Predictor UT

Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop

* change map to unordered_map

fix the forgotten part of cpu_quantize_pass_tester.cc

test=develop

* removed quantized attribute

* fixed cpu_quantize_pass_tester and op attr comments

test=develop

* removed redundant line

test=debug

* removed gmock

test=develop

* fix after merge

2579ade4

16 3月, 2019 1 次提交
- Q
  Fix windows compiling (#16230) · 86e912c5
  由 qingqing01 提交于 3月 16, 2019
```
test=develop
```
  86e912c5
15 3月, 2019 1 次提交

Support sync batch norm. (#16121) · 8ad672a2

由 qingqing01 提交于 3月 15, 2019

* Support Sync Batch Norm.
* Note, do not enable it in one device.

Usage:

build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)

8ad672a2

14 3月, 2019 1 次提交

Add cpu_quantize_squash_pass for C-API quantization (#16128) · b9252f3d

由 Wojciech Uss 提交于 3月 14, 2019

* Add cpu_quantize_squash_pass for C-API quantization

test=develop

* add cpu_quantize_squash_pass teste

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* lint fix 2

* fixes

test=develop

* refactored

test=develop

* fix windows ci

test=develop

b9252f3d

13 3月, 2019 1 次提交
- L
  add runtime_context_cache_pass · d94fd972
  由 luotao1 提交于 3月 13, 2019
```
test=develop
```
  d94fd972
26 2月, 2019 1 次提交
- K
  Add MKL-DNN placement pass tester · 72253391
  由 Krzysztof Binias 提交于 2月 26, 2019
```
test=develop
```
  72253391
22 2月, 2019 1 次提交

MKL-DNN: Add test for conv bias fuse pass (#15824) · c4faf36e

由 Michał Gallus 提交于 2月 22, 2019

* MKL-DNN: Add test for conv bias fuse pass

test=develop

* Remove const cast from Conv Bias Pass Test

* Add conv with bias test case for conv+bias fuse ut

test=develop

c4faf36e

31 1月, 2019 1 次提交
- Y
  
  fix save_inferece_model bug (#15365) · 897789b1
  由 Yan Chunwei 提交于 1月 31, 2019
  
  897789b1
29 1月, 2019 1 次提交
- K
  Make separate folders for mkldnn codes · b1bdcd4d
  由 Krzysztof Binias 提交于 1月 28, 2019
```
test=develop
```
  b1bdcd4d
21 1月, 2019 1 次提交

Memory optimization of depthwise conv op and group norm op (#15313) · 9f8f0fc2

由 Dun 提交于 1月 21, 2019

* mem opt

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine with cub test=develop

* fix mkldnn test && remove comments && test=develop

* polish code && test=develop

* add only_forward test && test=develop

9f8f0fc2

14 1月, 2019 1 次提交
- T
  
  add fuse pass of sequared mat sub fusion · a5d2a6d1
  由 tensor-tang 提交于 1月 13, 2019
  
  a5d2a6d1
13 1月, 2019 1 次提交
- T
  
  add repeated fc relu pass · a89296ac
  由 tensor-tang 提交于 1月 12, 2019
  
  a89296ac
11 1月, 2019 1 次提交
- Z
  
  add_transpose_flatten_concat_fuse (#15121) · 98e85f37
  由 Zhaolong Xing 提交于 1月 11, 2019
  
  98e85f37
10 1月, 2019 1 次提交
- T
  add seqpool concat fuse pass tester · a0a27bd2
  由 tensor-tang 提交于 1月 09, 2019
```
test=develop
```
  a0a27bd2
08 1月, 2019 1 次提交
- T
  add seqpool concat fuse pass · 72d2a180
  由 tensor-tang 提交于 1月 07, 2019
```
test=develop
```
  72d2a180
07 1月, 2019 1 次提交
- M
  Add no lock optimize pass · 4bfa110f
  由 minqiyang 提交于 1月 07, 2019
```
test=develop
```
  4bfa110f
25 12月, 2018 1 次提交
- N
  add affine_channel fuse. · ce3782c1
  由 nhzlx 提交于 12月 25, 2018
```
fix conv+elemenwise fuse bug.
```
  ce3782c1
16 12月, 2018 1 次提交
- N
  add conv+elementwiseadd pass · 4e4a7772
  由 nhzlx 提交于 12月 16, 2018
```
test=develop
```
  4e4a7772
14 12月, 2018 1 次提交
- Y
  
  Fea/fuse conv elementwise add fuse (#14669) · a985949b
  由 Yan Chunwei 提交于 12月 14, 2018
  
  a985949b
07 12月, 2018 1 次提交
- Y
  Clean Code · 240d974a
  由 Yihua Xu 提交于 12月 07, 2018
```
test=develop
```
  240d974a
03 12月, 2018 1 次提交
- Y
  Implement the fusion of convolution and bias for mkldnn · 64e261c6
  由 Yihua Xu 提交于 12月 03, 2018
```
(test=develop)
```
  64e261c6
15 11月, 2018 1 次提交

add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278) · 8a1eeec5

由 Sylwester Fraczek 提交于 11月 15, 2018

* add is_test to pooling and activations

add prop_kind support for layers activation. conv and pooling

add a pass that sets is_test to true

add transpiler version of is_test pass

test=develop

* patch test and pass

test=develop

* add pass to analyzer.h

test=develop

* add is_test attr description & pass only on mkldnn

in:
activation_op.cc
batch_norm_op.cc
conv_op.cc
dropout_op.cc
lrn_op.cc
pool_op.cc
sequence_pool_op.cc
softmax_op.cc

* fix is_test handling for activation pool and conv

* change description of is_test for all layers again

* remove GetAttr(use_mkldnn) from pass

* rename correct_mkldnn_test_phase to is_test

and remove dependency on MKLDNN
test=develop

* review fix magic number

* two if(..)s into one

* Check is_test once and pass mkldnn forward prop kind

* dereference shared_ptr with * (without get())

test=develop

* add is_test_pass back

test=develop

8a1eeec5

14 11月, 2018 1 次提交
- Y
  
  Combine Inference Analysis with IR (#13914) · 9f252e00
  由 Yan Chunwei 提交于 11月 14, 2018
  
  9f252e00
06 11月, 2018 1 次提交
- X
  add tests · 25123a3b
  由 Xin Pan 提交于 11月 06, 2018
```
test=develop
```
  25123a3b
31 10月, 2018 1 次提交

add depthwise conv mkldnn pass · 4e2aaf01

由 Sylwester Fraczek 提交于 10月 30, 2018

added depthwise conv mkldnn pass which for MKLDNN changes depthwise_conv operator to conv operator because for mkldnn this is the same api
test=develop

4e2aaf01

29 10月, 2018 1 次提交

[1.1] [project] train imagenet using large batch size (#13766) · 26200f2e

由 Wu Yi 提交于 10月 29, 2018

* fix nccl2 lars dist support

* put lars in momentum op

* add tests lars

* fix ci

* fix cpu kernel

* soft warning

* remove lars in test_recognize_digits.py

* move to another op

* add file

* update api.spec test=develop

* update test=develop

* fix api.spec test=develop

* wip

* wip, finish grad merge ops

* wip, finish graph build

* wip test running

* work on 1 gpu

* workable version

* update

* fix tests

* fuse broadcast op

* fix compile failed

* refine

* add batch merge test mnist

* fix CI test=develop

* fix build

* use independent bn params for batch merge test=develop

* update api.spec

* follow comments and for test

* wip

* refine tests test=develop

* follow comments test=develop

* remove startup bn modify test=develop

* follow comments test=develop

* fix merge test=develop

26200f2e

23 10月, 2018 1 次提交
- T
  fix typo and warning in analyzer_resnet50_test · 316bc9bf
  由 Tao Luo 提交于 10月 23, 2018
```
test=develop
```
  316bc9bf
21 10月, 2018 2 次提交
- T
  MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored,... · 604bad08
  由 Tomasz Patejko 提交于 9月 12, 2018
```
MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored, applied to graph. UTs added
```
  604bad08
- T
  
  add seqconv eltadd relu pass · 603ba5e0
  由 tensor-tang 提交于 10月 19, 2018
  
  603ba5e0
19 10月, 2018 3 次提交

M
Conv+Bias: Support non-null bias · d7509d63
由 Michal Gallus 提交于 10月 12, 2018
```
test=develop
```
d7509d63
M

Conv+Bias fuse · 582f59c1
由 Michal Gallus 提交于 10月 12, 2018

582f59c1

Add MKL-DNN placement pass (#13958) · c3b70aec

由 Wojciech Uss 提交于 10月 19, 2018

* add MKL-DNN placement pass

This patch also refactors conv+bn (includes changes from PR
https://github.com/PaddlePaddle/Paddle/pull/13926)
updated to use the mkldnn-placement-pass.

test=develop

* remove redundant pass list

* add comment on the default first pass

* fix test for conv+relu mkldnn fuse

c3b70aec

11 10月, 2018 1 次提交
- T
  
  Revert "[MKLDNN] Pass: Fuse Conv + Bias" · 9b11a175
  由 Tao Luo 提交于 10月 11, 2018
  
  9b11a175
10 10月, 2018 1 次提交
- M
  Pass: Fuse Conv + Bias · 40b17be4
  由 Michal Gallus 提交于 10月 01, 2018
```
test=develop
```
  40b17be4
08 10月, 2018 1 次提交

conv bn fuse pass · 78f98294

由 Sylwester Fraczek 提交于 9月 19, 2018

review fix

review from hshen14 fix

test=develop

fix error in broadcast and code cleanup

rename bias -> eltwise and added macro to shorten code

formatting

78f98294

29 9月, 2018 1 次提交
- L
  
  refine paddle_inference_helper.h · a989a4e7
  由 luotao1 提交于 9月 29, 2018
  
  a989a4e7
28 9月, 2018 1 次提交
- Y
  fea/infer executor and concurrency performance issue bug fix (#13451) · c8744d11
  由 Yan Chunwei 提交于 9月 28, 2018
```
- add naive executor
- fix concurrency performance issue
```
  c8744d11
27 9月, 2018 1 次提交

- Added initial pass for embedding-fc-lstm · 7ab5626d

由 Jacek Czaja 提交于 9月 13, 2018

- Added draft of new operator

- Added fused embedding fc lstm files

- First time embedding_fc_lstm_fuse_pass was invoked in
  test_text_classification

- Added Embedding pattern

- Not crashing

- Enabled draft of embedding_fc_lstm pass (does it job)

- First working (Seqcompute only) version

- Removed diagnostic comment

- First enabling of BatchCompute

- Disabling pass for embedding with is_sparse and is_distributed

- Cosmetics

- Style

- Style

7ab5626d

PaddlePaddle / PaddleDetection 大约 1 年 前同步成功

PaddlePaddle / PaddleDetection
大约 1 年前同步成功