提交 · a25331bc264b935b064655720ab08378bdf8458c · PaddlePaddle / PaddleDetection

20 3月, 2019 6 次提交

N

cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189 · a25331bc
由 nhzlx 提交于 3月 20, 2019

a25331bc
N
cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 · 69d37f81
由 nhzlx 提交于 3月 20, 2019
```
support change input size
```
69d37f81
N

cherry-pick from feature/anakin-engine: Anakin support facebox #16111 · a1d200a5
由 nhzlx 提交于 3月 20, 2019

a1d200a5

由 chengduo 提交于 3月 19, 2019

* fuse all_reduce
test=develop

* add fuse_parameter_groups_size
test=develop

* Polish code
test=develop

* Fix travis-ci
test=develop

* Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
test=develop

* Add SetGroupAccordingToMemorySize
test=develop

* fix multi_devices_graph
test=develop

* reset params_grads
test=develop

* Polish code
test=develop

f26ba5bd

Collective ops (#15572) · 6382b62f

由 Wu Yi 提交于 3月 20, 2019

* wip allreduce in op

* wip

* wip

* wip

* wip adding test

* wip for conflict with mp mode

* fix tests test=develop

* fix cpu build test=develop

* fix travis clang format test=develop

* fix cpu build test=develop

* update api.spec test=develop

* delete comment test=develop

* fix cpplint test=develop

* fix test=develop

* follow comment test=develop

* add file test=develop

* fix build test=develop

* update test=develop

* to be compatible with sync_bn, and fix mp mode in develop test=develop

6382b62f

S
fix op grad maker · 023a3a3d
由 sneaxiy 提交于 3月 19, 2019
```
test=develop
```
023a3a3d

19 3月, 2019 4 次提交

L
add runtime_context_cache_pass · 82af8031
由 luotao1 提交于 3月 19, 2019
```
test=develop
```
82af8031
T

Revert "cache runtime_context" · 7d2740db
由 Tao Luo 提交于 3月 19, 2019

7d2740db

[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233) · 13816dd4

由 Jacek Czaja 提交于 3月 19, 2019

* - Fix to crash of Transformer when mkldnn is to be used

Desc: TensorCopy was not setting MKLDNN primitive descriptor when layout was to be kMKLDNN

test=develop

* - Enable transformer for mkl-dnn

test=develo

* - Compilation fix

test=develop

* - Removed manual selection of MKL-DNN ops to be used in Transformer test

test=develop

13816dd4

Add cpu_quantize_placement_pass for C-API quantization (#16265) · af030088

由 Wojciech Uss 提交于 3月 19, 2019

* Add cpu_quantize_placement_pass for C-API quantization

test=develop

* added a comment on required pass attributes

test=develop

af030088

18 3月, 2019 4 次提交

M
Polish code style · b40e41fb
由 minqiyang 提交于 3月 18, 2019
```
test=develop
```
b40e41fb
M
Take DataType and VarType apart · 36dce65b
由 minqiyang 提交于 3月 18, 2019
```
test=develop
```
36dce65b
L
refine with comments · cc0ae1f1
由 luotao1 提交于 3月 18, 2019
```
test=develop
```
cc0ae1f1

Add cpu_quantize_pass for C-API quantization (#16127) · 2579ade4

由 Wojciech Uss 提交于 3月 18, 2019

* Add cpu_quantize_pass for C-API quantization

test=develop

* add cpu_quantize_pass test

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* fuse_relu 1

test=develop

* tuned 2 without squash

* fixes

test=develop

* remove unused vars

test=develop

* refactored

test=develop

* fix lint c-style cast -> C++ style cast

test=develop

* remove QuantMax and c style casts

test=develop

* last usage of QuantMax removed

test=develop

* Fix Analysis Predictor UT

Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop

* change map to unordered_map

fix the forgotten part of cpu_quantize_pass_tester.cc

test=develop

* removed quantized attribute

* fixed cpu_quantize_pass_tester and op attr comments

test=develop

* removed redundant line

test=debug

* removed gmock

test=develop

* fix after merge

2579ade4

16 3月, 2019 1 次提交
- Q
  Fix windows compiling (#16230) · 86e912c5
  由 qingqing01 提交于 3月 16, 2019
```
test=develop
```
  86e912c5
15 3月, 2019 7 次提交
- M
  Polish code · 36225373
  由 minqiyang 提交于 3月 15, 2019
```
test=develop
```
  36225373
- M
  Polish code · c0ddb93c
  由 minqiyang 提交于 3月 15, 2019
```
test=develop
```
  c0ddb93c
- M
  Make infer var type virtual · b5078c21
  由 minqiyang 提交于 3月 15, 2019
```
test=develop
```
  b5078c21
- M
  Implement Runtime Var Type Inference · 438bca9c
  由 minqiyang 提交于 3月 15, 2019
```
test=develop
```
  438bca9c
- L
  fix distributed unit-tests · 46ee6bb1
  由 luotao1 提交于 3月 15, 2019
```
test=develop
```
  46ee6bb1
- Q
  Support sync batch norm. (#16121) · 8ad672a2
  由 qingqing01 提交于 3月 15, 2019
```
* Support Sync Batch Norm.
* Note, do not enable it in one device.

Usage:

build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)
```
  8ad672a2
- M
  
  Implement infer var type context · ca392c7e
  由 minqiyang 提交于 3月 15, 2019
  
  ca392c7e
14 3月, 2019 2 次提交

L
1. disable reuse SELECTED_ROWS type variable (#16150) · 1c6caf84
由 liuwei1031 提交于 3月 14, 2019
```
2. remove lod check in reshape op
test=develop
```
1c6caf84

Add cpu_quantize_squash_pass for C-API quantization (#16128) · b9252f3d

由 Wojciech Uss 提交于 3月 14, 2019

* Add cpu_quantize_squash_pass for C-API quantization

test=develop

* add cpu_quantize_squash_pass teste

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* lint fix 2

* fixes

test=develop

* refactored

test=develop

* fix windows ci

test=develop

b9252f3d

13 3月, 2019 4 次提交
- M
  
  Accelerate CPU part · 42e96a02
  由 minqiyang 提交于 3月 13, 2019
  
  42e96a02
- L
  add runtime_context_cache_pass · d94fd972
  由 luotao1 提交于 3月 13, 2019
```
test=develop
```
  d94fd972
- Y
  fix broadcast on mp mode (#15951) · 30568473
  由 Yan Xu 提交于 3月 13, 2019
```
* fix broadcast with mp mode

* polish code test=develop

* fix bcast strategy test=develop

* fic cpplint test=develop

* fix py3 failed test=develop

* fix comment test=develop

* update comment test=develop
```
  30568473
- B
  remove const_cast and refactor ngraph engine code (#15925) · e3c37bd5
  由 baojun 提交于 3月 13, 2019
```
* remove concast_cast and refactor code test=develop

* reduce flag use test=develop
```
  e3c37bd5
12 3月, 2019 6 次提交
- L
  refine with comments · fe78a92e
  由 luotao1 提交于 3月 12, 2019
```
test=develop
```
  fe78a92e
- W
  restore the exception caught since it is necessary for python call stack (#16160) · 85709f43
  由 wopeizl 提交于 3月 12, 2019
```
test=develop
```
  85709f43
- Z
  
  Add some fixme. test=develop · 5685a48c
  由 Zhen Wang 提交于 3月 12, 2019
  
  5685a48c
- Z
  
  Add the Clone method in Graph. test=develop · ac6ef06f
  由 Zhen Wang 提交于 3月 07, 2019
  
  ac6ef06f
- Z
  
  Not add graph copy construction method. test=develop · 01eddf12
  由 Zhen Wang 提交于 3月 07, 2019
  
  01eddf12
- Z
  
  add clone function for IrGraph. test=develop · 1b9c8d5f
  由 Zhen Wang 提交于 3月 07, 2019
  
  1b9c8d5f
11 3月, 2019 3 次提交
- L
  add all_kernels_must_compute_runtime_shape example for speedup infershape · 31ccaf09
  由 luotao1 提交于 3月 11, 2019
```
test=develop
```
  31ccaf09
- C
  Revert "Revert "Add Event for TensorCopy"" (#16035) · ad80bde8
  由 chengduo 提交于 3月 11, 2019
```
* Revert "Revert "Add Event for TensorCopy" (#16022)"

This reverts commit e2da3a5b.

* use default stream
test=develop
```
  ad80bde8
- S
  disable gc in recurrent_op currently · 732fa00e
  由 sneaxiy 提交于 3月 08, 2019
```
test=develop
```
  732fa00e
08 3月, 2019 3 次提交
- Y
  Fix the node's order issue when the content of graph is changed (#16088) · 0a45441a
  由 Yihua Xu 提交于 3月 07, 2019
```
* Fix the node's sort issue when the graph is changed.

test=develop

* Clean code

test=develop
```
  0a45441a
- N
  fix comments and fix cpplint · 4b59646e
  由 nhzlx 提交于 2月 27, 2019
```
test=develop
```
  4b59646e
- N
  3. when runing in trt mode, do not allocate memory for parameters in fluid. · 4f77248d
  由 nhzlx 提交于 2月 15, 2019
```
test=develop
```
  4f77248d

PaddlePaddle / PaddleDetection 大约 1 年 前同步成功

PaddlePaddle / PaddleDetection
大约 1 年前同步成功