提交 · e5f9d3a47ca1ea50f47e0b12b0d4acf11d5446fb · PaddlePaddle / PaddleDetection

27 2月, 2019 12 次提交
- F
  anakin subgraph engine (#15774) · e40d56c3
  由 flame 提交于 2月 27, 2019
```
* add anakin subgraph engine

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* add initial op converter

* update

* update

* fix op register compile error

* update
test=develop

* update
```
  e40d56c3
- Y
  Optimize while_op when is_test is true. (#15811) · 613d9d07
  由 Yiqun Liu 提交于 2月 27, 2019
```
test=develop
```
  613d9d07
- X
  Optimize Quantize Op with primitive reuse. (#15929) · 1abddd8d
  由 xiaolil1 提交于 2月 27, 2019
```
test=develop
```
  1abddd8d
- L
  refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool · 34404f9c
  由 luotao1 提交于 2月 27, 2019
```
test=develop
```
  34404f9c
- B
  
  Added adam op test=develop (#15710) · f285191f
  由 baojun 提交于 2月 26, 2019
  
  f285191f
- M
  Register sum operator (#15889) · 558f94cd
  由 mozga-intel 提交于 2月 27, 2019
```
test=develop
```
  558f94cd
- D
  polish cudnn related code and fix bug. (#15164) · 225c11a9
  由 dzhwinter 提交于 2月 27, 2019
```
* staged.

* polish code

* polish code. test=develop

* polish code. test=develop

* api change. test=develop

* fix default value. test=develop

* fix default value. test=develop
```
  225c11a9
- X
  polish · 0c277ac6
  由 Xin Pan 提交于 2月 27, 2019
```
test=develop
```
  0c277ac6
- X
  have no time for cmake/externel · 4d80db83
  由 Xin Pan 提交于 2月 27, 2019
```
test=develop
```
  4d80db83
- Y
  Rewrite is_empty op to avoid unnecessary data transform. (#15509) · 454f4f21
  由 Yiqun Liu 提交于 2月 27, 2019
```
* Rewrite is_empty op to avoid unnecessary data transform.
test=develop

* Add the implementation of InferShape and InferVarType for is_empty op.
test=develop

* Rewrite is_empty op to avoid directly inherit OperatorBase.
test=develop
```
  454f4f21
- X
  INT8 Pool kernel Key Creation Optimization. (#15883) · 6724be2b
  由 xiaolil1 提交于 2月 27, 2019
```
* Optimize key creation of INT8 pool kernel to improve the peformance of ResNet-50 and MobileNet, especially for latency.
test=develop

* Optimize key creation of pool fp32 grad.
test=develop
```
  6724be2b
- B
  
  added concat op test=develop · e4ab40a7
  由 baojun-nervana 提交于 2月 26, 2019
  
  e4ab40a7
26 2月, 2019 17 次提交

K
Add MKL-DNN placement pass tester · 72253391
由 Krzysztof Binias 提交于 2月 26, 2019
```
test=develop
```
72253391
T
fix cpplint error of async_executor.h · 436dfbb3
由 Tao Luo 提交于 2月 26, 2019
```
test=develop
```
436dfbb3
T

enable cpplint, remove go_fmt · 28680c65
由 Tao Luo 提交于 2月 26, 2019

28680c65
T
fix jitcodekey and refine test · 8bc63815
由 tensor-tang 提交于 2月 26, 2019
```
test=develop
```
8bc63815
T
add sgd jitcode and op test · 7044cfa7
由 tensor-tang 提交于 2月 25, 2019
```
test=develop
```
7044cfa7
T
add benchmark and mkl sgd implement · 8e041337
由 tensor-tang 提交于 2月 25, 2019
```
test=develop
```
8e041337

- MKL-DNN pooling updated to set_prim_desc · c63f6b20

由 Jacek Czaja 提交于 2月 04, 2019

- MKLDNN ops revisited

- disabled softmax modifications

- disabled elementwise_add

- reverted LRN modifications

- reverted SUM primitive

- Partial reviing of softmax

- Enable softmax

- Softmax changes

- LRN is back

- LRN partially disabled

- LRN is back

- LRN fix

- compilation fixes

- Sum fixed(hopefully)

- Enabling (partially) elementwise_add

- Fixes to elemenwise_add

- Lint fixes

quantize fix

- compilation fix

test=develop

Disabling pooling

- Disabled quantize op

test=develop

c63f6b20

S

add API.spec. test=develop · 33982932
由 shippingwang 提交于 2月 26, 2019

33982932
S

fix api.spec, test=develop · 5ce46c63
由 shippingwang 提交于 2月 26, 2019

5ce46c63
Q

Fix bug in fake_quantize_op and add more unit testing (#15912) · 8e439ccf
由 qingqing01 提交于 2月 26, 2019

8e439ccf

loosly check in the InferShape of cross_entropy_op. (#15863) · f4846bf3

由 qingqing01 提交于 2月 26, 2019

* loosly check in cross_entropy_op when soft_label is True
* Add Runtime assertion in backward infer_shape check.
* Skip InferShape check when un-know the input dimensions

f4846bf3

M
Add gperftools into imperative tracer · 28077c4d
由 minqiyang 提交于 2月 26, 2019
```
test=develop
```
28077c4d
X
Optimize INT8 DeQuantize Op with primitive reuse. · 70759d18
由 xiaoli.liu@intel.com 提交于 2月 26, 2019
```
test=develop
```
70759d18

Optimize the CUDA implementation of sequence_expand op by reduce the times of... · f4634d76

由 Yiqun Liu 提交于 2月 26, 2019

Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)

* Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU.
test=develop

* Refine the op benchmark to support setting lod in config.
test=develop

f4634d76

This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909) · 630c1e83

由 guomingz 提交于 2月 26, 2019

* This PR improve performance of prior_box op about 1.25x faster on CPU.

* Test Env:SKX 8180 with fake data on 28 threads(bs=1).
* The below table shows the ~25% improvement which generated by [eval_tp_fake_data.py](https://github.com/PaddlePaddle/Paddle/issues/15618#issuecomment-464613976).

| Type |Event | Calls |   Total     |  Min.    |   Max.      |  Ave.      |  Ratio.|
| ---------------- | ------------------ | ---- | ------- | -------- | -------- | ------------ | -------- |
| w/ optimization  | thread0::prior_box | 6000 | 921.201 | 0.110572 | 0.383402 | **0.153533** | 0.084585 |
| w/o optimization | thread0::prior_box | 6000 | 1151.85 | 0.102276 | 0.426702 | **0.191976** | 0.103337 |

test=develop

* Fix the style issue.

test=develop

630c1e83

Add alloc_continuous_space_op (#15900) · 7ca8553d

由 chengduo 提交于 2月 25, 2019

* add alloc_continuous_space_op
test=develop

* Polish code
test=develop

* follow comment
test=develop

7ca8553d

B

Update ngraph version to v0.14 test=develop · 2ffacdeb
由 baojun-nervana 提交于 2月 25, 2019

2ffacdeb

25 2月, 2019 11 次提交
- M
  Add Conv Residual Connection UT for Projection · 6a2bc9a2
  由 Michal Gallus 提交于 2月 25, 2019
```
test=develop
```
  6a2bc9a2
- Z
  
  update some functions' names according to the suggestion. test=develop · 54893145
  由 Zhen Wang 提交于 2月 25, 2019
  
  54893145
- M
  Improve code reuse at MKL-DNN sum · 6ebe9877
  由 Michal Gallus 提交于 2月 25, 2019
```
test=develop
```
  6ebe9877
- P
  
  test=develop · c6472579
  由 peizhilin 提交于 2月 25, 2019
  
  c6472579
- P
  fix build issue for cudaEvent_t · b5d6e38b
  由 peizhilin 提交于 2月 25, 2019
```
test=develop
```
  b5d6e38b
- L
  Enable function coverage for U8/S8 ConvMKLDNNOpKernel · 4acc5220
  由 liangan1 提交于 2月 25, 2019
```
test=develop
```
  4acc5220
- C
  Remove unnecessary dependence for profiler (#15899) · 8e904d32
  由 chengduo 提交于 2月 25, 2019
```
* refile profiler
test=develop

* follow comment
test=develop
```
  8e904d32
- Z
  
  update with develop. test=develop · 9261cf39
  由 Zhen Wang 提交于 2月 25, 2019
  
  9261cf39
- Z
  
  add set_attr for IrOpNode. test=develop · 0bf809c9
  由 Zhen Wang 提交于 2月 25, 2019
  
  0bf809c9
- Q
  Refine doc of uniform_random and fix dtype (#15873) · d8128930
  由 qingqing01 提交于 2月 25, 2019
```
* Refine doc of uniform_random and fix dtype
* Update defaule value in the arguments
```
  d8128930
- D
  
  fix default value. test=develop · a71f2fbe
  由 dzhwinter 提交于 2月 25, 2019
  
  a71f2fbe

PaddlePaddle / PaddleDetection 大约 1 年 前同步成功

PaddlePaddle / PaddleDetection
大约 1 年前同步成功