提交 · b2ce8320211bc4dd75567efd39055dec734d5f01 · PaddlePaddle / Paddle

04 3月, 2019 1 次提交
- J
  
  change default option related to softmax, test=develop · b2ce8320
  由 jerrywgz 提交于 2月 12, 2019
  
  b2ce8320
01 3月, 2019 1 次提交
- C
  
  test=develop · f6d18678
  由 ceci3 提交于 3月 01, 2019
  
  f6d18678
27 2月, 2019 6 次提交
- C
  
  2018 -> 2019 · 6bce9861
  由 ceci3 提交于 2月 27, 2019
  
  6bce9861
- C
  
  test=develop · 4b7bf06e
  由 ceci3 提交于 2月 27, 2019
  
  4b7bf06e
- Y
  Rewrite is_empty op to avoid unnecessary data transform. (#15509) · 454f4f21
  由 Yiqun Liu 提交于 2月 27, 2019
```
* Rewrite is_empty op to avoid unnecessary data transform.
test=develop

* Add the implementation of InferShape and InferVarType for is_empty op.
test=develop

* Rewrite is_empty op to avoid directly inherit OperatorBase.
test=develop
```
  454f4f21
- X
  INT8 Pool kernel Key Creation Optimization. (#15883) · 6724be2b
  由 xiaolil1 提交于 2月 27, 2019
```
* Optimize key creation of INT8 pool kernel to improve the peformance of ResNet-50 and MobileNet, especially for latency.
test=develop

* Optimize key creation of pool fp32 grad.
test=develop
```
  6724be2b
- T
  Merge pull request #15943 from kbinias/kbinias/add-placement-pass-tester · d5a888e1
  由 Tao Luo 提交于 2月 27, 2019
```
MKL-DNN: Add placement pass tester
```
  d5a888e1
- T
  Merge pull request #15917 from jczaja/prv-tensor-mkldnn-ops · ba90e052
  由 Tao Luo 提交于 2月 27, 2019
```
[MKL-DNN] Adjusting ops to Tensor modifications
```
  ba90e052
26 2月, 2019 22 次提交
- K
  Add MKL-DNN placement pass tester · 72253391
  由 Krzysztof Binias 提交于 2月 26, 2019
```
test=develop
```
  72253391
- C
  Merge pull request #15902 from colourful-tree/new_develop · 7d8f6398
  由 colourful-tree 提交于 2月 26, 2019
```
remove mkldnn & fix commit
```
  7d8f6398
- T
  Merge pull request #15913 from liangan1/func_coverage · effec866
  由 Tao Luo 提交于 2月 26, 2019
```
Enable function coverage for U8/S8 ConvMKLDNNOpKernel
```
  effec866
- D
  Merge pull request #15926 from dzhwinter/test/add_ir_mem_opt_tests · 15de2dff
  由 dzhwinter 提交于 2月 26, 2019
```
add ir memory optimize test base
```
  15de2dff
- Z
  Merge pull request #15830 from wzzju/add_ir_node_encapsulation · e00c7a2e
  由 Zhen Wang 提交于 2月 26, 2019
```
add IrNode&IrVarNode&IrOpNode. test=develop
```
  e00c7a2e
- J
  - MKL-DNN pooling updated to set_prim_desc · c63f6b20
  由 Jacek Czaja 提交于 2月 04, 2019
```
- MKLDNN ops revisited

- disabled softmax modifications

- disabled elementwise_add

- reverted LRN modifications

- reverted SUM primitive

- Partial reviing of softmax

- Enable softmax

- Softmax changes

- LRN is back

- LRN partially disabled

- LRN is back

- LRN fix

- compilation fixes

- Sum fixed(hopefully)

- Enabling (partially) elementwise_add

- Fixes to elemenwise_add

- Lint fixes

quantize fix

- compilation fix

test=develop

Disabling pooling

- Disabled quantize op

test=develop
```
  c63f6b20
- L
  Merge pull request #15906 from junjun315/fix-api-doc-20190220 · a4b4ecd8
  由 lujun 提交于 2月 26, 2019
```
Fix util-plot for py3
```
  a4b4ecd8
- Q
  
  Fix bug in fake_quantize_op and add more unit testing (#15912) · 8e439ccf
  由 qingqing01 提交于 2月 26, 2019
  
  8e439ccf
- Q
  loosly check in the InferShape of cross_entropy_op. (#15863) · f4846bf3
  由 qingqing01 提交于 2月 26, 2019
```
* loosly check in cross_entropy_op when soft_label is True
* Add Runtime assertion in backward infer_shape check.
* Skip InferShape check when un-know the input dimensions
```
  f4846bf3
- D
  
  fix default value. test=develop · 48d9fd08
  由 dzhwinter 提交于 2月 26, 2019
  
  48d9fd08
- T
  Merge pull request #15922 from kbinias/kbinias/reuse-primitives-activations-and-softmax-mkldnn-ut · 2c5c7b2a
  由 Tao Luo 提交于 2月 26, 2019
```
MKL-DNN: Add Activations and Softmax UTs to check if primitives already exist in backward
```
  2c5c7b2a
- Y
  Optimize the CUDA implementation of sequence_expand op by reduce the times of... · f4634d76
  由 Yiqun Liu 提交于 2月 26, 2019
```
Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)

* Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU.
test=develop

* Refine the op benchmark to support setting lod in config.
test=develop
```
  f4634d76
- D
  
  fix default value. test=develop · dfb21219
  由 dzhwinter 提交于 2月 26, 2019
  
  dfb21219
- D
  
  fix default value. test=develop · a4cf2954
  由 dzhwinter 提交于 2月 26, 2019
  
  a4cf2954
- D
  
  fix default value. test=develop · a922a0a1
  由 dzhwinter 提交于 2月 26, 2019
  
  a922a0a1
- T
  Merge pull request #15923 from Sand3r-/mgallus/conv-residual-ut · 60546b78
  由 Tao Luo 提交于 2月 26, 2019
```
Add Conv Residual Connection UT for Projection
```
  60546b78
- D
  
  fix default value. test=develop · 6477b443
  由 dzhwinter 提交于 2月 26, 2019
  
  6477b443
- G
  This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909) · 630c1e83
  由 guomingz 提交于 2月 26, 2019
```
* This PR improve performance of prior_box op about 1.25x faster on CPU.

* Test Env:SKX 8180 with fake data on 28 threads(bs=1).
* The below table shows the ~25% improvement which generated by [eval_tp_fake_data.py](https://github.com/PaddlePaddle/Paddle/issues/15618#issuecomment-464613976).

| Type |Event | Calls |   Total     |  Min.    |   Max.      |  Ave.      |  Ratio.|
| ---------------- | ------------------ | ---- | ------- | -------- | -------- | ------------ | -------- |
| w/ optimization  | thread0::prior_box | 6000 | 921.201 | 0.110572 | 0.383402 | **0.153533** | 0.084585 |
| w/o optimization | thread0::prior_box | 6000 | 1151.85 | 0.102276 | 0.426702 | **0.191976** | 0.103337 |

test=develop

* Fix the style issue.

test=develop
```
  630c1e83
- T
  Merge pull request #15914 from Sand3r-/mgallus/mkldnn-sum-code-reuse · 9c05421c
  由 Tao Luo 提交于 2月 26, 2019
```
Refactor MKL-DNN Sum to use reference version on fallback
```
  9c05421c
- C
  Add alloc_continuous_space_op (#15900) · 7ca8553d
  由 chengduo 提交于 2月 25, 2019
```
* add alloc_continuous_space_op
test=develop

* Polish code
test=develop

* follow comment
test=develop
```
  7ca8553d
- D
  Merge pull request #15904 from dzhwinter/fix/disable_temp · 131e4a3b
  由 dzhwinter 提交于 2月 26, 2019
```
fix nightly build
```
  131e4a3b
- W
  Merge pull request #15916 from wopeizl/win/fixevent1 · 2192c464
  由 wopeizl 提交于 2月 26, 2019
```
fix build issue for cudaEvent_t
```
  2192c464
25 2月, 2019 10 次提交
- M
  Add Conv Residual Connection UT for Projection · 6a2bc9a2
  由 Michal Gallus 提交于 2月 25, 2019
```
test=develop
```
  6a2bc9a2
- K
  Add UTs to check whether primitives for activations and softmax already exist in backward · 851ea04d
  由 Krzysztof Binias 提交于 2月 25, 2019
```
test=develop
```
  851ea04d
- Z
  
  update some functions' names according to the suggestion. test=develop · 54893145
  由 Zhen Wang 提交于 2月 25, 2019
  
  54893145
- M
  Improve code reuse at MKL-DNN sum · 6ebe9877
  由 Michal Gallus 提交于 2月 25, 2019
```
test=develop
```
  6ebe9877
- D
  Merge pull request #15855 from dzhwinter/fix/nightly_test · 660e4106
  由 dzhwinter 提交于 2月 25, 2019
```
accelerate memory optimize process
```
  660e4106
- P
  
  test=develop · c6472579
  由 peizhilin 提交于 2月 25, 2019
  
  c6472579
- P
  fix build issue for cudaEvent_t · b5d6e38b
  由 peizhilin 提交于 2月 25, 2019
```
test=develop
```
  b5d6e38b
- Q
  Merge pull request #15831 from velconia/imperative_engine · 4bd28b30
  由 Qiyang Min 提交于 2月 25, 2019
```
Imperative training network to the end
```
  4bd28b30
- X
  Merge pull request #15425 from panyx0718/api · a6e3cd5e
  由 Xin Pan 提交于 2月 25, 2019
```
Pass graph to parallel executor instead of program
```
  a6e3cd5e
- L
  Enable function coverage for U8/S8 ConvMKLDNNOpKernel · 4acc5220
  由 liangan1 提交于 2月 25, 2019
```
test=develop
```
  4acc5220

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功