提交 · 3334c279d098c884b915a56dedc7dd69c95d7c7e · BaiXuePrincess / Paddle

01 3月, 2019 1 次提交
- S
  add sample_generator · 3334c279
  由 sneaxiy 提交于 2月 27, 2019
```
test=develop
```
  3334c279
27 2月, 2019 1 次提交
- S
  add cache reader · 7b5a9d75
  由 sneaxiy 提交于 2月 27, 2019
```
test=develop
```
  7b5a9d75
26 2月, 2019 2 次提交

This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909) · 630c1e83

由 guomingz 提交于 2月 26, 2019

* This PR improve performance of prior_box op about 1.25x faster on CPU.

* Test Env:SKX 8180 with fake data on 28 threads(bs=1).
* The below table shows the ~25% improvement which generated by [eval_tp_fake_data.py](https://github.com/PaddlePaddle/Paddle/issues/15618#issuecomment-464613976).

| Type |Event | Calls |   Total     |  Min.    |   Max.      |  Ave.      |  Ratio.|
| ---------------- | ------------------ | ---- | ------- | -------- | -------- | ------------ | -------- |
| w/ optimization  | thread0::prior_box | 6000 | 921.201 | 0.110572 | 0.383402 | **0.153533** | 0.084585 |
| w/o optimization | thread0::prior_box | 6000 | 1151.85 | 0.102276 | 0.426702 | **0.191976** | 0.103337 |

test=develop

* Fix the style issue.

test=develop

630c1e83

Add alloc_continuous_space_op (#15900) · 7ca8553d

由 chengduo 提交于 2月 25, 2019

* add alloc_continuous_space_op
test=develop

* Polish code
test=develop

* follow comment
test=develop

7ca8553d

25 2月, 2019 13 次提交
- M
  Add Conv Residual Connection UT for Projection · 6a2bc9a2
  由 Michal Gallus 提交于 2月 25, 2019
```
test=develop
```
  6a2bc9a2
- M
  Improve code reuse at MKL-DNN sum · 6ebe9877
  由 Michal Gallus 提交于 2月 25, 2019
```
test=develop
```
  6ebe9877
- S
  unify API · c545f1ed
  由 sneaxiy 提交于 2月 25, 2019
```
test=develop
```
  c545f1ed
- P
  
  test=develop · c6472579
  由 peizhilin 提交于 2月 25, 2019
  
  c6472579
- P
  fix build issue for cudaEvent_t · b5d6e38b
  由 peizhilin 提交于 2月 25, 2019
```
test=develop
```
  b5d6e38b
- S
  
  fix hang bug · b17541a9
  由 sneaxiy 提交于 2月 25, 2019
  
  b17541a9
- C
  Remove unnecessary dependence for profiler (#15899) · 8e904d32
  由 chengduo 提交于 2月 25, 2019
```
* refile profiler
test=develop

* follow comment
test=develop
```
  8e904d32
- Q
  Refine doc of uniform_random and fix dtype (#15873) · d8128930
  由 qingqing01 提交于 2月 25, 2019
```
* Refine doc of uniform_random and fix dtype
* Update defaule value in the arguments
```
  d8128930
- D
  
  fix default value. test=develop · a71f2fbe
  由 dzhwinter 提交于 2月 25, 2019
  
  a71f2fbe
- J
  [MKL-DNN] MKL-DNN specific Tensor modification (#15429) · dec9cf53
  由 Jacek Czaja 提交于 2月 25, 2019
```
* - Implemented draft of primitive desc keeping in Tensor

test=develop

- TransposeMKLDNNHandler::AcquireSrcMemory was reimplemented

- Added nchw and nc formats setting for sake of compatiblity

Fixed unit tests

- Worakaround to problem with 5D data in conv

- Added 3D and 1D MKL-DNN formats for name handles for tensor

test=develop

- Fix to UTs

test=develop

- Conv fp32 op was updated

Cosmetic fixes

test=develop

- tensor mkldnn cosmetics

test=develop

- Moved most of mkl-dnn specific code from Tensor to mkl-dnn utils

* - Lint fixes

test=develop

* - setting prim dec in Tensor , sets also layout to kMKLDNN

test=develop

* - Moved creation of prim desc totally out of Tensor

test=develop

* - Cosmetic fixes adter review

test=develop
```
  dec9cf53
- M
  Polish code · e9fdf909
  由 minqiyang 提交于 2月 25, 2019
```
test=develop
```
  e9fdf909
- X
  polish · 5dd281f7
  由 Xin Pan 提交于 2月 25, 2019
```
test=develop
```
  5dd281f7
- P
  fix build issue on windows for sample prop op · 6ccdb1b9
  由 peizhilin 提交于 2月 25, 2019
```
test=develop
```
  6ccdb1b9
24 2月, 2019 3 次提交
- D
  
  use kernel size in global_pooling. test=develop · 373cfb0c
  由 dengkaipeng 提交于 2月 24, 2019
  
  373cfb0c
- D
  
  fix spell mistakes. test=develop · 60305196
  由 dengkaipeng 提交于 2月 24, 2019
  
  60305196
- D
  
  add memset CUPTI && test=develop (#15868) · c6bd434f
  由 Dun 提交于 2月 24, 2019
  
  c6bd434f
23 2月, 2019 2 次提交
- M
  Polish code · a15a3fc3
  由 minqiyang 提交于 2月 23, 2019
```
test=develop
```
  a15a3fc3
- Q
  
  refine code test=develop · 2b7931d5
  由 Qiao Longfei 提交于 2月 23, 2019
  
  2b7931d5
22 2月, 2019 18 次提交
- X
  remove mutex · 8d83e38a
  由 Xin Pan 提交于 2月 22, 2019
```
test=develop
```
  8d83e38a
- X
  fix · 0362ef75
  由 Xin Pan 提交于 2月 22, 2019
```
test=develop
```
  0362ef75
- D
  
  fix spell error. test=develop · 14df92fe
  由 dengkaipeng 提交于 2月 22, 2019
  
  14df92fe
- D
  
  fix adaptive_pool and yolov3_loss. test=develop · 144016fc
  由 dengkaipeng 提交于 2月 22, 2019
  
  144016fc
- X
  polish codes · 12a0e2ed
  由 Xin Pan 提交于 2月 22, 2019
```
test=develop
```
  12a0e2ed
- X
  polish · 19d78f67
  由 Xin Pan 提交于 2月 22, 2019
```
test=develop
```
  19d78f67
- S
  Change *(smart_ptr.get()) -> *smart_ptr · 74672d1a
  由 Sylwester Fraczek 提交于 2月 07, 2019
```
reason: dereferencing smart pointer is the same as the underlying pointer
test=develop
```
  74672d1a
- T
  Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
  由 tensor-tang 提交于 2月 22, 2019
```
* Revert "Optimze Gelu with MKL Erf function (#15770)"

This reverts commit 676995c8.

* test=develop
```
  ee2321de
- D
  
  \frac -> \frac. test=develop · eb65b4e4
  由 dengkaipeng 提交于 2月 22, 2019
  
  eb65b4e4
- C
  enhance profiler (#15842) · 3b08c9ab
  由 chengduo 提交于 2月 22, 2019
```
test=develop
```
  3b08c9ab
- D
  
  add blank after math::. test=develop · 8167588f
  由 dengkaipeng 提交于 2月 22, 2019
  
  8167588f
- X
  resolve conflicts · 32d5a160
  由 Xin Pan 提交于 2月 22, 2019
```
test=develop
```
  32d5a160
- Q
  
  optimize style test=develop · 3f9263f6
  由 Qiao Longfei 提交于 2月 22, 2019
  
  3f9263f6
- D
  
  use math:: instead of 29. test=develop · d9ec6058
  由 dengkaipeng 提交于 2月 22, 2019
  
  d9ec6058
- Q
  
  add more comment test=develop · 4233d0a8
  由 Qiao Longfei 提交于 2月 22, 2019
  
  4233d0a8
- D
  
  fix adaptive pool doc.test=develop · 19292ac6
  由 dengkaipeng 提交于 2月 22, 2019
  
  19292ac6
- Y
  Initialize the benchmark tester for operator. (#15772) · 7d96c74a
  由 Yiqun Liu 提交于 2月 22, 2019
```
* Initialize the benchmark tester for operator.
test=develop

* Rearrange the codes.
test=develop
```
  7d96c74a
- Y
  Optimze Gelu with MKL Erf function (#15770) · 676995c8
  由 Yihua Xu 提交于 2月 22, 2019
```
* Optimize for gelu operator

* Set up the low accuracy mode of MKL ERF function.

test=develop

* Only enable MKLML ERF when OS is linux

* Use the speical mklml version included vmsErf function to verify gelu mkl kernel.

test=develop

* Add the CUDA macro to avoid NVCC's compile issue.

test=develop

* Add the TODO comments for mklml library modification.

test=develop

* Clean Code

test=develop

* Add the comment of marco for NVCC compiler.

test=develop
```
  676995c8

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致