1. 15 3月, 2019 1 次提交
  2. 13 3月, 2019 1 次提交
    • G
      resolve #15618 (#16114) · decdbed0
      guomingz 提交于
      * resolve #15618
      Backgroud: the PR #15398 raised the box_coder op performance regression, we optimized the code via the more efficency leveraging opemmp.
      decdbed0
  3. 12 3月, 2019 22 次提交
  4. 07 3月, 2019 4 次提交
  5. 05 3月, 2019 4 次提交
  6. 28 2月, 2019 3 次提交
  7. 27 2月, 2019 1 次提交
  8. 26 2月, 2019 1 次提交
    • G
      This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909) · 630c1e83
      guomingz 提交于
      * This PR improve performance of prior_box op about 1.25x faster on CPU.
      
      * Test Env:SKX 8180 with fake data on 28 threads(bs=1).
      * The below table shows the ~25% improvement which generated by [eval_tp_fake_data.py](https://github.com/PaddlePaddle/Paddle/issues/15618#issuecomment-464613976).
      
      | Type |Event | Calls |   Total     |  Min.    |   Max.      |  Ave.      |  Ratio.|
      | ---------------- | ------------------ | ---- | ------- | -------- | -------- | ------------ | -------- |
      | w/ optimization  | thread0::prior_box | 6000 | 921.201 | 0.110572 | 0.383402 | **0.153533** | 0.084585 |
      | w/o optimization | thread0::prior_box | 6000 | 1151.85 | 0.102276 | 0.426702 | **0.191976** | 0.103337 |
      
      test=develop
      
      * Fix the style issue.
      
      test=develop
      630c1e83
  9. 22 2月, 2019 2 次提交
  10. 19 2月, 2019 1 次提交