1. 28 5月, 2019 1 次提交
    • T
      [Lite] enable fc kernel (#17674) · 4b253569
      tensor-tang 提交于
      * add fc unit test
      
      * refine eigen fc
      add cpu info, arm context
      init packed sgemm
      
      * enable packed sgemm
      
      * add arm math
      
      * pass fc ut
      
      * follow comments
      4b253569
  2. 23 5月, 2019 2 次提交
  3. 22 5月, 2019 1 次提交
    • T
      [Lite] enable cross compile and run on mobile of lite (#17541) · 310fd514
      tensor-tang 提交于
      * add cmake
      
      * update
      
      * fix proto pd
      
      * fix compile
      
      * tmp save
      
      * fix protobuf device version
      
      * fix protobuf and host compile
      
      * fix std c++11 support on android
      
      * change array to vector to fix ndk c++_static
      
      * fix rt and add dockerfile
      
      * fix android compile issue with latest merge
      
      * init arm kernels
      
      * enable run on arm
      
      * update format
      
      * update format
      
      * update format
      310fd514
  4. 10 5月, 2019 1 次提交
  5. 07 5月, 2019 2 次提交
    • T
      remove unused FLAGS_warpctc_dir (#17162) · ff1661f1
      Tao Luo 提交于
      * remove unused FLAGS_warpctc_dir
      
      test=develop
      
      * remove FLAGS_warpctc_dir
      
      test=develop
      ff1661f1
    • Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a
      石晓伟 提交于
      * cherry-pick commit from 88770542
      
      * cherry-pick commit from 3f0b97df
      
      * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn
      
      (cherry picked from commit 8643dbc2)
      
      * Cherry-Pick from 16662 : Anakin subgraph cpu support
      
      (cherry picked from commit 7ad182e1)
      
      * Cherry-pick from 1662, 16797.. : add anakin int8 support
      
      (cherry picked from commit e14ab180)
      
      * Cherry-pick from 16813 : change singleton to graph RegistBlock
      test=release/1.4
      
      (cherry picked from commit 4b9fa423)
      
      * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2
      
      Support ShuffleNet and MobileNet-v2, test=release/1.4
      
      (cherry picked from commit a6fb066f)
      
      * Cherry-pick : anakin subgraph add opt config layout argument #16846
      test=release/1.4
      
      (cherry picked from commit 8121b3ec)
      
      * 1. add shuffle_channel_detect
      
      (cherry picked from commit 6efdea89)
      
      * update shuffle_channel op convert, test=release/1.4
      
      (cherry picked from commit e4726a06)
      
      * Modify symbol export rules
      
      test=develop
      a72dbe9a
  6. 06 5月, 2019 1 次提交
  7. 05 5月, 2019 1 次提交
  8. 29 4月, 2019 1 次提交
  9. 20 4月, 2019 1 次提交
  10. 18 4月, 2019 1 次提交
  11. 12 4月, 2019 1 次提交
  12. 11 4月, 2019 1 次提交
  13. 03 4月, 2019 1 次提交
  14. 02 4月, 2019 1 次提交
    • C
      Model data cryption link all lib (#16555) · c38c7c56
      Chen Weihang 提交于
      * link the libwbaes.so into paddle
      
      * polish detail, test=develop
      
      * try fix mac_pr_ci error, test=develop
      
      * add compile option, test=develop
      
      * fix ci error, test=develop
      
      * ignore failed to find mac lib, test=develop
      
      * change cdn to bj, cdn can't get the latest version
      
      * trigger ci, test=develop
      
      * temporary delete win32 lib linking, test=develop
      
      * change https to http, test=develop
      
      * turn compile option on to off
      
      * turn compile option off to on, test=develop
      
      * try lib compiled by gcc4.8, test=develop
      
      * update lib version, test=develop
      
      * link other lib, test=develop
      
      * add setup config
      
      * delete false, test=develop
      
      * delete no_soname, test=develop
      
      * recover so name set
      
      * fix, test=develop
      
      * adjust make config, test=develop
      
      * remove link to wbaes, test=develop
      
      * remove useless define, test=develop
      c38c7c56
  15. 30 3月, 2019 1 次提交
  16. 29 3月, 2019 1 次提交
  17. 28 3月, 2019 2 次提交
  18. 25 3月, 2019 1 次提交
  19. 22 3月, 2019 1 次提交
  20. 15 3月, 2019 1 次提交
    • Q
      Support sync batch norm. (#16121) · 8ad672a2
      qingqing01 提交于
      * Support Sync Batch Norm.
      * Note, do not enable it in one device.
      
      Usage:
      
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)
      8ad672a2
  21. 09 3月, 2019 1 次提交
    • B
      Upgrade MKLDNN to v0.18-rc and fix issue caused by lib/lib64 (#15861) · db120b93
      Brian Liu 提交于
      * Upgrade MKLDNN to v0.18-rc and fix issue caused by lib/lib64
      
      Upgrade MKLDNN to v0.18-rc
      Also fix the issue during upgrade
      
      test=develop
      
      * Rebase MKLDNN to rls-v0.18 branch
      
      Some issues in v0.18-rc which caused INT8 conv op unit test failure was fixed
      in rls-v0.18 branch
      
      test=develop
      
      * Upgrade MKLDNN from v0.18rc to formal v0.18 tag
      
      test=develop
      
      * Fix the windows compile issue.
      
      test=develop
      db120b93
  22. 04 3月, 2019 4 次提交
  23. 28 2月, 2019 1 次提交
  24. 27 2月, 2019 1 次提交
    • D
      polish cudnn related code and fix bug. (#15164) · 225c11a9
      dzhwinter 提交于
      * staged.
      
      * polish code
      
      * polish code. test=develop
      
      * polish code. test=develop
      
      * api change. test=develop
      
      * fix default value. test=develop
      
      * fix default value. test=develop
      225c11a9
  25. 26 2月, 2019 2 次提交
  26. 25 2月, 2019 1 次提交
  27. 22 2月, 2019 2 次提交
    • T
      Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
      tensor-tang 提交于
      * Revert "Optimze Gelu with MKL Erf function (#15770)"
      
      This reverts commit 676995c8.
      
      * test=develop
      ee2321de
    • Y
      Optimze Gelu with MKL Erf function (#15770) · 676995c8
      Yihua Xu 提交于
      * Optimize for gelu operator
      
      * Set up the low accuracy mode of MKL ERF function.
      
      test=develop
      
      * Only enable MKLML ERF when OS is linux
      
      * Use the speical mklml version included vmsErf function to verify gelu mkl kernel.
      
      test=develop
      
      * Add the CUDA macro to avoid NVCC's compile issue.
      
      test=develop
      
      * Add the TODO comments for mklml library modification.
      
      test=develop
      
      * Clean Code
      
      test=develop
      
      * Add the comment of marco for NVCC compiler.
      
      test=develop
      676995c8
  28. 20 2月, 2019 3 次提交
  29. 19 2月, 2019 2 次提交