1. 15 4月, 2020 1 次提交
  2. 14 4月, 2020 3 次提交
  3. 13 4月, 2020 1 次提交
  4. 09 4月, 2020 1 次提交
  5. 08 4月, 2020 1 次提交
    • H
      [Core][XPU] Add XPU op kernels (#3274) · 2b80bab6
      hong19860320 提交于
      * [LITE][XPU] bind xpu resnet50 kernels
      
      * [LITE][XPU] fuse resnet50 and encoder
      
      * [LITE][XPU] bind xpu bert kernels
      
      * [LITE][XPU] refine xpu_resnet_fuse_pass.cc
      
      * [LITE][XPU] add xpu stack kernel
      
      * [LITE][XPU] add xpu slice/tanh kernel
      
      * [LITE][XPU] refine resnet50 and encoder fusor
      
      * [LITE][XPU] split resnet50 and multi_encoder op from subgraph_op.h
      
      * [LITE][XPU] clean workspace
      
      * [LITE][XPU] add build script
      
      * [LITE][XPU] fix compilation errors
      
      * [LITE][XPU] fix kernel matmul
      
      * [LITE][XPU] fix kernel ewadd ewsub
      
      * [LITE][XPU] add xpu cast kernel
      
      * [LITE][XPU] fix kernel slice
      
      * [LITE][XPU] switch dev by LITE_XPU_DEV env
      
      * [LITE][XPU] eliminate useless cast op
      
      * [LITE][XPU] add PerThread Ops
      
      * [LITE][X86] add SequenceUnpad op and kernel
      
      * [LITE][XPU] add LITE_WITH_XTCL option
      
      * [LITE][X86] add SequenceConv kernel
      
      * [LITE][XPU] fix cmake dependency
      
      * [LITE][XPU] add xpu sigmoid kernel
      
      * [XPU] Remove the dependencies of framework.pb.h
      test=develop
      
      Change-Id: Icfb44efb0482a6369b365b5c09017765328fc10d
      
      * [XPU] Fix the precision of cast kernel
      test=develop
      
      Change-Id: Icb18be47d7ab490de9fb9c92eae1165f49dbf492
      
      * [Core] Fix the compiling error when build for the target that disable XPU
      test=develop
      
      Change-Id: I38ec53f222391d3bf06b70512e6c3ad1282e4683
      
      * [XPU] Add io_copy kernel for xpu<->arm
      test=develop
      
      Change-Id: Iec7ea066f040534285557f9948b73e6a1970aed7
      
      * fix
      test=develop
      
      Change-Id: I4db1c93df48e22afbba904ce6c3b0babd9fda4c3
      
      * fix target matching of type_target_cast_pass and remove the unnecessary registration of io_copy kernel
      test=develop
      
      Change-Id: I432c10c9d1064e778d43fd0d12d8cf0599252f7a
      
      * [X86] Add the keyword 'template' to avoid the compiling errors
      test=develop
      
      Change-Id: I015d5d323adafb3884029c8287ced66c90ad931e
      
      * Fix the build.sh for XPU and x86
      test=develop
      
      Change-Id: I7d9575243669ce02af69a8ddbd6421db31902bd6
      
      * [XPU] Add the keyword 'template' to avoid the compiling errors
      test=develop
      
      Change-Id: I46d0b3b6861286a73ee2999934b8e185e453e749
      
      * [XPU] Add XTCL compiling option in build.sh
      test=develop
      
      Change-Id: I8b3fd998ca5f898d5bd2e665646e3874b3b73c80
      
      * fix namespace conflicts, test=develop
      
      * [API][XPU] Move the XPU related APIs into CxxConfig
      test=develop
      
      Change-Id: I75ac35e8bae96bcb835683f413f01b9db45afbf9
      
      * [API][XPU] Remove the LITE_WITH_XPU in paddle_api.h
      test=develop
      
      Change-Id: Idbd64013bdf331ad876919511c1c349332d46f93
      
      * [API][XPU] Remove XPUSetWorkspaceL3SizePerThread and XPUSetDevPerThread
      test=develop
      
      Change-Id: I515958f56f8e129280bae61c923513cc91fb9728
      
      * [API][Core][XPU] Refine the test case and remove the necessary modifications
      test=develop
      
      Change-Id: I1e0e2957a2f9d5f4207b06c0bc98a5ab611fee56
      
      * [Core] Remove useless code
      test=develop
      
      Change-Id: I6293faa10424aea2836d09d85ddb6a30f7811678
      
      * [XPU] Refine the test cases
      test=develop
      
      Change-Id: I6818fc3addf1bca5b96a7d66ee99263242e3374f
      
      * [XPU] Remove useless scripts and code
      test=develop
      
      Change-Id: I965ba6712d3cf881d0038f0473fec27d4c1bc684
      
      * [XPU] Use InferShapeImpl in sequence_unpad, resnet50 and multi_encoder op
      test=develop
      
      Change-Id: I5375f524d36836a394d426b4b2bc9fb44be0b59c
      
      * test=develop
      
      Change-Id: I42ee68c8a5e891dd0f3e95d6cfbc498be7cf1519
      
      * test=develop
      
      Change-Id: If679e5aa73e1368e0ee5bd5f286d2e1b4c2f354e
      
      * [XPU] Add __xpu__ prefix to the op and graph pass name of resnet50 and multi_encoder
      test=develop
      
      Change-Id: Idb61c99b4b8429cb87665bfd6835ab4d7d263be2
      
      * [XPU] Fix and refine the xpu fuse pass
      test=develop
      
      Change-Id: If1c5b6788d994e2809c1a00d9384685a89440907
      
      * test=develop
      
      Change-Id: Icfa333e322fc4351700103692c46cfcb3d4f9a89
      
      * [XPU] Remove the dependency on xpu api for xpu fuse passes
      test=develop
      
      Change-Id: I6094b5536f58ae18bab068284b32f9bd10a2ab92
      
      * [XPU] Move unit tests from lite/api to lite/tests/api
      test=develop
      
      Change-Id: I7ba27abb23abeffb0c95fdbbefec7ac16cdbd250
      
      * test=develop
      
      Change-Id: I33230c84d6c4e61bf19f46668bae2baa3ef68794
      
      * [XPU] Refine code
      test=develop
      
      Change-Id: I37bc5b948b4927e44cd3ea2594ebe3fd7671be06
      
      * [XPU] Add env XPU_ENABLE_XTCL to enable xpu_subgraph_pass
      test=develop
      
      Change-Id: Ifb8e07e86f307f562adaca3ce792015a6f2a2204
      
      * [XPU] refine code
      test=develop
      
      Change-Id: I1380654b930d51ae704dbc0cd855464d9c3b5b79
      
      * [XPU] Refine code
      test=develop
      
      Change-Id: I73285c2718ccd3612490eb2635bef4fd608c9bde
      
      * [XPU] Add comments for the XPU APIs
      test=develop
      
      Change-Id: Ieb5015f37984f8869b90c4c625c5894bb26164fd
      Co-authored-by: Nmiaotianxiang <miaotianxiang@baidu.com>
      Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
      2b80bab6
  6. 07 4月, 2020 2 次提交
  7. 24 3月, 2020 1 次提交
  8. 17 3月, 2020 2 次提交
    • W
      add cuda cxx demo (#3205) · 9098da7c
      Wilber 提交于
      - 增加cuda c++ demo.
      - 考虑到检测模型尾部一般是multiclass_nms,该kernel为host,如果fetch kernel为cuda的话,则会在此处插入无用的io_copy(host->cuda),由于该原因,注释掉fetch的cuda kernel. 默认使用host的fetch kernel. 此处暗中进行的行为:每次predictor run完,都会默认把数据从cuda拷贝到cpu
      9098da7c
    • W
      For cuda compilation products and ci (#3152) · ca2481e6
      Wilber 提交于
      add cuda ci.
      
      Organize cuda compilation products.
      ca2481e6
  9. 16 3月, 2020 1 次提交
  10. 10 3月, 2020 1 次提交
  11. 09 3月, 2020 2 次提交
  12. 04 3月, 2020 1 次提交
  13. 21 2月, 2020 1 次提交
  14. 12 2月, 2020 1 次提交
  15. 14 1月, 2020 1 次提交
  16. 08 1月, 2020 1 次提交
    • H
      [arm] add test_cv demo (#2691) · 945e4341
      HappyAngel 提交于
      * add cv image process
      
      * fix arm liunx build error
      
      * add LITE_WITH_CV defien to make cv, test=develop
      
      * fix cv format, annd add describe in utils/cv
      
      * set LITE_WITH_CV=OFF in build.sh, test=develop
      
      * delete cv_enum.h in utils/cv, push the contents in cv_ennum.h to paddle_image_preprocess.h, test=develop
      
      * according to reviews to redefine paddle_image_preprocess.h, test=develop
      
      * add detailed note of flipParam, test=develop
      
      * fix format in paddle_image_preprocess.h, test=develop
      
      * fix cmake error in llite/CMakeLists.txt, missing mkdir cxx, test=develop
      
      * according to review change, test=develop
      
      * add elemetnwise mul constant elimination and deconv+relu, deconv+batchnorm fusion, test=develop
      
      * fix format, test=develop
      
      * fix model_optimize bug, update concat and split op, speed up, test=develop
      
      * update split speed, test=develop
      
      * fix format, test=develop
      
      * add classify demo inn demo/cxx/ , test=develop
      
      * fix formart inn mobile_classify, test=develop
      
      * delete some note and extra code, test=develop
      
      * remove test.jpg and labels.txt, test=develop
      
      * add test_cv in cxx/demo
      
      * add test_cv READMEE, test=develoop
      
      * add note info, flip only support x, y, xy;rotate only support 90, 180, 270; test=develop
      
      * fix build error, paddle_cv_arm , test=develop
      
      * add GRAY to RGBA(BGRA) convert and RGBA(BGRA)_to_Tensor, test=develop
      
      * fix format from review, test=develop
      
      * fix makefile format. test=devellop
      
      * fix bbuuild v7 error, test=develop
      945e4341
  17. 07 1月, 2020 1 次提交
  18. 23 12月, 2019 1 次提交
  19. 11 12月, 2019 1 次提交
  20. 09 12月, 2019 1 次提交
    • H
      Static libraty in tiny pub (#2560) · 8aee9417
      huzhiqiang 提交于
      * add static library in tiny_publish
      * move flto and ffunction-sections cmake option into the tiny publish so result of java,cxx,python 
      8aee9417
  21. 03 12月, 2019 1 次提交
    • Y
      [Demo] add cxx mobilenetv1-ssd detection demo, test=develop (#2541) · 2890b912
      yiicy 提交于
      * [Demo] add cxx mobilenetv1-ssd detection demo, test=develop
      
      * add makefile to mobile detection demo, test=develop
      
      * [Demo] add cxx mobilenetv1-ssd detection demo, test=develop
      
      * [demo] fix mobile_detection code style, test=develop
      
      * [Demo] fix demo code style, test=develop
      
      * [Demo] fix detection demo makefile dependency, test=develop
      2890b912
  22. 22 11月, 2019 3 次提交
    • H
      strip cxx_dynamic so in tiny_publish test=develop (#2456) · f68ea81c
      huzhiqiang 提交于
      * [Publish] strip cxx light lib test=develop
      f68ea81c
    • Y
      [LITE][DEMO] add input check for demo (#2444) · fee2004b
      Yuan Shuai 提交于
      * Add CheckInput. test=develop
      
      * Fix android_log_ undef when using .a. test=develop
      
      * add guide of how to use shared library in demo. test=develop
      fee2004b
    • H
      [LITE][ARMM] fix cmake error in lite/CMakeLists.txt, missing mkdir cxx in iOS (#2418) · b5fe3840
      HappyAngel 提交于
      * add cv image process
      
      * fix arm liunx build error
      
      * add LITE_WITH_CV defien to make cv, test=develop
      
      * fix cv format, annd add describe in utils/cv
      
      * delete some Meaningless comments, test=develop
      
      * set LITE_WITH_CV=OFF in build.sh, test=develop
      
      * delete cv_enum.h in utils/cv, push the contents in cv_ennum.h to paddle_image_preprocess.h, test=develop
      
      * according to reviews to redefine paddle_image_preprocess.h, test=develop
      
      * add detailed note of flipParam, test=develop
      
      * fix format in paddle_image_preprocess.h, test=develop
      
      * fix error when build x86. test=develop
      
      lite_with_X86 does not contain lite_with_cv
      
      * fix cmake error in llite/CMakeLists.txt, missing mkdir cxx, test=develop
      
      * according to review change, test=develop
      
      * chang grb to rgb, test=develop
      b5fe3840
  23. 20 11月, 2019 1 次提交
  24. 18 11月, 2019 1 次提交
    • Y
      [LITE][OPENCL] Enable full and light api for OpenCL (#2331) · cfa086e9
      Yuan Shuai 提交于
      * Fix bug target for kHost and kARM not equal. test=develop
      
      * Fix license. test=develop
      
      * add debug -g option. test=develop
      
      * enable opencl demo. test=develop
      
      * Fix model_optimize_tool found no opencl kernel. test=develop
      
      * add more vlog. test=develop
      
      * remove macro LITE_WITH_OPENCL, LITE_WITH_FPGA in passes. test=develop
      
      * Fix valid_places in mobilenetv1_test. test=develop
      
      * Fix bug of find no real output of fetch, after tool OPs of optimzer passes. test=develop
      
      * Fix vlog as log message in model_optimize_tool. test=develop
      
      * fix miscs. test=develop
      
      * fix comment. test=develop
      
      * Fix misspell of opencl, fpga kernels name in lite/api/CMakeLists.txt. test=develop
      
      * add opencl macro in full_api of demo. test=develop
      cfa086e9
  25. 12 11月, 2019 1 次提交
    • H
      [LITE][ARM]add cv image process (#2402) · 9f236a99
      HappyAngel 提交于
      * add cv image process
      
      * fix arm liunx build error
      
      * add LITE_WITH_CV defien to make cv, test=develop
      
      * fix cv format, annd add describe in utils/cv
      
      * delete some Meaningless comments, test=develop
      
      * set LITE_WITH_CV=OFF in build.sh, test=develop
      
      * delete cv_enum.h in utils/cv, push the contents in cv_ennum.h to paddle_image_preprocess.h, test=develop
      
      * according to reviews to redefine paddle_image_preprocess.h, test=develop
      
      * add detailed note of flipParam, test=develop
      
      * fix format in paddle_image_preprocess.h, test=develop
      
      * fix error when build x86. test=develop
      
      * lite_with_X86 does not contain lite_with_cv
      9f236a99
  26. 07 11月, 2019 1 次提交
  27. 05 11月, 2019 1 次提交
  28. 01 11月, 2019 2 次提交
  29. 31 10月, 2019 1 次提交
  30. 30 10月, 2019 1 次提交
  31. 29 10月, 2019 1 次提交
  32. 28 10月, 2019 1 次提交
    • H
      [LITE][XPU] initial support for XPU (#2202) · ac1b2f9f
      hong19860320 提交于
      * Initial support for XPU
      * Fix compiling errors of XPU
      * Move XPU op kernel bridges from backends to kernels to fix deps order
      * Change the namespace and directory of XPU bridges
      * Add XPU SDK
      * Fix header files and namespace of XPU SDK
      * Add unit tests for relu and conv2d ops
      * Restore the modification of paddle_api_test
      * Supports simple model which contains only a relu layer
      * Add compiling scripts for XPU
      * Fix compiling errors of XPU
      * Add comments for XPU LoadModel and BuildModel
      ac1b2f9f