1. 08 2月, 2020 1 次提交
  2. 06 2月, 2020 1 次提交
    • J
      Support weight quantization (#2791) · 6329a9a2
      juncaipeng 提交于
      * optimize quant_dequant_fuse_pass, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * fix bug for accessing the removed node, test=develop
      
      * set the bias of int8 conv as float, test=develop
      
      * support weight quantization, test=develop
      
      * up, test=develop
      
      * up, test=develop
      
      * up, test=develop
      6329a9a2
  3. 22 1月, 2020 1 次提交
  4. 17 1月, 2020 1 次提交
  5. 16 1月, 2020 1 次提交
  6. 14 1月, 2020 2 次提交
  7. 13 1月, 2020 1 次提交
  8. 09 1月, 2020 1 次提交
  9. 08 1月, 2020 1 次提交
  10. 06 1月, 2020 1 次提交
  11. 03 1月, 2020 2 次提交
  12. 02 1月, 2020 1 次提交
  13. 31 12月, 2019 2 次提交
    • W
      X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · f1cedb8f
      Wilber 提交于
      X86 and cuda compile simutaneously cmake ..  -DCMAKE_BUILD_TYPE=RelWithDebInfo  -DWITH_MKL=ON           -DLITE_WITH_CUDA=ON           -DWITH_MKLDNN=OFF           -DLITE_WITH_X86=ON           -DLITE_WITH_PROFILE=OFF          -DWITH_LITE=OFF           -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF           -DWITH_PYTHON=OFF           -DWITH_TESTING=ON           -DLITE_WITH_ARM=OFF           -DLITE_ON_TINY_PUBLISH=OFF           -DCUDNN_ROOT=/usr/local/cudnn/           -DLITE_BUILD_EXTRA=ON (#2708)
      
      x86 and cuda compile simutaneously
      f1cedb8f
    • H
      [LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2
      hong19860320 提交于
      * Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.
      
      * Refine the registration of op bridges and make it similar to the registration of op and kernel.
      
      * Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.
      
      * Add the unit test of the scale and softmax op bridge for NPU.
      a29c84a2
  14. 30 12月, 2019 1 次提交
  15. 28 12月, 2019 1 次提交
  16. 27 12月, 2019 1 次提交
  17. 25 12月, 2019 1 次提交
  18. 24 12月, 2019 1 次提交
  19. 23 12月, 2019 3 次提交
  20. 20 12月, 2019 2 次提交
  21. 19 12月, 2019 1 次提交
  22. 18 12月, 2019 1 次提交
  23. 17 12月, 2019 3 次提交
    • H
      [lite]add some fusion (#2604) · ec8353e8
      HappyAngel 提交于
      * add cv image process
      
      * fix arm liunx build error
      
      * add LITE_WITH_CV defien to make cv, test=develop
      
      * fix cv format, annd add describe in utils/cv
      
      * delete some Meaningless comments, test=develop
      
      * set LITE_WITH_CV=OFF in build.sh, test=develop
      
      * delete cv_enum.h in utils/cv, push the contents in cv_ennum.h to paddle_image_preprocess.h, test=develop
      
      * according to reviews to redefine paddle_image_preprocess.h, test=develop
      
      * add detailed note of flipParam, test=develop
      
      * fix format in paddle_image_preprocess.h, test=develop
      
      * fix error when build x86. test=develop
      
      lite_with_X86 does not contain lite_with_cv
      
      * fix cmake error in llite/CMakeLists.txt, missing mkdir cxx, test=develop
      
      * according to review change, test=develop
      
      * chang grb to rgb, test=develop
      
      * add elemetnwise mul constant elimination and deconv+relu, deconv+batchnorm fusion, test=develop
      
      * fix format, test=develop
      ec8353e8
    • G
      [ARMLinux] Fix the error that armlinux can not compile (#2612) · 1e9823a0
      guofei 提交于
      test=develop
      1e9823a0
    • Y
  24. 16 12月, 2019 2 次提交
  25. 13 12月, 2019 1 次提交
  26. 11 12月, 2019 1 次提交
  27. 10 12月, 2019 2 次提交
    • W
      fix type_target_cast pass. support only copy once for multiple use arg. test=develop (#2572) · 8903c795
      Wilber 提交于
      For multiple-use parameters, only copy once
      8903c795
    • W
      modify static_kernel_pass to support select the kernel according to input type (#2488) · 7ef0e7fe
      Wilber 提交于
      修改了选kernel的逻辑,默认从模型文件中读取出lod_tensor的data type,在static_kernel_pick pass中如果kernel输入输出的类型与读取的data type完全一致,则选择该Kernel的概率增大。
      
      - 增加 从模型文件__model__读取lod_tensor的data type到cpp::vardesc
      
      - program中增加unordered_map<string, type>字段,并在 Program::PrepareWorkspace中对该字段赋值
      
      - 修改了node.h文件,将const Type* 更改为Type*,并在SSAGraph::Build过程中为符合条件的type*赋值
      
      - static_kernel_pick_pass中添加新规则,如果kernel的输入类型输出类型与__model__中存储的类型的一致,则score*=2。
      
      - 支持模型中用到sequence_reverse_float kernel(输入输出均为float)和sequence_reverse_int64 kernel(输入输出均为int64),能够根据输入输出type选kernel
      7ef0e7fe
  28. 07 12月, 2019 1 次提交
    • J
      Support mask_rcnn (#2484) · c2f72cb3
      juncaipeng 提交于
      * add arm split lod tensor, test=develop
      
      * add arm merge lod tensor, test=develop
      
      * update split merge lod tensor, test=develop
      
      * add reduce_prob op, test=develop
      
      * support mask_rcnn succeed, test=develop
      c2f72cb3
  29. 04 12月, 2019 2 次提交