1. 27 Feb, 2020 1 commit
    • [LITE][OPENCL] replace vlog with log in pool, add each run duration print in test (#3010) · 5014c3ce
      Committed by Yuan Shuai
      * replace vlog with log in pool, add each run duration print in test. test=develop
      
      * change layout file path. test=develop
      
      * fix act, layout image kernel. test=develop
      
      * fix spell error. test=develop
      
      * remove clfinish in act, concat, layout, nearest_interp. test=develop
      
      * add RELU macro define in fc opencl kernel. test=develop
      
      * add cpu_ref print in activation opencl kernel. test=develop
      
      * fix layout ut. test=develop
      
      * replace log with vlog. test=develop
      
      * fix get output. test=develop
  2. 24 Feb, 2020 1 commit
    • [LITE][OPENCL] support fp16 for cl_image_converter, layout, activation all... · 55b45651
      Committed by Yuan Shuai
      [LITE][OPENCL] support fp16 for cl_image_converter, layout, activation all OpenCL image kernel. test=develop (#2964)
      
      * [LITE][OPENCL] support fp16 for cl_image_converter, layout, activation image kernel. test=develop
      
      * add conv, depthwise and UT. test=develop
      
      * add pool, conv, nearest_interp kernel. test=develop
      
      * support fp16 for scale, reshape, concat, fc buffer opencl kernel. test=develop
      
      * refactor for mul opencl buffer kernel. test=develop
      
      * support fp16 for elementwise_mul opecl image kernel. test=develop
      
      * support fp16 for elementwise_mul opencl image kernel. test=develop
      
      * support fp16 for ele_add, fuse_ele_add_act opencl kernel. test=develop
      
      * rename io_copy. test=develop
      
      * mobilenetv1,v2 passed on 855. test=develop
      
      * fix opt for opencl. test=develop
  3. 22 Oct, 2019 1 commit
    • Transformer pr (#2214) · 330644b0
      Committed by TianXiaogang
      * feat: add beam_search_special function for support nlp model
      
      * fix: add beam_search_compute kernel input and output
      
      * feat: add assign op & copy_compute kernel
      
      * feat: add fill_const_batch_size_like op & kernel
      
      * feat: add layer_norm op and kernel and ut
      
      * fix: fix some bugs
          fix mul_op infer_shape bug when x_dim_idx = 2, x_dims.size()=3 & y_dim_idx = 1, y_dims.size()=2
          fix elementwise_compute bug when y axis is all 1
          fix beam_search choose math_func wrong bug
          fix layer_norm get attr bug
          fix fill_constant_batch_size_like shape_set bug
      
      * feat: add gather op and kernel & transform ut
      
      * feats: add ops and fix bugs to support transformer op
             fix type_cast passes to skip `while`
             fix elementwise infer_shape bug when x.dims=3 and y.dims={1} & axis=0
             fix lookup_table compute bug
             fix read_from_array/beam_search/increment/compare/gather ops data_type problems
      
      * fix:
          transformer ut add word read interface
          fix copy/gather/norm/layer_norm include path problem
      
      * fix: debug info
      
      * fix: fix input reshape bug
      
      * fix: fix norm bug
      
      * style: style fix & test=develop
      
      * style: fix operators cmakelist
      
      * style: fix operators cmakelist; test=develop
      
      * fix and test=develop
      
      * fix and test=develop
      
      * style: style fix; test=develop
  4. 03 Sep, 2019 1 commit
  5. 16 Aug, 2019 1 commit