- 22 Oct, 2019 1 commit
-
-
Committed by TianXiaogang
* feat: add beam_search_special function to support NLP models
* fix: add beam_search_compute kernel input and output
* feat: add assign op & copy_compute kernel
* feat: add fill_const_batch_size_like op & kernel
* feat: add layer_norm op and kernel and ut
* fix: fix some bugs
  fix mul_op infer_shape bug when x_dim_idx = 2, x_dims.size()=3 & y_dim_idx = 1, y_dims.size()=2
  fix elementwise_compute bug when y axis is all 1
  fix beam_search bug of choosing the wrong math_func
  fix layer_norm get attr bug
  fix fill_constant_batch_size_like shape_set bug
* feat: add gather op and kernel & and transform ut
* feat: add ops and fix bugs to support transformer op
  fix type_cast passes to skip `while`
  fix elementwise infer_shape bug when x.dims=3 and y.dims={1} & axis=0
  fix lookup_table compute bug
  fix read_from_array/beam_search/increment/compare/gather ops data_type problems
* fix: transformer ut add word read interface
  fix copy/gather/norm/layer_norm include path problem
* fix: debug info
* fix: fix input reshape bug
* fix: fix norm bug
* style: style fix & test=develop
* style: fix operators cmakelist
* style: fix operators cmakelist; test=develop
* fix and test=develop
* fix and test=develop
* style: style fix; test=develop
-
- 16 Oct, 2019 1 commit
-
-
Committed by sangoly
* [framework][place] remove prefered_place, use place order in valid_place array instead test=develop
* remove kHost from valid_places test=develop
-
- 15 Oct, 2019 1 commit
-
-
Committed by juncaipeng
* fix bug when accessing a removed node, test=develop
-
- 14 Oct, 2019 1 commit
-
-
Committed by juncaipeng
* optimize quant_dequant_fuse_pass, test=develop
-
- 27 Sep, 2019 1 commit
-
-
Committed by Zhaolong Xing
* add conv int8 support (for the case where the input or output channel count is not a multiple of 4)
  add add_kernel for cuda.
* can run yolov3 fp32 test=develop
* 1. fix bug with yolov3 run test=develop
-
- 26 Sep, 2019 1 commit
-
-
Committed by sangoly
-
- 23 Sep, 2019 1 commit
-
-
Committed by Wilber
-
- 18 Sep, 2019 1 commit
-
-
Committed by 石晓伟
-
- 13 Sep, 2019 1 commit
-
-
Committed by 石晓伟
* check if passes match targets and kernels, test=develop
* add pass_utils, test=develop
* fix lite/core/mir/pass_registry.h, test=develop
* improve code style, test=develop
* fix spelling error, test=develop
-
- 12 Sep, 2019 1 commit
-
-
Committed by guofei
-
- 11 Sep, 2019 2 commits
-
-
Committed by 石晓伟
* make passes related to the device type, test=develop
* improve tips, test=develop
-
Committed by zhupengyang
test=develop
-
- 06 Sep, 2019 1 commit
-
-
Committed by zhupengyang
test=develop
-
- 30 Aug, 2019 1 commit
-
-
Committed by Zhen Wang
* Add precision and persistable attrs for the tensor, and fix the cxx light and full api demos.
* update precision2string methods. test=develop
* move the save logic to the front of the run in mobilenetv1_full_api.cc, test=develop.
* add comments for UpdateVarsOfProgram. test=develop
-
- 29 Aug, 2019 2 commits
-
-
Committed by tensor-tang
-
Committed by Wilber
* add yolo_box_compute cuda
* move multiclass_nms(arm) to host
* add lod in scale op
* add yolo_box_cuda cmake config
* modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support running the ssd model. test=develop
* reshape and transpose op don't have xshape output.
* modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop
* add yolo_box use kernel test=develop
-
- 28 Aug, 2019 2 commits
-
-
Committed by zhupengyang
* add transpose-softmax-transpose fuse pass test=develop
* enable supported lite-npu ops test=develop
-
Committed by sangoly
-
- 23 Aug, 2019 1 commit
-
-
Committed by tensor-tang
test=develop
-
- 16 Aug, 2019 1 commit
-
-
Committed by Yan Chunwei
-