- 17 Mar, 2020 1 commit
-
-
Committed by mapingshuo
* add sgd op, test=develop * test=develop
-
- 16 Mar, 2020 2 commits
-
-
Committed by mapingshuo
* add backend * add kernel * add grad kernel
-
Committed by cc
-
- 12 Mar, 2020 1 commit
-
-
Committed by mapingshuo
* add mean op, test=develop * split forward and backward, test=develop
-
- 09 Mar, 2020 1 commit
-
-
Committed by mapingshuo
* add lookup_dequant_op
-
- 29 Feb, 2020 1 commit
-
-
Committed by zhupengyang
-
- 28 Feb, 2020 1 commit
-
-
Committed by mapingshuo
* add sequence_conv op and arm kernel * add test, test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * modify code style. test=develop * fix ut, test=develop * delete unused code, test=develop
-
- 27 Feb, 2020 1 commit
-
-
Committed by xiaogang
* feat: add lstm op && kernel test=develop
-
- 21 Feb, 2020 1 commit
-
-
Committed by xiaogang
fix: move quant op to basic
-
- 14 Feb, 2020 1 commit
-
-
Committed by xiaogang
-
- 27 Dec, 2019 1 commit
-
-
Committed by yiicy
-
- 23 Dec, 2019 2 commits
- 18 Dec, 2019 1 commit
-
-
Committed by juncaipeng
* Support Mask RCNN2 (#2588)
-
- 13 Dec, 2019 1 commit
-
-
Committed by hong19860320
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
-
- 10 Dec, 2019 1 commit
-
-
Committed by yiicy
-
- 07 Dec, 2019 1 commit
-
-
Committed by juncaipeng
* add arm split lod tensor, test=develop * add arm merge lod tensor, test=develop * update split merge lod tensor, test=develop * add reduce_prob op, test=develop * support mask_rcnn succeed, test=develop
-
- 26 Nov, 2019 1 commit
-
-
Committed by huzhiqiang
-
- 25 Nov, 2019 1 commit
-
-
Committed by huzhiqiang
* arrange ops&kernels to support basic models * add relu and iocopy into basic * add fusion_elementwise_activation_ops into basic * add dropout,layout,io_copyonce kernel&op into basic
-
- 19 Nov, 2019 3 commits
-
-
Committed by zhupengyang
test=develop
-
Committed by huzhiqiang
* add x86 op and kernel : search_fc and sequence_topk_avg_pooling for content-dnn model test=develop
-
Committed by zhupengyang
* [X86] add attention_padding_mask op, x86 kernel and unit test test=develop * [CUDA] add attention_padding_mask cuda kernel and unit test test=develop
-
- 18 Nov, 2019 2 commits
-
-
Committed by Pei Yang
add search_group_padding op and x86 kernel
-
Committed by zhupengyang
* [X86][CUDA] add sequence_arithmetic op , x86 kernel, cuda kernel and unit test test=develop * add sequence_arithmetic cuda kernel unit test test=develop
-
- 16 Nov, 2019 1 commit
-
-
Committed by hong19860320
-
- 15 Nov, 2019 1 commit
-
-
Committed by juncaipeng
* add search_seq_depadding x86 and cuda * add match_matrix_tensor x86 * add search_grnn x86, no test
-
- 14 Nov, 2019 2 commits
- 13 Nov, 2019 2 commits
- 08 Nov, 2019 1 commit
-
-
Committed by huzhiqiang
* move multiclass_nms kernel back to host test=develop * move layer_norm OP and arm_kernel into extra type since it's added after release/v2.0-beta1 and not related with CV test=develop * fix code_style test=develop
-
- 07 Nov, 2019 1 commit
-
-
Committed by huzhiqiang
We changed 11 arm_kernels to the extra type in #2347, which caused test_compiling to fail. This PR moves the 11 related arm_kernel_test into build_extra=ON.
-
- 06 Nov, 2019 1 commit
-
-
Committed by juncaipeng
* add channel_wise_dequantized_max_abs op and ChannelWiseDequantOpFuser, test=develop
-
- 05 Nov, 2019 1 commit
-
-
Committed by lijianshe02
* fix step rnn model run bugs test=develop
-
- 04 Nov, 2019 1 commit
-
-
Committed by huzhiqiang
* move some basic ops into extra type to reduce library size test=develop (#2347)
-
- 22 Oct, 2019 2 commits
-
-
Committed by juncaipeng
* Add DeleteQuantOpFuser * Add fake_quantize_dequantize_moving_avg_abs_max_op * Add DeleteQuantDequantOpFuser
-
Committed by TianXiaogang
* feat: add beam_search_special function for support nlp model * fix: add beam_search_compute kernel input and output * feat: add assign op & copy_compute kernel * feat: add fill_const_batch_size_like op & kernel * feat: add layer_norm op and kernel and ut * fix: fix some bugs fix mul_op infer_shape bug when x_dim_idx = 2, x_dims.size()=3 & y_dim_idx = 1, y_dims.size()=2 fix elementwise_compute bug when y axis is all 1 fix beam_search choose math_func wrong bug fix layer_norm get attr bug fix fill_constant_batch_size_like shape_set bug * feat: add gather op and kernel & and transform ut * feats: add ops and fix bugs to support transformer op fix type_cast passes to skip `while` fix elementwise infer_shape bug when x.dims=3 and y.dims={1} & axis=0 fix lookup_table compute bug fix read_from_array/beam_search/increment/compate/gather ops data_type problems * fix: transfomer ut add word read inferface fix copy/gather/norm/layer_norm include path problem * fix:debug info * fix: fix input reshape bug * fix: fix norm bug * style: style fix & test=develop * style: fix operators cmakelist * style: fix operators cmakelist; test=develop * fix and test=develop * fix and test=develop * style: style fix; test=develop
-
- 11 Oct, 2019 1 commit
-
-
Committed by Zhaolong Xing
* add conv int8 support (for cases where the input or output channel count is not a multiple of 4); add add_kernel for cuda. * can run yolov3 fp32 test=develop * 1. fix bug with yolov3 run test=develop * can run yolov3 int8 test=develop
-
- 17 Sep, 2019 1 commit
-
-
Committed by liu zhengxi
* add fill_constant_batch_size_like op and add its unittest
-
- 16 Sep, 2019 1 commit
-
-
Committed by lhl960107
* add x86 gru&&relu&&sequence_expand_as op test=develop
-