- 04 Feb, 2020 1 commit
-
-
Committed by yiicy
* refactor 5x5s1 dw conv armv8, test=develop
* [ARM] refactor depthwise conv 5x5s1, and support relu6, leaky relu, test=develop
* [ARM] sgemv support fuse relu6 and leaky relu, test=develop
* [ARM] reduce some conv ut case, test=develop
* [ARM] fix 5x5dw conv pick kernel bug, test=develop
* fix code style, test=develop
* [ARM] fix sgemv fuse relu6 bug, test=develop
* [ARM] fix fp32 5x5s1 dw bug, test=develop
* [ARM] fix fp32 5x5 dw conv pick kernel bug, test=develop
-
- 24 Jan, 2020 1 commit
-
-
Committed by zhupengyang
-
- 21 Jan, 2020 1 commit
-
-
Committed by zhupengyang
-
- 19 Jan, 2020 1 commit
-
-
Committed by zhupengyang
* [NPU] reshape x,y,out node in elementwise ops
-
- 16 Jan, 2020 1 commit
-
-
Committed by zhupengyang
-
- 15 Jan, 2020 1 commit
-
-
Committed by hong19860320
-
- 14 Jan, 2020 3 commits
-
-
Committed by HappyAngel
add gemm + relu6/leakyrelu fusion
-
Committed by zhupengyang
- enhance interp InferShape
-
Committed by myq406450149
* Support bitmain backend
-
- 13 Jan, 2020 2 commits
-
-
Committed by zhupengyang
-
Committed by hong19860320
[LITE][NPU] Add instance_norm op bridge and unit test, refine the registration of op bridges (#2747)
-
- 10 Jan, 2020 1 commit
-
-
Committed by zhupengyang
-
- 09 Jan, 2020 1 commit
-
-
Committed by zhupengyang
-
- 08 Jan, 2020 2 commits
-
-
Committed by HappyAngel
* add cv image process
* fix arm linux build error
* add LITE_WITH_CV define to make cv, test=develop
* fix cv format, and add description in utils/cv
* set LITE_WITH_CV=OFF in build.sh, test=develop
* delete cv_enum.h in utils/cv, move the contents of cv_enum.h into paddle_image_preprocess.h, test=develop
* redefine paddle_image_preprocess.h according to reviews, test=develop
* add detailed note of flipParam, test=develop
* fix format in paddle_image_preprocess.h, test=develop
* fix cmake error in lite/CMakeLists.txt, missing mkdir cxx, test=develop
* change according to review, test=develop
* add elementwise mul constant elimination and deconv+relu, deconv+batchnorm fusion, test=develop
* fix format, test=develop
* fix model_optimize bug, update concat and split op, speed up, test=develop
* update split speed, test=develop
* fix format, test=develop
* add classify demo in demo/cxx/, test=develop
* fix format in mobile_classify, test=develop
* delete some note and extra code, test=develop
* remove test.jpg and labels.txt, test=develop
* add test_cv in cxx/demo
* add test_cv README, test=develop
* add note info, flip only supports x, y, xy; rotate only supports 90, 180, 270; test=develop
* fix build error, paddle_cv_arm, test=develop
* add GRAY to RGBA(BGRA) convert and RGBA(BGRA)_to_Tensor, test=develop
* fix format from review, test=develop
* fix makefile format, test=develop
* fix build v7 error, test=develop
-
Committed by zhupengyang
* [NPU] enhance unittest for shuffle_channel, unsqueeze, pool test=develop
-
- 07 Jan, 2020 1 commit
-
-
Committed by zhupengyang
test=develop
-
- 03 Jan, 2020 1 commit
-
-
Committed by zhupengyang
test=develop
-
- 31 Dec, 2019 3 commits
-
-
Committed by Wilber
X86 and CUDA compile simultaneously (#2708)
cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo -DWITH_MKL=ON -DLITE_WITH_CUDA=ON -DWITH_MKLDNN=OFF -DLITE_WITH_X86=ON -DLITE_WITH_PROFILE=OFF -DWITH_LITE=OFF -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF -DWITH_PYTHON=OFF -DWITH_TESTING=ON -DLITE_WITH_ARM=OFF -DLITE_ON_TINY_PUBLISH=OFF -DCUDNN_ROOT=/usr/local/cudnn/ -DLITE_BUILD_EXTRA=ON
-
Committed by zhupengyang
test=develop
-
Committed by hong19860320
* Fix the compile error that occurs when specifying the ddk_root path and building for Huawei NPU.
* Refine the registration of op bridges and make it similar to the registration of ops and kernels.
* Refine the interfaces of the graph and node for op bridges, and support creating constant and data nodes automatically according to the attribute 'persistable' of the target tensor.
* Add the unit test of the scale and softmax op bridge for NPU.
-
- 26 Dec, 2019 1 commit
-
-
Committed by zhupengyang
test=develop
-
- 25 Dec, 2019 3 commits
-
-
Committed by juncaipeng
* fix op inputs and outputs type, test=develop
-
Committed by Liu Yiqun
test=develop
-
Committed by Yiqun Liu
* Remove GEMM padding in fc_compute. test=develop
* Write a common ParallelFor function to run the for loop in parallel.
* Add the GEMM padding code back in fc.
* Refine the code of fc when padding_weight is false to avoid defining a temporary Tensor.
* Refine the unit test of fc and add test cases for padding and parallel. test=develop
* Enable more test cases in the common fc unittest, including padding and parallel for the x86 target.
* Remove the fc test under kernels/x86. test=develop
* Disable relu in the fc test for non-x86 targets. test=develop
* Change the eps of arm. test=develop
-
- 24 Dec, 2019 5 commits
-
-
Committed by zhupengyang
-
Committed by hong19860320
-
Committed by hong19860320
* Support multiple types for XPU and NPU op bridges
* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT
* Fix the definition of lookup_table kernel for X86
-
Committed by zhupengyang
test=develop
-
Committed by zhupengyang
* [XPU] elementwise_add unit test
* [XPU] softmax unit test, test=develop
-
- 23 Dec, 2019 1 commit
-
-
Committed by yiicy
-
- 21 Dec, 2019 1 commit
-
-
Committed by zhupengyang
test=develop
-
- 20 Dec, 2019 2 commits
-
-
Committed by zhupengyang
test=develop
-
Committed by zhupengyang
* [XPU] add transpose bridge and unit test, test=develop
-
- 19 Dec, 2019 1 commit
-
-
Committed by yiicy
* [ARM] change global pooling choose kernel policy, test=develop
-
- 18 Dec, 2019 1 commit
-
-
Committed by juncaipeng
* Support Mask RCNN2 (#2588)
-
- 17 Dec, 2019 1 commit
-
-
Committed by HappyAngel
-
- 13 Dec, 2019 1 commit
-
-
Committed by hong19860320
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
-
- 10 Dec, 2019 1 commit
-
-
Committed by yiicy
-
- 07 Dec, 2019 1 commit
-
-
Committed by juncaipeng
* add arm split lod tensor, test=develop
* add arm merge lod tensor, test=develop
* update split merge lod tensor, test=develop
* add reduce_prob op, test=develop
* mask_rcnn now runs successfully, test=develop
-
- 04 Dec, 2019 1 commit
-
-
Committed by 石晓伟
-