- 02 3月, 2020 1 次提交
-
-
由 Yuan Shuai 提交于
* [LITE][OPENCL] Support video-sr feature using OpenCL FP16 Image. test=develop * optimize image2d_to_buffer_with_post255. test=develop * add def debug in cl kernel. test=develop * remove conv image code in conv buffer. test=develop
-
- 01 3月, 2020 1 次提交
-
-
由 cc 提交于
* Update benchmark to support setting model and param filename
-
- 28 2月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 27 2月, 2020 2 次提交
-
-
由 Yuan Shuai 提交于
* replace vlog with log in pool, add each run duration print in test. test=develop * change layout file path. test=develop * fix act, layout image kernel. test=develop * fix spell error. test=develop * remove clfinish in act, concat, layout, nearest_interp. test=develop * add RELU macro define in fc opencl kernel. test=develop * add cpu_ref print in activation opencl kernel.test=develop * fix layout ut. test=develop * replace log with vlog. test=develop * fix get output. test=develop
-
由 Santa An 提交于
-
- 26 2月, 2020 2 次提交
-
-
由 huzhiqiang 提交于
-
由 huzhiqiang 提交于
-
- 24 2月, 2020 1 次提交
-
-
由 Yuan Shuai 提交于
[LITE][OPENCL] support fp16 for cl_image_converter, layout, activation all OpenCL image kernel. test=develop (#2964) * [LITE][OPENCL] support fp16 for cl_image_converter, layout, activation image kernel. test=develop * add conv, depthwise and UT. test=develop * add pool, conv, nearest_interp kernel. test=develop * support fp16 for scale, reshape, concat, fc buffer opencl kernel. test=develop * refactor for mul opencl buffer kernel. test=develop * support fp16 for elementwise_mul opecl image kernel. test=develop * support fp16 for elementwise_mul opencl image kernel. test=develop * support fp16 for ele_add, fuse_ele_add_act opencl kernel. test=develop * rename io_copy. test=develop * mobilenetv1,v2 passed on 855. test=develop * fix opt for opencl. test=develop
-
- 21 2月, 2020 1 次提交
-
-
由 hong19860320 提交于
-
- 20 2月, 2020 2 次提交
-
-
由 huzhiqiang 提交于
-
由 HappyAngel 提交于
* fix caliberate error, values from [-128, 127] to [-127, 127], test=develop * add classify demo and detection demo, test=develop
-
- 19 2月, 2020 1 次提交
-
-
由 cc 提交于
* add weight quant for combined model after optimized, test=develop
-
- 18 2月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 16 2月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 15 2月, 2020 1 次提交
-
-
由 Yuan Shuai 提交于
* mobilenetv1/v2,mnasnet,yolonano,sr models passed with opencl image2d kernel. test=develop * Fix gflag shape define. test=develop
-
- 14 2月, 2020 3 次提交
-
-
由 huzhiqiang 提交于
-
由 Yuan Shuai 提交于
* comment All opencl buffer kernels. test=develop * refactor conv, depthwise into one routing selection. test=develop
-
由 xiaogang 提交于
fix fpga lite_tensor compile bug add fake quantize_abs_max op test=develop
-
- 13 2月, 2020 2 次提交
-
-
由 huzhiqiang 提交于
-
由 huzhiqiang 提交于
* modify opt name test=develop * fix code style test=develop * test=develop
-
- 08 2月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
test=develop
-
- 06 2月, 2020 1 次提交
-
-
由 juncaipeng 提交于
* optimize quant_dequant_fuse_pass, test=develop * update, test=develop * update, test=develop * fix bug for accessing the removed node, test=develop * set the bias of int8 conv as float, test=develop * support weight quantization, test=develop * up, test=develop * up, test=develop * up, test=develop
-
- 04 2月, 2020 1 次提交
-
-
由 juncaipeng 提交于
-
- 17 1月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 16 1月, 2020 1 次提交
-
-
由 Wilber 提交于
test_model_bin support print intermediate tensor.
-
- 14 1月, 2020 1 次提交
-
-
由 myq406450149 提交于
* Support bitman backend
-
- 10 1月, 2020 2 次提交
-
-
由 juncaipeng 提交于
* upgrade benchmark 2, test=develop
-
由 juncaipeng 提交于
* update benchmark, test=develop
-
- 08 1月, 2020 1 次提交
-
-
由 HappyAngel 提交于
* add cv image process * fix arm liunx build error * add LITE_WITH_CV defien to make cv, test=develop * fix cv format, annd add describe in utils/cv * set LITE_WITH_CV=OFF in build.sh, test=develop * delete cv_enum.h in utils/cv, push the contents in cv_ennum.h to paddle_image_preprocess.h, test=develop * according to reviews to redefine paddle_image_preprocess.h, test=develop * add detailed note of flipParam, test=develop * fix format in paddle_image_preprocess.h, test=develop * fix cmake error in llite/CMakeLists.txt, missing mkdir cxx, test=develop * according to review change, test=develop * add elemetnwise mul constant elimination and deconv+relu, deconv+batchnorm fusion, test=develop * fix format, test=develop * fix model_optimize bug, update concat and split op, speed up, test=develop * update split speed, test=develop * fix format, test=develop * add classify demo inn demo/cxx/ , test=develop * fix formart inn mobile_classify, test=develop * delete some note and extra code, test=develop * remove test.jpg and labels.txt, test=develop * add test_cv in cxx/demo * add test_cv READMEE, test=develoop * add note info, flip only support x, y, xy;rotate only support 90, 180, 270; test=develop * fix build error, paddle_cv_arm , test=develop * add GRAY to RGBA(BGRA) convert and RGBA(BGRA)_to_Tensor, test=develop * fix format from review, test=develop * fix makefile format. test=devellop * fix bbuuild v7 error, test=develop
-
- 06 1月, 2020 1 次提交
-
-
由 liu zhengxi 提交于
* alter the api name from cpu to x86, test=develop * correct the step_rnn model test, test=develop
-
- 31 12月, 2019 1 次提交
-
-
由 Wilber 提交于
X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo -DWITH_MKL=ON -DLITE_WITH_CUDA=ON -DWITH_MKLDNN=OFF -DLITE_WITH_X86=ON -DLITE_WITH_PROFILE=OFF -DWITH_LITE=OFF -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF -DWITH_PYTHON=OFF -DWITH_TESTING=ON -DLITE_WITH_ARM=OFF -DLITE_ON_TINY_PUBLISH=OFF -DCUDNN_ROOT=/usr/local/cudnn/ -DLITE_BUILD_EXTRA=ON (#2708) x86 and cuda compile simutaneously
-
- 28 12月, 2019 1 次提交
-
-
由 huzhiqiang 提交于
-
- 26 12月, 2019 1 次提交
-
-
由 xiaogang 提交于
* feat: add multi_thread ut
-
- 25 12月, 2019 2 次提交
-
-
由 juncaipeng 提交于
-
由 Yiqun Liu 提交于
* Remove GEMM padding in fc_compute. test=develop * Write a common ParallelFor function to run the for loop in parallel. * Add the codes of padding GEMM back in fc. * Refine the code of fc when padding_weight is false to avoid the definition of temporary Tensor. * Refine the unit test of fc and add testing case of padding and parallel. test=develop * Enable more test cases in common fc unittest, including padding and parallel for x86 target. * Remove the fc test under kernels/x86. test=develop * Disable relu in test of fc for non-x86 target. test=develop * Change the eps of arm. test=develop
-
- 24 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 23 12月, 2019 4 次提交
-
-
由 HappyAngel 提交于
-
由 Wilber 提交于
add sequence_pool_concat fuse pass add fuse kernel
-
由 yiicy 提交于
-
由 liu zhengxi 提交于
-