- 01 6月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 01 4月, 2020 1 次提交
-
-
由 Wilber 提交于
add cuda kernel. abs, tanh, elementwise_sub
-
- 25 3月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 17 3月, 2020 1 次提交
-
-
由 Wilber 提交于
add cuda ci. Organize cuda compilation products.
-
- 20 2月, 2020 1 次提交
-
-
由 Wilber 提交于
Optimize cuda kernel and remove io_copy added by default due to missing fetch_cuda kernel
-
- 28 12月, 2019 1 次提交
-
-
由 huzhiqiang 提交于
-
- 23 12月, 2019 1 次提交
-
-
由 Wilber 提交于
add sequence_pool_concat fuse pass add fuse kernel
-
- 15 12月, 2019 1 次提交
-
-
由 Wilber 提交于
optimize search_grnn
-
- 04 12月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* init resnet cuda int8 support test=develop * refine cuda unit test test=develop * add the forgeted file. test=develop
-
- 22 11月, 2019 1 次提交
-
-
由 Pei Yang 提交于
-
- 21 11月, 2019 3 次提交
-
-
由 huzhiqiang 提交于
* cuda kernel for sequence_topk_avg_pooling and search_fc test=develop
-
由 Pei Yang 提交于
-
由 石晓伟 提交于
* fix cuda building, test=develop * remove sequence_pool from cmake because build error, test=develop
-
- 20 11月, 2019 2 次提交
-
-
由 juncaipeng 提交于
* fix x86 search_grnn and add unit test * add cuda search_grnn and unit test
-
由 Pei Yang 提交于
* add sequence_pool cuda kernel, test=develop * fix sequence_pool cuda,test=develop * fix and complete unittest, test=develop
-
- 19 11月, 2019 2 次提交
-
-
由 hong19860320 提交于
-
由 zhupengyang 提交于
* [X86] add attention_padding_mask op, x86 kernel and unit test test=develop * [CUDA] add attention_padding_mask cuda kernel and unit test test=develop
-
- 18 11月, 2019 3 次提交
-
-
由 Wilber 提交于
- add var_conv_2d cuda kernel - add var_conv_2d cuda kernel unit test - temporarily set to two input mode, remove input(ROW) and input(COLUMN)
-
由 Pei Yang 提交于
add sequence_pool cuda kernel
-
由 zhupengyang 提交于
* [X86][CUDA] add sequence_arithmetic op , x86 kernel, cuda kernel and unit test test=develop * add sequence_arithmetic cuda kernel unit test test=develop
-
- 17 11月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add cuda match_matrix_tensor op and test, test=develop
-
- 15 11月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add search_seq_depadding x86 and cuda * add match_matrix_tensor x86 * add search_grnn x86, no test
-
- 13 11月, 2019 2 次提交
- 11 11月, 2019 1 次提交
-
-
由 Pei Yang 提交于
add cuda kernel:lookup table
-
- 23 10月, 2019 2 次提交
- 21 10月, 2019 2 次提交
-
-
由 myq406450149 提交于
* add gpu kernel mul pool relu scale softmax dropout bilinear_interp and can run in tx2 * rm GREATER_EQUAL
-
由 yiicy 提交于
* cuda add softmax and pool op * * fix armlinux can find sys/system_properties.h * conv add padding_algorithm test=develop * delete padding_algorithm in op param, test=develop * fix bugs, test=develop
-
- 17 10月, 2019 1 次提交
-
-
由 juncaipeng 提交于
-
- 11 10月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* add conv int8 support(in condition which the input or output channel not be the times of 4) add add_kernel for cuda. * can run yolov3 fp32 test=develop * 1. fix bug with yolov3 run test=develop * can run yolov3 int8 test=develop
-
- 27 9月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* add conv int8 support(in condition which the input or output channel not be the times of 4) add add_kernel for cuda. * can run yolov3 fp32 test=develop * 1. fix bug with yolov3 run test=develop
-
- 12 9月, 2019 1 次提交
-
-
由 Wilber 提交于
add transpose kernel for cuda
-
- 09 9月, 2019 2 次提交
-
-
由 Pei Yang 提交于
* add nearest_interp_cuda kernel, test=develop * add concat op and elementwise_add op * remove eigen dependency from nearest_interp cuda kernel, test=develop * free cuda pointers, test=develop
-
由 Zhen Wang 提交于
* add calib cuda kernel. * add unit test for calib cuda kernel. test=develop
-
- 06 9月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* paddle lite cuda init can run model with leaky_relu * add the missing file. test=develop * add the load from memory interface. test=develop * refine this pr. fix comments fix ci error test=develop * conv impl fp32: conv, conv+bais, conv+bias+relu, conv+bias+leaky_relu int8: conv, conv+bais+relu(int8 or fp32 output), conv+bias+leaky_relu(int8 or fp32 output) can run conv+ bias+relu using cxx_api test=develop * move the lite/cuda/math to backends/cuda/math test=develop
-
- 30 8月, 2019 1 次提交
-
-
由 Pei Yang 提交于
add nearest_interp cuda kernel for Paddle-Lite
-
- 29 8月, 2019 1 次提交
-
-
由 Wilber 提交于
* add yolo_box_compute cuda * move multiclass_nms(arm) to host * add lod in scale op * add yolo_box_cuda cmake config * modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support run ssd model. test=develop * reshape and transpose op don't have xshape output. * modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop * add yolo_box use kernel test=develop
-
- 27 8月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* paddle lite cuda init can run model with leaky_relu * add the missing file. test=develop
-
- 16 8月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-