- 29 May, 2019 4 commits
-
-
Committed by tensor-tang
-
Committed by hong19860320
* enable softmax op and add unit test
* move softmax sub-functions to softmax.cc, and move basic math functions to funcs.h
-
Committed by liuwei1031
* migrate several ops: mean, mean_grad, fill_constant, square_grad, elementwise_sub_grad, mul_grad
* add sdg_op
* fix kernel platform registration issue
* code cleanup
* fix platform typo
-
Committed by tensor-tang
-
- 28 May, 2019 1 commit
-
-
Committed by tensor-tang
* add fc unit test
* refine eigen fc add cpu info, arm context init packed sgemm
* enable packed sgemm
* add arm math
* pass fc ut
* follow comments
-
- 24 May, 2019 2 commits
-
-
Committed by tensor-tang
-
Committed by Yan Chunwei
-
- 23 May, 2019 1 commit
-
-
Committed by Yan Chunwei
* code clean
- refine ARM cmake
enhancement:
- add lite_cc_library and lite_cc_test
code clean:
- remove ARM feed and fetch kernels, reuse the Host's
- remove unnecessary comments
-
- 22 May, 2019 1 commit
-
-
Committed by tensor-tang
* add cmake
* update
* fix proto pd
* fix compile
* tmp save
* fix protobuf device version
* fix protobuf and host compile
* fix std c++11 support on android
* change array to vector to fix ndk c++_static
* fix rt and add dockerfile
* fix android compile issue with latest merge
* init arm kernels
* enable run on arm
* update format
* update format
* update format
-
- 16 May, 2019 1 commit
-
-
Committed by Yan Chunwei
-
- 14 May, 2019 4 commits
- 13 May, 2019 3 commits
- 12 May, 2019 3 commits
- 10 May, 2019 6 commits
-
-
Committed by zhoukunsheng
-
Committed by zhoukunsheng
-
Committed by zhaoyuchen2018
refine code
fuse cublas calling and kernels into one cuda kernel.
test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
Committed by SunGaofeng
-
Committed by qingqing01
* Add conv2d_grad_grad_op
* Extract the cuDNN conv algo searching code into conv_cudnn_helper.h.
- Now use it in conv2d_grad_grad.
- Will simplify the searching code in conv2d and conv2d_grad in the next PR.
* Enhance and fix a bug in the unit testing of gradient_checker.
* Support fetching empty variables, returning None in Python.
-
Committed by Zeng Jinle
test=develop
-
- 09 May, 2019 9 commits
-
-
Committed by superjomn
-
Committed by superjomn
-
Committed by superjomn
-
Committed by Zeng Jinle
-
Committed by Zhaolong Xing
1. inference multi card occupy
2. facebox model inference occupy too much
test=develop
-
Committed by xiaoting
-
Committed by zhoukunsheng
* test=develop add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
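As a rough illustration of how an element-wise floor division and modulo relate, here is a minimal pure-Python sketch. The function names mirror the op names from the commit, but the bodies are hypothetical stand-ins and follow Python's own `//` and `%` semantics (rounding toward negative infinity); the actual operators may handle signs differently.

```python
def elementwise_floordiv(xs, ys):
    """Floor-divide two equal-length integer lists element by element."""
    return [x // y for x, y in zip(xs, ys)]


def elementwise_mod(xs, ys):
    """Take the remainder of two equal-length integer lists element by element."""
    return [x % y for x, y in zip(xs, ys)]


x = [10, -7, 9]
y = [3, 2, -4]
q = elementwise_floordiv(x, y)
r = elementwise_mod(x, y)

# Under Python semantics the two results satisfy x == q * y + r for every element.
assert all(a == qi * b + ri for a, b, qi, ri in zip(x, y, q, r))
```

Checking the identity `x == q * y + r` is one simple way to catch an equation error in a modulo implementation, whichever rounding convention is chosen.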
-
Committed by guru4elephant
* fix train_from_dataset and infer_from_dataset example
* add inductive dim for data_reader; for example, with shape=[-1, 1], the -1 is inferred at run time from the number of elements read
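The shape=[-1, 1] example can be sketched in a few lines of Python. `infer_shape` is a hypothetical helper written for illustration only, not the data_reader code from the commit; it assumes at most one -1 entry and that the element count is known once the data has been read.

```python
def infer_shape(declared_shape, num_elements):
    """Replace a single -1 in declared_shape using the run-time element count."""
    unknown = [i for i, d in enumerate(declared_shape) if d == -1]
    if len(unknown) > 1:
        raise ValueError("at most one dimension may be -1")

    # Product of all dimensions that were declared explicitly.
    known = 1
    for d in declared_shape:
        if d != -1:
            known *= d

    if not unknown:
        if known != num_elements:
            raise ValueError("declared shape does not match element count")
        return list(declared_shape)

    if known == 0 or num_elements % known != 0:
        raise ValueError("element count is not divisible by the known dimensions")

    shape = list(declared_shape)
    shape[unknown[0]] = num_elements // known
    return shape


# Five elements read at run time, declared as shape=[-1, 1].
resolved = infer_shape([-1, 1], 5)
```

With five elements read, the -1 resolves to 5, so the tensor ends up with shape [5, 1].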
-
Committed by chengduo
test=develop
-
- 08 May, 2019 5 commits
-
-
Committed by Huihuang Zheng
* Fix API example code of save_inference_model test=develop
* Add "import" in example of save_inference_model
* Fix typo "exsample" -> "example" test=develop
-
Committed by xiaoting
* modified formula for lrn test=develop
* modified api.spec test=develop
-
Committed by zhaoyuchen2018
* Refine elementwise kernel.
  Add a simple cuda kernel if grad x and y both exist.
  Use 2D block cuda kernel to do broadcast.
  test=develop
  Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
* refine code. test=develop
  Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
* refine code. test=develop
  Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
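For context on why broadcasting matters in the backward pass, here is a minimal pure-Python sketch of the gradient of a broadcast add, assuming out[i][j] = x[i][j] + y[j]. `add_broadcast_grad` is a hypothetical name, and this shows only the underlying arithmetic, not the CUDA kernel the commit describes.

```python
def add_broadcast_grad(dout):
    """Gradients of out[i][j] = x[i][j] + y[j] given the upstream gradient dout.

    dx has the same shape as dout; dy sums dout over the broadcast (row) axis.
    """
    n_rows = len(dout)
    n_cols = len(dout[0])
    dx = [row[:] for row in dout]
    dy = [sum(dout[i][j] for i in range(n_rows)) for j in range(n_cols)]
    return dx, dy


dout = [[1, 2], [3, 4], [5, 6]]
dx, dy = add_broadcast_grad(dout)
```

Because y is reused by every row, its gradient is a column-wise reduction of the upstream gradient, which is the part a 2D thread layout can parallelize.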
-
Committed by lujun
Fix the following API examples:
paddle.fluid.scope_guard
paddle.fluid.backward.append_backward
paddle.fluid.cpu_places
paddle.fluid.cuda_pinned_places
paddle.fluid.cuda_places
paddle.fluid.in_dygraph_mode
paddle.fluid.CUDAPlace
paddle.fluid.CPUPlace
paddle.fluid.CUDAPinnedPlace
-
Committed by Yiqun Liu
* Optimize the cuda implementation of sum_op, which adds two lod_tensors in place. test=develop
* Use eigen to add to tensors. test=develop
-