- 22 May, 2019 1 commit
-
-
Committed by tensor-tang
* add cmake
* update
* fix proto pd
* fix compile
* tmp save
* fix protobuf device version
* fix protobuf and host compile
* fix std c++11 support on android
* change array to vector to fix ndk c++_static
* fix rt and add dockerfile
* fix android compile issue with latest merge
* init arm kernels
* enable run on arm
* update format
* update format
* update format
-
- 16 May, 2019 1 commit
-
-
Committed by Yan Chunwei
-
- 14 May, 2019 4 commits
- 13 May, 2019 3 commits
- 12 May, 2019 3 commits
- 10 May, 2019 6 commits
-
-
Committed by zhoukunsheng
-
Committed by zhoukunsheng
-
Committed by zhaoyuchen2018
refine code fuse cublas calling and kernels into one cuda kernel. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
Committed by SunGaofeng
-
Committed by qingqing01
* Add conv2d_grad_grad_op
* Extract the cuDNN conv algo searching code in conv_cudnn_helper.h.
  - Now use it in conv2d_grad_grad.
  - Will simplify the searching code in conv2d and conv2d_grad in next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support to fetch empty variables, return None in Python.
-
Committed by Zeng Jinle
test=develop
-
- 09 May, 2019 9 commits
-
-
Committed by superjomn
-
Committed by superjomn
-
Committed by superjomn
-
Committed by Zeng Jinle
-
Committed by Zhaolong Xing
1. inference multi card occupy
2. facebox model inference occupy too much
test=develop
-
Committed by xiaoting
-
Committed by zhoukunsheng
* test=develop add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
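As a rough illustration of what element-wise modulo and floor division compute, here is a minimal pure-Python sketch. It is not the Paddle kernel: the helper names are reused only for readability, and it assumes Python's floor-based sign convention, which the commit message does not confirm for the actual ops.

```python
def elementwise_floordiv(xs, ys):
    """Floor-divide each element of xs by the matching element of ys."""
    return [x // y for x, y in zip(xs, ys)]


def elementwise_mod(xs, ys):
    """Take each element of xs modulo the matching element of ys.

    Assumption: Python semantics, where the result has the sign of the divisor.
    """
    return [x % y for x, y in zip(xs, ys)]


xs = [7, -7, 9]
ys = [3, 3, 4]
quotients = elementwise_floordiv(xs, ys)
remainders = elementwise_mod(xs, ys)

# The two results are tied together by x == y * (x // y) + (x % y).
for x, y, q, r in zip(xs, ys, quotients, remainders):
    assert x == y * q + r
```

Under this convention, `-7 // 3` is `-3` and `-7 % 3` is `2`, so the remainder is never negative for a positive divisor.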
-
Committed by guru4elephant
* fix train_from_dataset and infer_from_dataset example
* add inferred dim for data_reader; for example, with shape=[-1, 1], the -1 is inferred at run time from the number of elements read
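The `shape=[-1, 1]` case in this commit message can be sketched in plain Python as follows. This is only an illustration of resolving a `-1` dimension from an element count; `infer_shape` is a hypothetical helper, not the data_reader code from the commit.

```python
from math import prod


def infer_shape(shape, num_elements):
    """Return `shape` with a single -1 replaced by the size implied by `num_elements`.

    Hypothetical helper for illustration; not part of Paddle.
    """
    unknown = [i for i, dim in enumerate(shape) if dim == -1]
    if len(unknown) > 1:
        raise ValueError("at most one dimension may be -1")
    if not unknown:
        # Nothing to infer; just check that the shape matches the data.
        if prod(shape) != num_elements:
            raise ValueError("shape does not match the number of elements")
        return list(shape)

    known = prod(dim for dim in shape if dim != -1)
    if known == 0 or num_elements % known != 0:
        raise ValueError("number of elements is not divisible by the known dims")

    resolved = list(shape)
    resolved[unknown[0]] = num_elements // known
    return resolved


print(infer_shape([-1, 1], 6))   # [6, 1]
print(infer_shape([-1, 4], 12))  # [3, 4]
```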
-
Committed by chengduo
test=develop
-
- 08 May, 2019 13 commits
-
-
Committed by Huihuang Zheng
* Fix API example code of save_inference_model test=develop
* Add "import" in example of save_inference_model
* Fix typo "exsample" -> "example" test=develop
-
Committed by xiaoting
* modified formula for lrn test=develop
* modified api.spec test=develop
-
Committed by zhaoyuchen2018
* Refine elementwise kernel. Add a simple cuda kernel if grad x and y both exist. Use 2D block cuda kernel to do broadcast. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
* refine code. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
* refine code. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
Committed by lujun
Fix the following API examples:
paddle.fluid.scope_guard
paddle.fluid.backward.append_backward
paddle.fluid.cpu_places
paddle.fluid.cuda_pinned_places
paddle.fluid.cuda_places
paddle.fluid.in_dygraph_mode
paddle.fluid.CUDAPlace
paddle.fluid.CPUPlace
paddle.fluid.CUDAPinnedPlace
-
Committed by Yiqun Liu
* Optimize the cuda implementation of sum_op, which adds two lod_tensors inplace. test=develop
* Use eigen to add to tensors. test=develop
-
Committed by chengduo
test=develop
-
Committed by Hongyu Liu
* fix shape_check; test=develop
* fix format; test=develop
* fix format; test=develop
* fix ddim bug; test=develop
* fix c++ format; test=develop
* change function name; test=develop
-
Committed by lvmengsi
* fix API examples, test=develop
-
Committed by whs
-
Committed by baojun
* added lrn op test=develop
* Added CreateConstant method test=develop
* avoid duplicates test=develop
-
Committed by Wojciech Uss
added printing data type to differentiate int8 and fp32 latency results test=develop
-
Committed by chengduo
* polish doc test=develop
* update parallel executor doc test=develop
* update API.spec test=develop
* polish code test=develop
-
Committed by gongweibao
-