- 13 5月, 2019 2 次提交
- 12 5月, 2019 4 次提交
- 10 5月, 2019 7 次提交
-
-
由 zhoukunsheng 提交于
-
由 zhoukunsheng 提交于
-
由 zhaoyuchen2018 提交于
refine code fuse cublas calling and kernels into one cuda kernel. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
由 SunGaofeng 提交于
-
由 qingqing01 提交于
* Add conv2d_grad_grad_op * Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h. - Now use it in conv2d_grad_grad. - Will simply the searching code in conv2d and conv2d_grad in next PR. * Enhance and fix bug in unit testing of gradient_checker. * Support to fetch empty variables,return None in Python.
-
由 wopeizl 提交于
* rename the default version from '0.0.0' to 'latest'
-
由 Zeng Jinle 提交于
test=develop
-
- 09 5月, 2019 11 次提交
-
-
由 superjomn 提交于
-
由 superjomn 提交于
-
由 superjomn 提交于
-
由 Zeng Jinle 提交于
-
由 Zhaolong Xing 提交于
1. infernce multi card occupy 2. facebox model inference occupy too much test=develop
-
由 xiaoting 提交于
-
由 zhoukunsheng 提交于
* test=develop add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
-
由 tangwei12 提交于
* fix some logic in distributed transpiler, test=develop * reformat fleet API, test=develop
-
由 lujun 提交于
The parameter dirpath will be passed directly to c++ operater. The file address format will be different under win and UNIX.
-
由 guru4elephant 提交于
* fix train_from_dataset and infer_from_dataset example * add inductive dim for data_reader, example: shape=[-1, 1], then -1 will be inducted through run-time reading of number of elements
-
由 chengduo 提交于
test=develop
-
- 08 5月, 2019 16 次提交
-
-
由 Huihuang Zheng 提交于
* Fix API example code of save_inference_model test=develop * Add "import" in exmaple of save_inference_model * Fix typo "exsample" -> "example" test=develop
-
由 xiaoting 提交于
* modified formula for lrn test=develop * modified api.spec test=develop
-
由 zhaoyuchen2018 提交于
* Refine elementwise kernel. Add a simple cuda kernel if grad x and y both exist Use 2D block cuda kernel to do broadcast. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com> * refine code. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com> * refine code. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
由 Jiabin Yang 提交于
-
由 lujun 提交于
Fix the following API examples: paddle.fluid.scope_guard paddle.fluid.backward.append_backward paddle.fluid.cpu_places paddle.fluid.cuda_pinned_places paddle.fluid.cuda_places paddle.fluid.in_dygraph_mode paddle.fluid.CUDAPlace paddle.fluid.CPUPlace paddle.fluid.CUDAPinnedPlace
-
由 Yiqun Liu 提交于
* Optimize the cuda implementation of sum_op, which add two lod_tensors inplace. test=develop * Use eigen to add to tensors. test=develop
-
由 chengduo 提交于
test=develop
-
由 superjomn 提交于
-
由 Hongyu Liu 提交于
* fix shape_check; test=develop * fix format; test=develop * fix format; test=develop * fix ddim bug; test=develop * fix c++ format; test=develop * change function name; test=develop
-
由 guomingz 提交于
* resolve #17184 Fix the KL algorithm bug when calculated the size of tensor. * Add the missing test=develop. test=develop
-
由 lvmengsi 提交于
* fix API examples, test=develop
-
由 whs 提交于
-
由 baojun 提交于
* added lrn op test=develop * Added CreateConstant method test=develop * avoid duplicates test=develop
-
由 Wojciech Uss 提交于
added printing data type to differentiate int8 and fp32 latency results test=develop
-
由 Zhen Wang 提交于
-
由 chengduo 提交于
* polish doc test=develop * updata parallel executor doc test=develop * update API.spec test=develop * polish code test=develop
-