提交 · 80c68d38ff3c59e12f48b4e4e88c24c89568fc0a · 机器未来 / Paddle

22 11月, 2016 1 次提交
- L
  
  clang format .cc .h .cpp .c and .hpp file · 80c68d38
  由 Luo Tao 提交于 11月 22, 2016
  
  80c68d38
07 11月, 2016 1 次提交
- L
  
  abstract outputSize function in CNN-related layers (#314) · e802471c
  由 luotao1 提交于 11月 07, 2016
  
  e802471c
02 11月, 2016 1 次提交

Add job=time in trainer, refine cudnn_conv to reduce gpu memory and speed up training. () · 45c81a41

由 qingqing01 提交于 11月 02, 2016

* Add benchmark for PaddlePaddle, tensorflow and caffe

* ConvProjection to reduce memory for goolenet

* Add unit test for ConvProjection.
1. unit test in test_LayerGrad.
2. compare the ConvPorjection and CudnnConvLayer, also compare the concat_layer+img_conv_layer and concat_layer_conv_projection.

* Reduce cudnn_conv memory and add benchmark document.
1. Use TmpMatrix as the workspace in cudnn_conv to reduce gpu memory. It reduce lots of memory.
2. Add benchmark document.
3. fix smallnet_mnist_cifar.py in paddle.

* Add job=time and refine cudnn_conv to reduce gpu memroy and speed up

* Refine cudnn_conv and shared biases operation in concat_layer and mixed_layer.

* follow comments

* follow comments

* Use unique_ptr to prevent memory leaks in CudnnConvLayer.

45c81a41

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致