1. 05 10月, 2017 2 次提交
    • Y
      Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c
      Yi Wang 提交于
      4558807c
    • Y
      Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU` · 84500f94
      Yu Yang 提交于
      By shell command
      
      ```bash
      sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
      sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
      ```
      84500f94
  2. 21 8月, 2017 1 次提交
  3. 01 8月, 2017 1 次提交
  4. 09 5月, 2017 1 次提交
  5. 08 5月, 2017 3 次提交
  6. 04 1月, 2017 1 次提交
  7. 20 12月, 2016 1 次提交
  8. 14 12月, 2016 1 次提交
  9. 12 12月, 2016 1 次提交
  10. 09 12月, 2016 1 次提交
  11. 01 12月, 2016 2 次提交
  12. 22 11月, 2016 1 次提交
  13. 02 11月, 2016 1 次提交
    • Q
      Add job=time in trainer, refine cudnn_conv to reduce gpu memory and speed up training. (#218) · 45c81a41
      qingqing01 提交于
      * Add benchmark for PaddlePaddle, tensorflow and caffe
      
      * ConvProjection to reduce memory for goolenet
      
      * Add unit test for ConvProjection.
      1. unit test in test_LayerGrad.
      2. compare the ConvPorjection and CudnnConvLayer, also compare the concat_layer+img_conv_layer and concat_layer_conv_projection.
      
      * Reduce cudnn_conv memory and add benchmark document.
      1. Use TmpMatrix as the workspace in cudnn_conv to reduce gpu memory. It reduce lots of memory.
      2. Add benchmark document.
      3. fix smallnet_mnist_cifar.py in paddle.
      
      * Add job=time and refine cudnn_conv to reduce gpu memroy and speed up
      
      * Refine cudnn_conv and shared biases operation in concat_layer and mixed_layer.
      
      * follow comments
      
      * follow comments
      
      * Use unique_ptr to prevent memory leaks in CudnnConvLayer.
      45c81a41
  14. 09 10月, 2016 1 次提交
  15. 29 8月, 2016 1 次提交