1. 26 12月, 2017 1 次提交
  2. 25 12月, 2017 1 次提交
    • Q
      Impl kernel hint (#6883) · af0c4c45
      Qiao Longfei 提交于
      * init kernel hint
      
      * fix typo
      
      * rm unused code
      
      * add include in op_kernel.h
      
      * restore op_kernel since it will be moved to op_kernel_type
      
      * change force_cpu to use_cpu
      
      * fix compilation
      af0c4c45
  3. 22 12月, 2017 1 次提交
  4. 20 12月, 2017 1 次提交
  5. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  6. 28 11月, 2017 1 次提交
  7. 08 11月, 2017 2 次提交
  8. 05 11月, 2017 1 次提交
  9. 04 11月, 2017 1 次提交
    • Q
      Add acc test to image classification (#5336) · 906e2565
      Qiao Longfei 提交于
      * add acc layer
      * memory log level change from 3 to 10
      * use gaussian random to init conv parameters
      * use initializer
      * fix import
      * batch_norm use helper to create persistable var
      * refine code
      * train only 2 batches for test
      * use g_program and g_init_program
      * use XavierInitializer to init fc parameter
      906e2565
  10. 30 10月, 2017 1 次提交
    • Q
      03 image classification (#5192) · 0049ce04
      Qiao Longfei 提交于
      * add batch_norm_layer
      
      * add img_conv_group layer and test
      
      * add check to Tensor.type()
      
      * forward can run
      
      * with backward
      
      * change label data time from int32 to int64
      
      * refine code
      
      * follow comment
      0049ce04
  11. 25 10月, 2017 1 次提交
    • Q
      CPU Batch Norm Op (#4964) · ee998a9c
      Qiao Longfei 提交于
      * init batch norm op
      
      * prepare input output
      
      * compute mean_out var_out save_mean save_var on CPU
      
      * active is test
      
      * use eigen to do computation
      
      * complete batch norm forward
      
      * set default momentum to 0.9
      
      * add batch norm grad op in CPU
      
      * add tensor_format and NHWC support, add python test
      
      * add test training
      
      * add batch norm gradient test
      
      * improve comment, fix foward Python UnitTest
      
      * add gradient test
      
      * fix eigen warning
      
      * follow name style
      
      * fix a bug
      
      * change float to T
      
      * add simple forward test
      
      * test with different place
      
      * add backward test
      
      * refine python test
      
      * remove old python test code
      
      * code clean
      
      * follow code style
      
      * update comment
      ee998a9c