1. 29 10月, 2018 1 次提交
    • J
      Rebase of seqpool-max optimization · 458b16f4
      Jacek Czaja 提交于
      test=develop
      
      - Added rough profiling
      
      - Profiled maxpool itself
      
      - First draft of max seqpool optimization (is_test added)
      
      - Added unit tests to seqpool
      
      - Cosmetic fixes
      
      - Fix to UT of Seq pool
      
      Disabled grad checking for sequence max pool when is_test is set to True
      
      -Cosmetic fix to comment
      
      test=develop
      
      - Fix to GPU build
      
      test=develop
      
      - yet another GPU fix for sequence max pool
      
      - Fix to comment
      
      test=develop
      
      - Change to API of sequence_pool
      
      test=develop
      
      - Yet another API spec change
      
      test=develop
      458b16f4
  2. 17 9月, 2018 1 次提交
  3. 30 4月, 2018 1 次提交
  4. 27 4月, 2018 1 次提交
  5. 29 3月, 2018 1 次提交
    • D
      Speed/sequence op1 (#9217) · 8425c2c8
      dzhwinter 提交于
      * "add functors"
      
      * "remove old code"
      
      * "fix"
      
      * "fix ci"
      
      * "add details"
      
      * "fix ci"
      
      * "fix ci"
      
      * "fix ci"
      
      * "fix ci"
      
      * "remove unused code"
      8425c2c8
  6. 12 2月, 2018 1 次提交
  7. 10 2月, 2018 2 次提交
  8. 08 2月, 2018 1 次提交
  9. 31 1月, 2018 1 次提交
    • D
      Fix/lod (#7714) · ae7d1c1f
      dzhwinter 提交于
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "sperate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
      ae7d1c1f
  10. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  11. 03 11月, 2017 1 次提交