1. 01 1月, 2018 1 次提交
  2. 28 12月, 2017 1 次提交
  3. 27 12月, 2017 4 次提交
  4. 26 12月, 2017 1 次提交
  5. 25 12月, 2017 3 次提交
    • D
      "add data layout" (#6955) · 7777c811
      dzhwinter 提交于
      * "add data layout"
      
      * "need kernel registry support"
      
      * "fix data layout"
      
      * "reorder include headers"
      
      * "change enum to enum class"
      
      * "fix CI"
      7777c811
    • Q
      Impl kernel hint (#6883) · af0c4c45
      Qiao Longfei 提交于
      * init kernel hint
      
      * fix typo
      
      * rm unused code
      
      * add include in op_kernel.h
      
      * restore op_kernel since it will be moved to op_kernel_type
      
      * change force_cpu to use_cpu
      
      * fix compilation
      af0c4c45
    • D
      GPUPlace to CUDAPlace (#6960) · 0d2235aa
      dzhwinter 提交于
      0d2235aa
  6. 24 12月, 2017 1 次提交
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter 提交于
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
      735eba29
  7. 22 12月, 2017 3 次提交
  8. 21 12月, 2017 4 次提交
  9. 20 12月, 2017 3 次提交
  10. 19 12月, 2017 3 次提交
  11. 18 12月, 2017 2 次提交
    • D
      Feature/global context (#6537) · 24fda392
      dzhwinter 提交于
      * "add DeviceContextPool"
      
      * "add devicecontextpool in pybind"
      
      * "add comments in python side "
      
      * "fix static link error"
      
      * "fix CI error"
      
      * "add executor.py"
      
      * "fix CI error"
      
      * "add with gpu macro"
      
      * "remove comment out codes"
      
      * "add TODO items"
      
      * "update init devices"
      24fda392
    • F
      update · b3ea677a
      fengjiayi 提交于
      b3ea677a
  12. 14 12月, 2017 2 次提交
  13. 12 12月, 2017 1 次提交
  14. 30 11月, 2017 1 次提交
  15. 27 11月, 2017 1 次提交
  16. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5
  17. 24 11月, 2017 2 次提交
  18. 14 11月, 2017 1 次提交
  19. 08 11月, 2017 1 次提交
  20. 07 11月, 2017 1 次提交
    • Y
      ReadFromArray/WriteToArray op (#5407) · c9b57dcc
      Yu Yang 提交于
      * Use stable_sort in lod_rank_table
      
      It is easy to debug and test when use `stable_sort`and the time
      complexity is not changed.
      
      * Add LoDTensorArray
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Complete array read/write op unittests
      c9b57dcc
  21. 06 11月, 2017 1 次提交
    • Y
      Feature/lod tensor array (#5379) · 2be4c3cb
      Yu Yang 提交于
      * Use stable_sort in lod_rank_table
      
      It is easy to debug and test when use `stable_sort`and the time
      complexity is not changed.
      
      * Add LoDTensorArray
      2be4c3cb
  22. 04 11月, 2017 1 次提交
    • Y
      Add LoDRankTable (#5349) · 74849158
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add InferVarType
      74849158
  23. 02 11月, 2017 1 次提交
    • Y
      Rewrite StaticRNN with Executor (#5224) · 0a32e74d
      Yu Yang 提交于
      * Init commit
      
      * Make executor use ProgramDescBind
      
      * Change Attribute from BlockDesc to BlockDescBind
      
      * Since we will get the program desc in RNN, just BlockDesc is not
        enough.
      
      * Add DeviceContext to Executor API
      
      * Rewrite RNN
      
      * Pass Python
      
      * AddBiasOp does not care num_flatten_dims
      
      * Stash
      
      * Fix MacOS Compile
      
      * Pass RNN forward
      
      * add python test
      
      * refactor test
      
      * Make compile pass
      
      * add gradopmaker
      
      * First draft done
      
      * Polish code
      
      * add grad op maker and grad infershape
      
      * Polish code
      
      * Fix backward.cc bug
      
      * Fix infershape
      
      * Rename function
      
      * add backward test
      
      * simplify recurrent test
      
      * Update
      
      * Pass unittest
      
      * Add comments & refine test
      
      * Add comments
      
      * refactor test
      
      * Complete Unittest
      
      * fix StepScopes enforce
      
      * Remove unused unittest
      
      * no type error
      
      * Update
      
      * Make RNN Pass unittest
      0a32e74d