1. 16 Mar, 2018 1 commit
    • Demonstration of CMake refinement for HIP support. · 45c988d8
      sabreshao committed
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external modules, such as Eigen, may need a HIP port.
      4. Add macros hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
  2. 07 Mar, 2018 1 commit
  3. 10 Feb, 2018 2 commits
  4. 26 Jan, 2018 1 commit
    • New Run() method for framework::Executor (#7807) · 788f5c6d
      kexinzhao committed
      * initial commit
      
      * add new executor run function
      
      * fix bug
      
      * fix multiple definition of feed_fetch_method issue
      
      * fix cmake
      
      * fix tensor copy error
      
      * refine executor code
      
      * add comments
      
      * temporary modification
      
      * address comments
      
      * fix bug
  5. 16 Jan, 2018 1 commit
  6. 01 Jan, 2018 1 commit
  7. 27 Dec, 2017 1 commit
  8. 19 Dec, 2017 1 commit
  9. 18 Dec, 2017 1 commit
    • Feature/global context (#6537) · 24fda392
      dzhwinter committed
      * "add DeviceContextPool"
      
      * "add devicecontextpool in pybind"
      
      * "add comments in python side "
      
      * "fix static link error"
      
      * "fix CI error"
      
      * "add executor.py"
      
      * "fix CI error"
      
      * "add with gpu macro"
      
      * "remove comment out codes"
      
      * "add TODO items"
      
      * "update init devices"
  10. 30 Nov, 2017 1 commit
  11. 26 Nov, 2017 1 commit
    • Feature/copytensor (#5455) · 45062fe5
      dzhwinter committed
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
  12. 28 Oct, 2017 1 commit
    • Python API for inference model saving/load (#5020) · 6783dcee
      fengjiayi committed
      * Add `dump_to_file()` for ProgrameDescBind in pybind
      
      * Update
      
      * Add utility.py
      
      * typo
      
      * Fix bugs
      
      * Move add_feed/fetch_components to utility.py
      
      * Complete dump
      
      * Follow comments
      
      * Change output of Prune() from inference to pointer
      
      * Expose Prune() to Python
      
      * Complete save/load API of inference model
      
      * Fix errors
      
      * Debugging
      
      * Complete unit tests
      
      * follow comments
  13. 24 Oct, 2017 1 commit
  14. 16 Oct, 2017 1 commit
  15. 12 Oct, 2017 1 commit
    • Cudnn conv op (#4195) · a3ccbdb3
      武毅 committed
      * add cudnn_conv_op
      
      * WIP
      
      * update
      
      * update
      
      * fix grad check
      
      * use platform::memory
      
      * add group support for cudnn
      
      * update
      
      * follow comments
      
      * fix onlycpu build
      
      * update cuda define
      
      * follow comments
      
      * follow comments
      
      * merge with updates
      
      * fix compile error
      
      * follow comments
      
      * follow comments
  16. 07 Oct, 2017 1 commit
  17. 28 Sep, 2017 1 commit
  18. 27 Sep, 2017 2 commits
  19. 23 Sep, 2017 1 commit
  20. 06 Sep, 2017 1 commit
  21. 01 Sep, 2017 2 commits
  22. 30 Aug, 2017 2 commits
  23. 25 Aug, 2017 1 commit
  24. 23 Aug, 2017 1 commit
  25. 07 Aug, 2017 1 commit
  26. 03 Aug, 2017 1 commit
  27. 02 Aug, 2017 2 commits
  28. 01 Aug, 2017 2 commits
  29. 29 Jul, 2017 1 commit
    • RecurrentOp implementation (#2890) · aee0d3ec
      Yan Chunwei committed
      * add rnn op interfaces
      
      * add Run
      
      * rename state -> memory
      
      * change state -> memory
      
      * make compilable
      
      * add .cc
      
      * init test
      
      * add op fake implementation
      
      * add CreateStepNet and CreateScopes implementation.
      
      * add TODO list
      
      * init memory attributes.
      
      * add LinkMemories
      
      * add PlainNet fake implementation
      
      * Use std::shared_ptr<Scope> in the OpRunContext.
      
      * add test
      
      * disable mutable_data
      
      * finish SegmentInput function
      
      * enable mutable_data with a trick
      
      * RNNOp test.
      
      * enable LinkMemories with mutable_data
      
      * update SegmentInput function with comments
      
      * finish ConcatOutput function
      
      * reformat inputs and attributes
      
      boot_memories
      
      * Refine unit test.
      
      * Refine unit test.
      
      * modify inlinks.
      
      * add OpDesc to Net
      
      * fix bug and update unit test.
      
      * move step scopes from inputs to outputs
      
      * fix merge conflict, update SegmentInput function
      
      * add RecurrentOpProtoAndCheckerMaker.
      
      * clean the codes
      
      * Abstract GetStepScopes and GetMaxSeqLen function
      
      * refine LinkMemories
      
      * Refine code and add some comments.
      
      * add backward core
      
      * update for develop branch.
      
      * add forward core
      
      * add forward algorithm
      
      * Add RecurrentGradientAlgorithm implementation.
      
      * use CopyFrom and Slice function in RecurrentOp
      
      * add unit test for LinkMemories.
      
      * fix unit test.
      
      * use the latest tensor.h, solve conflict
      
      * add maker
      
      * move SegmentInput and ConcatOutput to details namespace
      
      * unit test for RecurrentGradientAlgorithm.
      
      * apply OperatorBase
      
      * apply net operator.
      
      * move memories to attributes
      
      * add RecurrentGradientOp
      
      * open test unit test in recurrent_network_op_test.
      
      * revert some files.
      
      * add RecurrentArgument and Link struct to simplify member variable.
      
      * rename.
      
      * move recurrent_op from framework to operators
      
      * add RecurrentGradientOp Init
      
      * fix name
      
      * fix Link.interal/external name
      
      * use namespace operators instead of framework
      
      * clean the code
      
      * use the latest add_op and mul_op, don't test backward now
      
      * Remove ScopePtr and OperatorPtr
      
      * add get_net to pybind
      
      * add test_recurrent_op.py
      
      * add random into gen_tensor
      
      * update to develop branch and refine some code.
      
      * add some comments.
  30. 25 Jul, 2017 1 commit
  31. 21 Jul, 2017 1 commit
    • add cross-entropy-op (#2965) · 26857659
      Qiao Longfei committed
      * add cross-entropy-op
      
      * add infershape and compute
      
      * implement Infershape and compute of onehotcrossentropy op
  32. 19 Jul, 2017 1 commit
  33. 18 Jul, 2017 1 commit
    • Skeleton of fully connected operator · d3373c5b
      Yu Yang committed
      * The FC operator is a grouped operator, composed of many internal
        operators.
      * InferShape & Run an FC operator in Python.
  34. 17 Jul, 2017 1 commit