1. 20 3月, 2018 1 次提交
  2. 19 3月, 2018 1 次提交
  3. 15 3月, 2018 1 次提交
    • K
      Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123
      Kexin Zhao 提交于
      * add fp16 mul op support
      
      * small fix
      
      * fix bug
      
      * small fix
      
      * fix PADDLE_WITH_CUDA compiling issue
      
      * reorg code
      
      * test for pybind
      
      * treate as float16 as uint16_t in pybind
      
      * bind np.float16 to paddle float16
      
      * small fix
      
      * clean code
      
      * remove redundancy
      
      * fix mul_op test
      
      * address comments
      
      * small fix
      
      * add is_float16_supported func
      e26f1123
  4. 14 3月, 2018 1 次提交
  5. 12 3月, 2018 1 次提交
  6. 07 3月, 2018 1 次提交
  7. 26 2月, 2018 1 次提交
  8. 24 2月, 2018 1 次提交
  9. 23 2月, 2018 1 次提交
    • K
      Exposing Channel to be used as a Variable and integrating with Fluid (#8486) · 77ee8fb2
      kavyasrinet 提交于
      * Adding set_capacity method support
      
      * Adding Python for make_channel
      
      * Updating notest_concurrency
      
      * Write python for make_channel method
      
      * Write python for make_channel method
      
      * Fix make_channel and test
      
      * Placeholder ops for channel send, recv and close
      
      * Adding ToTypeIndex method to var_type.h
      
      * Add var_type.h to channel:
      
      * Added POD_Type to the method
      
      * Add CHANNEL to executor
      
      * Updated get and set DataType to accomodate Channels
      
      * Updating get and set to incorporate channels
      
      * Adding CHANNEL as supported VarType in protobuf
      
      * Removing unecessary import
      
      * Fixing VarDesc to adapt to Channel as VarType
      
      * Add channel.h to executor
      
      * Remove innclude from channel
      
      * Updated var_type to support Channel as  var type
      
      * Adding get_channel to pybind
      
      * Added ChannelHolder
      
      * Adding make_channel as an op
      
      * Adding ChannelHolder in channel
      
      * Fixing typo
      
      * Commenting out operators in concurrency
      
      * Removing totypeid right now since we don't need it.
      
      * Reverting python changes
      
      * Fixing typo in framework.py
      
      * Modify comments for ReaderHolder
      77ee8fb2
  10. 12 2月, 2018 1 次提交
  11. 11 2月, 2018 1 次提交
  12. 10 2月, 2018 2 次提交
  13. 31 1月, 2018 1 次提交
    • D
      Fix/lod (#7714) · ae7d1c1f
      dzhwinter 提交于
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "sperate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
      ae7d1c1f
  14. 26 1月, 2018 1 次提交
    • K
      New Run() method for framework::Executor (#7807) · 788f5c6d
      kexinzhao 提交于
      * initial commit
      
      * add new executor run function
      
      * fix bug
      
      * fix multiple definition of feed_fetch_method issue
      
      * fix cmake
      
      * fix tensor copy error
      
      * refine executor code
      
      * add comments
      
      * temporary modification
      
      * address comments
      
      * fix bug
      788f5c6d
  15. 24 1月, 2018 1 次提交
  16. 22 1月, 2018 1 次提交
  17. 16 1月, 2018 1 次提交
  18. 14 1月, 2018 1 次提交
    • D
      "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter 提交于
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv tranpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * ""fix pooling
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporary"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
      5ad1aef0
  19. 05 1月, 2018 1 次提交
    • D
      Feature/use cudnn (#7141) · 5593858d
      dzhwinter 提交于
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
      5593858d
  20. 03 1月, 2018 1 次提交
  21. 25 12月, 2017 2 次提交
  22. 24 12月, 2017 1 次提交
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter 提交于
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
      735eba29
  23. 21 12月, 2017 2 次提交
  24. 20 12月, 2017 2 次提交
  25. 19 12月, 2017 2 次提交
  26. 18 12月, 2017 2 次提交
    • D
      Feature/global context (#6537) · 24fda392
      dzhwinter 提交于
      * "add DeviceContextPool"
      
      * "add devicecontextpool in pybind"
      
      * "add comments in python side "
      
      * "fix static link error"
      
      * "fix CI error"
      
      * "add executor.py"
      
      * "fix CI error"
      
      * "add with gpu macro"
      
      * "remove comment out codes"
      
      * "add TODO items"
      
      * "update init devices"
      24fda392
    • F
      update · b3ea677a
      fengjiayi 提交于
      b3ea677a
  27. 14 12月, 2017 2 次提交
  28. 27 11月, 2017 1 次提交
  29. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5
  30. 24 11月, 2017 1 次提交
  31. 14 11月, 2017 1 次提交
  32. 08 11月, 2017 1 次提交
  33. 06 11月, 2017 1 次提交
    • Y
      Feature/lod tensor array (#5379) · 2be4c3cb
      Yu Yang 提交于
      * Use stable_sort in lod_rank_table
      
      It is easy to debug and test when use `stable_sort`and the time
      complexity is not changed.
      
      * Add LoDTensorArray
      2be4c3cb