1. 15 Mar, 2018 1 commit
    • Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123
      Kexin Zhao committed
      * add fp16 mul op support
      
      * small fix
      
      * fix bug
      
      * small fix
      
      * fix PADDLE_WITH_CUDA compiling issue
      
      * reorg code
      
      * test for pybind
      
      * treat float16 as uint16_t in pybind
      
      * bind np.float16 to paddle float16
      
      * small fix
      
      * clean code
      
      * remove redundancy
      
      * fix mul_op test
      
      * address comments
      
      * small fix
      
      * add is_float16_supported func
  2. 14 Mar, 2018 1 commit
  3. 12 Mar, 2018 1 commit
  4. 07 Mar, 2018 1 commit
  5. 26 Feb, 2018 1 commit
  6. 24 Feb, 2018 1 commit
  7. 23 Feb, 2018 1 commit
    • Exposing Channel to be used as a Variable and integrating with Fluid (#8486) · 77ee8fb2
      kavyasrinet committed
      * Adding set_capacity method support
      
      * Adding Python for make_channel
      
      * Updating notest_concurrency
      
      * Write python for make_channel method
      
      * Write python for make_channel method
      
      * Fix make_channel and test
      
      * Placeholder ops for channel send, recv and close
      
      * Adding ToTypeIndex method to var_type.h
      
      * Add var_type.h to channel
      
      * Added POD_Type to the method
      
      * Add CHANNEL to executor
      
      * Updated get and set DataType to accommodate Channels
      
      * Updating get and set to incorporate channels
      
      * Adding CHANNEL as supported VarType in protobuf
      
      * Removing unnecessary import
      
      * Fixing VarDesc to adapt to Channel as VarType
      
      * Add channel.h to executor
      
      * Remove include from channel
      
      * Updated var_type to support Channel as var type
      
      * Adding get_channel to pybind
      
      * Added ChannelHolder
      
      * Adding make_channel as an op
      
      * Adding ChannelHolder in channel
      
      * Fixing typo
      
      * Commenting out operators in concurrency
      
      * Removing totypeid right now since we don't need it.
      
      * Reverting python changes
      
      * Fixing typo in framework.py
      
      * Modify comments for ReaderHolder
  8. 12 Feb, 2018 1 commit
  9. 11 Feb, 2018 1 commit
  10. 10 Feb, 2018 2 commits
  11. 31 Jan, 2018 1 commit
    • Fix/lod (#7714) · ae7d1c1f
      dzhwinter committed
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "separate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
  12. 26 Jan, 2018 1 commit
    • New Run() method for framework::Executor (#7807) · 788f5c6d
      kexinzhao committed
      * initial commit
      
      * add new executor run function
      
      * fix bug
      
      * fix multiple definition of feed_fetch_method issue
      
      * fix cmake
      
      * fix tensor copy error
      
      * refine executor code
      
      * add comments
      
      * temporary modification
      
      * address comments
      
      * fix bug
  13. 24 Jan, 2018 1 commit
  14. 22 Jan, 2018 1 commit
  15. 16 Jan, 2018 1 commit
  16. 14 Jan, 2018 1 commit
    • "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter committed
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv transpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * "fix pooling"
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporary"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
  17. 05 Jan, 2018 1 commit
    • Feature/use cudnn (#7141) · 5593858d
      dzhwinter committed
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
  18. 03 Jan, 2018 1 commit
  19. 25 Dec, 2017 2 commits
  20. 24 Dec, 2017 1 commit
    • Feature/operator run place (#6783) · 735eba29
      dzhwinter committed
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
  21. 21 Dec, 2017 2 commits
  22. 20 Dec, 2017 2 commits
  23. 19 Dec, 2017 2 commits
  24. 18 Dec, 2017 2 commits
    • Feature/global context (#6537) · 24fda392
      dzhwinter committed
      * "add DeviceContextPool"
      
      * "add devicecontextpool in pybind"
      
      * "add comments in python side "
      
      * "fix static link error"
      
      * "fix CI error"
      
      * "add executor.py"
      
      * "fix CI error"
      
      * "add with gpu macro"
      
      * "remove comment out codes"
      
      * "add TODO items"
      
      * "update init devices"
    • update · b3ea677a
      fengjiayi committed
  25. 14 Dec, 2017 2 commits
  26. 27 Nov, 2017 1 commit
  27. 26 Nov, 2017 1 commit
    • Feature/copytensor (#5455) · 45062fe5
      dzhwinter committed
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
  28. 24 Nov, 2017 1 commit
  29. 14 Nov, 2017 1 commit
  30. 08 Nov, 2017 1 commit
  31. 06 Nov, 2017 1 commit
    • Feature/lod tensor array (#5379) · 2be4c3cb
      Yu Yang committed
      * Use stable_sort in lod_rank_table
      
      Using `stable_sort` makes the code easier to debug and test, and the
      time complexity is unchanged.
      
      * Add LoDTensorArray
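The testing benefit mentioned in this commit message can be illustrated with a small sketch. It is written in Python rather than the C++ of the actual change, and the function name `order_by_length_desc` is hypothetical. It relies only on the documented fact that Python's `sorted()` is stable: sequences of equal length keep their input order, so a test has exactly one expected result.

```python
def order_by_length_desc(lengths):
    """Return sequence indices ordered longest-first; ties keep input order."""
    # sorted() is stable, and remains stable when reverse=True is used,
    # so indices with equal lengths stay in their original relative order.
    return sorted(range(len(lengths)), key=lambda i: lengths[i], reverse=True)


# Sequences 1 and 3 tie at length 5; sequences 0 and 2 tie at length 3.
lengths = [3, 5, 3, 5]
print(order_by_length_desc(lengths))  # [1, 3, 0, 2]
```

With an unstable sort, `[3, 1, 2, 0]` would be an equally valid result, which makes exact-match assertions in unit tests fragile.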
  32. 04 Nov, 2017 1 commit
    • Add LoDRankTable (#5349) · 74849158
      Yu Yang committed
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod`, ordered by sequence
      length in descending order. It is useful when implementing dynamic RNN
      and is shared by the dynamic RNN memory, dynamic RNN slice input, and
      dynamic RNN slice output operators.
      
      * Add InferVarType
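A minimal sketch of the idea described in this commit message, not the actual C++ implementation. It assumes that one LoD level is given as a list of offsets (so `[0, 2, 7, 10, 12]` describes four sequences of lengths 2, 5, 3, 2); the function name `build_lod_rank_table` and the `(index, length)` tuple layout are hypothetical.

```python
from typing import List, Tuple


def build_lod_rank_table(offsets: List[int]) -> List[Tuple[int, int]]:
    """Return (sequence_index, length) pairs, longest sequence first."""
    # Each sequence length is the difference between consecutive offsets.
    lengths = [end - start for start, end in zip(offsets, offsets[1:])]
    items = list(enumerate(lengths))
    # Stable sort by length, descending; equal lengths keep input order.
    return sorted(items, key=lambda item: item[1], reverse=True)


table = build_lod_rank_table([0, 2, 7, 10, 12])
print(table)  # [(1, 5), (2, 3), (0, 2), (3, 2)]
```

A dynamic RNN can walk such a table to process the longest sequences first, so that at each time step the sequences still active form a prefix of the table.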
  33. 02 Nov, 2017 1 commit
    • Rewrite StaticRNN with Executor (#5224) · 0a32e74d
      Yu Yang committed
      * Init commit
      
      * Make executor use ProgramDescBind
      
      * Change Attribute from BlockDesc to BlockDescBind
      
      * Since we will get the program desc in RNN, just BlockDesc is not
        enough.
      
      * Add DeviceContext to Executor API
      
      * Rewrite RNN
      
      * Pass Python
      
      * AddBiasOp does not care num_flatten_dims
      
      * Stash
      
      * Fix MacOS Compile
      
      * Pass RNN forward
      
      * add python test
      
      * refactor test
      
      * Make compile pass
      
      * add gradopmaker
      
      * First draft done
      
      * Polish code
      
      * add grad op maker and grad infershape
      
      * Polish code
      
      * Fix backward.cc bug
      
      * Fix infershape
      
      * Rename function
      
      * add backward test
      
      * simplify recurrent test
      
      * Update
      
      * Pass unittest
      
      * Add comments & refine test
      
      * Add comments
      
      * refactor test
      
      * Complete Unittest
      
      * fix StepScopes enforce
      
      * Remove unused unittest
      
      * no type error
      
      * Update
      
      * Make RNN Pass unittest