1. 07 6月, 2018 1 次提交
    • M
      Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel 提交于
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
      can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
      is hardcode to be used in all MKLDNN op kernels. As a result,
      non-optimized execution path is selected in MKLDNN primitive which
      bring worse performance.
      Besides framework change, three MKLDNN OP kernels were updated
      for using new MKLDNN layout. They are conv/pool2d/batch_norm.
      Other MKLDNN OP kernels need be also updated in similar way to
      achieve best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inferitance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
      3ff9ba0e
  2. 06 6月, 2018 1 次提交
  3. 18 4月, 2018 2 次提交
  4. 16 4月, 2018 2 次提交
  5. 26 3月, 2018 3 次提交
  6. 20 3月, 2018 1 次提交
  7. 13 2月, 2018 1 次提交
  8. 12 2月, 2018 1 次提交
  9. 10 2月, 2018 2 次提交
  10. 09 2月, 2018 1 次提交
  11. 08 2月, 2018 1 次提交
  12. 31 1月, 2018 1 次提交
    • D
      Fix/lod (#7714) · ae7d1c1f
      dzhwinter 提交于
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "sperate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
      未验证
      ae7d1c1f
  13. 08 1月, 2018 1 次提交
    • Q
      cpu gpu transform function (#7191) · 0f353ab4
      Qiao Longfei 提交于
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
      未验证
      0f353ab4
  14. 27 12月, 2017 1 次提交
  15. 25 12月, 2017 1 次提交
    • D
      "add data layout" (#6955) · 7777c811
      dzhwinter 提交于
      * "add data layout"
      
      * "need kernel registry support"
      
      * "fix data layout"
      
      * "reorder include headers"
      
      * "change enum to enum class"
      
      * "fix CI"
      未验证
      7777c811
  16. 21 12月, 2017 1 次提交
  17. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      未验证
      45062fe5
  18. 02 11月, 2017 1 次提交
    • Y
      Rewrite StaticRNN with Executor (#5224) · 0a32e74d
      Yu Yang 提交于
      * Init commit
      
      * Make executor use ProgramDescBind
      
      * Change Attribute from BlockDesc to BlockDescBind
      
      * Since we will get the program desc in RNN, just BlockDesc is not
        enough.
      
      * Add DeviceContext to Executor API
      
      * Rewrite RNN
      
      * Pass Python
      
      * AddBiasOp does not care num_flatten_dims
      
      * Stash
      
      * Fix MacOS Compile
      
      * Pass RNN forward
      
      * add python test
      
      * refactor test
      
      * Make compile pass
      
      * add gradopmaker
      
      * First draft done
      
      * Polish code
      
      * add grad op maker and grad infershape
      
      * Polish code
      
      * Fix backward.cc bug
      
      * Fix infershape
      
      * Rename function
      
      * add backward test
      
      * simplify recurrent test
      
      * Update
      
      * Pass unittest
      
      * Add comments & refine test
      
      * Add comments
      
      * refactor test
      
      * Complete Unittest
      
      * fix StepScopes enforce
      
      * Remove unused unittest
      
      * no type error
      
      * Update
      
      * Make RNN Pass unittest
      未验证
      0a32e74d
  19. 30 10月, 2017 1 次提交
    • Q
      03 image classification (#5192) · 0049ce04
      Qiao Longfei 提交于
      * add batch_norm_layer
      
      * add img_conv_group layer and test
      
      * add check to Tensor.type()
      
      * forward can run
      
      * with backward
      
      * change label data time from int32 to int64
      
      * refine code
      
      * follow comment
      未验证
      0049ce04
  20. 26 10月, 2017 1 次提交
    • Y
      Feature/save op (#5090) · efc2464f
      Yu Yang 提交于
      * Init
      
      * Stash
      
      * Polish SaveLoadOp
      
      * Fix CI
      
      * Polish code
      
      * Save GPU Tensor
      
      * Stash
      
      * Fix CI
      efc2464f
  21. 25 10月, 2017 1 次提交
    • D
      "Serialize LoDTensor, Save/Restore model" (#4602) · fd2eb550
      dzhwinter 提交于
      * "add model format design doc"
      
      * "add restore function"
      
      * "add parse protobuf"
      
      * "move necessary information to saver.proto"
      
      * "format code"
      
      * "add gpu option"
      
      * "add lod info"
      
      * "add saveop python test wrapper"
      
      * "checkpoint reuse save operator"
      
      * "rewrite model format design doc"
      
      * "async support needed"
      
      * "fix run once"
      
      * "fix doc based on comments"
      
      * "refine based on comments"
      
      * "fix based comments"
      
      * "remove persistable flag from framework.proto"
      
      * "add IndicateDataType to restore op"
      
      * "add save test"
      
      * "modify save restore code"
      
      * "modified the restore logic"
      
      * rm checkpoint_op.cc
      
      * rm test_checkpoint_op.py
      
      * "get inputs outputs name from execution context"
      
      * Saving each variable to a independent file
      
      * Fix bugs
      
      * Rewrite save_restore_op_test with new Python framework
      
      * Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase
      
      * Refine unit test of SaveOp and RestoreOp
      
      * fix compile errorwq
      fd2eb550
  22. 20 10月, 2017 1 次提交
  23. 17 10月, 2017 2 次提交
    • C
      add forward computation of crf operator. · cc220eec
      caoying03 提交于
      cc220eec
    • Y
      Rewrite feed/fetch op (#4815) · 4df6cf4d
      Yu Yang 提交于
      * Feed/Fetch op just plain operator, not a OpWithKernel
      * Do not register OpInfoMaker since Feed/Fetch will never be
        configured by users
      * Feed/Fetch op has empty gradient
      * Feed/Fetch op do not hard code `feed_variable`, `fetch_variable` as
        its input and output, make it as a plain Operator input/output
      4df6cf4d
  24. 12 10月, 2017 1 次提交
  25. 10 10月, 2017 1 次提交
  26. 28 9月, 2017 1 次提交
  27. 23 9月, 2017 2 次提交
  28. 15 9月, 2017 1 次提交
  29. 12 9月, 2017 1 次提交
  30. 08 9月, 2017 1 次提交
  31. 07 9月, 2017 1 次提交
  32. 06 9月, 2017 1 次提交
  33. 05 9月, 2017 1 次提交
反馈
建议
客服 返回
顶部