1. 28 11月, 2017 1 次提交
    • Send recv op (#5520) · 0a8a86e0
      武毅 提交于
      * WIP send recv op
      
      * WIP send recv
      
      * put grpc impl in details
      
      * put grpc impl in details
      
      * update wip
      
      * update proto
      
      * update proto
      
      * update proto
      
      * clean cmake
      
      * wip on op implementations
      
      * wip on op implementations
      
      * compile ok adding ut
      
      * wip unitest
      
      * add extern cares for linking
      
      * wip add ut
      
      * working version send recv
      
      * revert optimizer.py
      
      * update test cmake
      
      * add libtool to dockerfile
      
      * update cmake dependency
      
      * update cmake depends
      
      * update cmake grpc depends
      
      * fix cmake dependency
      
      * fix compile error
      
      * fix compile
      
      * follow comments
      
      * update
      
      * update copyfrom
      0a8a86e0
  2. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5
  3. 08 11月, 2017 1 次提交
    • Y
      Feature/rnn to array to lod tensor (#5411) · f72729d4
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add skeleton for array_to_lod_tensor and lod_tensor_to_array
      
      * Add VarType::LoDTensorArray
      
      * Add PyBind of LoDTensorArray
      
      * Add InferVarType
      
      * Add first unittest
      
      * Add ut
      
      * Add unittest
      
      * Add unittest
      
      * Add unittests
      
      * update
      
      * init
      
      * add infershape for lod_tensor_to_array_op
      
      * compelete array_to_lod_tensor_op
      
      * copy data
      
      * clean code
      
      * clean code
      
      * Fix unittest data
      
      * fix bugs
      
      * fix compile error
      
      * Refine TensorToArrayOp
      
      * refactor array_to_lod_tensor
      
      * Unittest
      
      * fix bugs
      
      * Fix unittest
      
      * Fix unittest
      
      * debug
      
      * Debug
      
      * Fix unittest
      
      * clean code
      
      * refactor
      
      * use ostream
      
      * update test
      
      * fix gpu build error
      
      * make gpu test pass
      f72729d4
  4. 06 11月, 2017 1 次提交
  5. 26 10月, 2017 2 次提交
  6. 25 10月, 2017 1 次提交
    • D
      "Serialize LoDTensor, Save/Restore model" (#4602) · fd2eb550
      dzhwinter 提交于
      * "add model format design doc"
      
      * "add restore function"
      
      * "add parse protobuf"
      
      * "move necessary information to saver.proto"
      
      * "format code"
      
      * "add gpu option"
      
      * "add lod info"
      
      * "add saveop python test wrapper"
      
      * "checkpoint reuse save operator"
      
      * "rewrite model format design doc"
      
      * "async support needed"
      
      * "fix run once"
      
      * "fix doc based on comments"
      
      * "refine based on comments"
      
      * "fix based comments"
      
      * "remove persistable flag from framework.proto"
      
      * "add IndicateDataType to restore op"
      
      * "add save test"
      
      * "modify save restore code"
      
      * "modified the restore logic"
      
      * rm checkpoint_op.cc
      
      * rm test_checkpoint_op.py
      
      * "get inputs outputs name from execution context"
      
      * Saving each variable to a independent file
      
      * Fix bugs
      
      * Rewrite save_restore_op_test with new Python framework
      
      * Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase
      
      * Refine unit test of SaveOp and RestoreOp
      
      * fix compile errorwq
      fd2eb550
  7. 24 10月, 2017 1 次提交
  8. 23 10月, 2017 1 次提交
  9. 21 10月, 2017 1 次提交
  10. 19 10月, 2017 3 次提交
  11. 18 10月, 2017 1 次提交
  12. 15 10月, 2017 3 次提交
  13. 05 10月, 2017 2 次提交
    • Y
      Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c
      Yi Wang 提交于
      4558807c
    • Y
      Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU` · 84500f94
      Yu Yang 提交于
      By shell command
      
      ```bash
      sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
      sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
      ```
      84500f94
  14. 28 9月, 2017 1 次提交
  15. 23 9月, 2017 3 次提交
  16. 13 9月, 2017 2 次提交
  17. 11 9月, 2017 2 次提交
  18. 07 9月, 2017 1 次提交
  19. 31 8月, 2017 2 次提交
  20. 14 8月, 2017 2 次提交
    • L
      Fix macro !defined(PADDLE_ONLY_CPU) · 9038b849
      liaogang 提交于
      9038b849
    • Y
      remove detail from LODTensor (#3364) · 1ee633d1
      Yan Chunwei 提交于
      * remove SliceCopied
      
      * remove SliceCopied
      
      * rename SliceShared to SliceLevels, SliceInLevel
      
      * merge lod_tensor/details
      
      * remove lod_start_pos_'s shared_ptr
      
      * make lod-tensor a special tensor
      
      * add clone to lod_tensor
      
      * add lod tensor impl
      
      * add lodtensor clone test
      
      * init lod
      
      * add equal
      
      * merge LOD and its methods
      
      * recover tensor and variable
      
      * change thrust to host_vector
      1ee633d1
  21. 09 8月, 2017 1 次提交