1. 14 12月, 2017 1 次提交
  2. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  3. 11 12月, 2017 1 次提交
    • Y
      Fix gcc4.9 (#6442) · 95924686
      Yiqun Liu 提交于
      * Fix compiling error of gcc4.9.
      
      * Refine the check of cxx compiler flags in api/CMakeLists.txt.
      95924686
  4. 08 12月, 2017 1 次提交
    • Q
      Nmt decoder train (#6367) · 36fcc95c
      Qiao Longfei 提交于
      * init decoder_trainer
      
      * can run
      
      * fix lod
      
      * add sharelod to cross_entropy_grad_op
      
      * add avg_cost to fetch list
      
      * modify learning rate
      
      * can run
      
      * optimie code
      
      * add early exit
      
      * fix print
      
      * revert test_understand_sentiment_conv.py
      
      * add act to fc
      36fcc95c
  5. 06 12月, 2017 1 次提交
    • Y
      Feature/while op sentiment analysis (#6282) · 229c2e78
      Yu Yang 提交于
      * Add DataFeeder
      
      A v2 API like data feeder for book demos.
      We can feed data directly from reader.
      
      * Fix CI
      
      * Add an unittest for while/rnn op forward
      
      * Add unittest for raw while op backward
      
      * Fix CI
      
      * Complete Dynamic RNN
      229c2e78
  6. 05 12月, 2017 1 次提交
  7. 04 12月, 2017 1 次提交
    • Y
      While op forward for sentimental analysis (#6140) · d5e32794
      Yu Yang 提交于
      * Add DataFeeder
      
      A v2 API like data feeder for book demos.
      We can feed data directly from reader.
      
      * Fix CI
      
      * Add an unittest for while/rnn op forward
      
      * Add unittest for raw while op backward
      
      * Fix CI
      d5e32794
  8. 30 11月, 2017 2 次提交
  9. 28 11月, 2017 1 次提交
    • Send recv op (#5520) · 0a8a86e0
      武毅 提交于
      * WIP send recv op
      
      * WIP send recv
      
      * put grpc impl in details
      
      * put grpc impl in details
      
      * update wip
      
      * update proto
      
      * update proto
      
      * update proto
      
      * clean cmake
      
      * wip on op implementations
      
      * wip on op implementations
      
      * compile ok adding ut
      
      * wip unitest
      
      * add extern cares for linking
      
      * wip add ut
      
      * working version send recv
      
      * revert optimizer.py
      
      * update test cmake
      
      * add libtool to dockerfile
      
      * update cmake dependency
      
      * update cmake depends
      
      * update cmake grpc depends
      
      * fix cmake dependency
      
      * fix compile error
      
      * fix compile
      
      * follow comments
      
      * update
      
      * update copyfrom
      0a8a86e0
  10. 27 11月, 2017 1 次提交
  11. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5
  12. 24 11月, 2017 2 次提交
  13. 18 11月, 2017 2 次提交
  14. 16 11月, 2017 1 次提交
    • Y
      feature/while_grad_op (#5554) · 18f0c40a
      Yang Yang(Tony) 提交于
      * first commit
      
      * Python API for while op
      
      * Python Unittest for simple while_op forward
      
      * fix out to be list
      
      * Fix UT
      
      * VarType
      
      * Fix several bugs
      
      * Fix bug
      
      * Fix bug
      
      * Fix Bug
      
      * Fix bug
      
      * Fix unittest
      
      * Remove debug log
      
      * Add comments
      
      * add PADDLE_ENFORCE
      
      * while_grad_op first commit
      
      * Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs
      
      * not sure how to setdim of while outputs
      
      * push for test
      
      * add executor vlog
      
      * fix bug of while_op cond
      
      * Several enhancement for code
      
      1. Backward always infer shape & infer var type. Since there are RENAME
      variables will be created when creating backward operator, but their
      shape & var types are not inferenced.
      2. Never use SomePtr-> directly, since every pointer could be nullptr if
      it is a function return value. Add `detail::Ref` to cast pointer to
      reference safely.
      3. Enhance error message for backward.
      4. Infer data type of variable in `sum` and `tensor_write`
      
      * Fix bugs of while_op gradient
      
      * Fix several bugs of while_op grad
      
      * fix fill zeros like
      
      * fix 3 >= 3
      
      * fix place holder shouldn't be null
      
      * fail on sum op
      
      * Fix SumOp of TensorList
      
      * clean up
      
      * pass while test
      
      * fix test_array_write_read
      
      * pass sum op
      
      * Support int/int64 for fill_constant_batch_size_like
      
      * Fix compile
      18f0c40a
  15. 15 11月, 2017 1 次提交
  16. 14 11月, 2017 2 次提交
    • Y
      Conditional Block Forward (#5530) · 488320a7
      Yu Yang 提交于
      * Conditional Block Forward
      
      * Assign Operator.
      
      Out=X, when type in [LoDTensor/SelectedRows/LoDTensorArray]
      
      * Stash
      
      * Add Scope::Rename
      
      it is useful in gradient phase of an operator with block
      
      * ConditionalBlock Grad Done
      
      * Add comments
      
      * yapf format code
      488320a7
    • Y
      Assign Operator. (#5531) · 7c1755d9
      Yu Yang 提交于
      * Assign Operator.
      
      Out=X, when type in [LoDTensor/SelectedRows/LoDTensorArray]
      
      * Follow comments
      7c1755d9
  17. 11 11月, 2017 1 次提交
  18. 10 11月, 2017 1 次提交
    • Y
      feature/while_op (#5502) · 40367d18
      Yang Yang(Tony) 提交于
      * first commit
      
      * Python API for while op
      
      * Python Unittest for simple while_op forward
      
      * fix out to be list
      
      * Fix UT
      
      * VarType
      
      * Fix several bugs
      
      * Fix bug
      
      * Fix bug
      
      * Fix Bug
      
      * Fix bug
      
      * Fix unittest
      
      * Remove debug log
      
      * Add comments
      
      * add PADDLE_ENFORCE
      
      * while_grad_op first commit
      
      * Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs
      
      * refine code
      
      * fix unittest bug
      40367d18
  19. 09 11月, 2017 1 次提交
  20. 08 11月, 2017 6 次提交
    • Y
      Fix CI Compile · 0ede2a73
      Yang Yu 提交于
      0ede2a73
    • Y
      Feature/rnn to array to lod tensor (#5411) · f72729d4
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add skeleton for array_to_lod_tensor and lod_tensor_to_array
      
      * Add VarType::LoDTensorArray
      
      * Add PyBind of LoDTensorArray
      
      * Add InferVarType
      
      * Add first unittest
      
      * Add ut
      
      * Add unittest
      
      * Add unittest
      
      * Add unittests
      
      * update
      
      * init
      
      * add infershape for lod_tensor_to_array_op
      
      * compelete array_to_lod_tensor_op
      
      * copy data
      
      * clean code
      
      * clean code
      
      * Fix unittest data
      
      * fix bugs
      
      * fix compile error
      
      * Refine TensorToArrayOp
      
      * refactor array_to_lod_tensor
      
      * Unittest
      
      * fix bugs
      
      * Fix unittest
      
      * Fix unittest
      
      * debug
      
      * Debug
      
      * Fix unittest
      
      * clean code
      
      * refactor
      
      * use ostream
      
      * update test
      
      * fix gpu build error
      
      * make gpu test pass
      f72729d4
    • Y
      Rewrite fill_constant op · 5ee62383
      Yu Yang 提交于
      5ee62383
    • Y
      Polish OpWithKernel · bbdac7f7
      Yu Yang 提交于
      * Chage `IndicateDataType` to `GetKernelType`. Make it easier to
        understand.
      * Change `OpKernelKey` to `OpKernelType`
      * Make operator developers can customize which kernel the operator will
        use in runtime.
      bbdac7f7
    • Y
      Compare Operator (#5325) · f74fb790
      Yu Yang 提交于
      * Compare Operator
      
      * Follow comments
      f74fb790
    • Q
      Check errors for the cuda kernel calls. (#5436) · 58db07b7
      qingqing01 提交于
      58db07b7
  21. 07 11月, 2017 3 次提交
    • Y
      Add unittest, backward of array read/write op (#5409) · 6cde889b
      Yu Yang 提交于
      * Use stable_sort in lod_rank_table
      
      It is easy to debug and test when use `stable_sort`and the time
      complexity is not changed.
      
      * Add LoDTensorArray
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Complete array read/write op unittests
      
      * Add unittest, Gradient of array read/write
      
      * Follow comments
      6cde889b
    • Y
      Update lod_tensor.md (#5383) · 70154597
      Yang Yang(Tony) 提交于
      An important change on lod tensor indexing. A higher level offset will be based on its next level rather than an absolute offset.
      70154597
    • Y
      ReadFromArray/WriteToArray op (#5407) · c9b57dcc
      Yu Yang 提交于
      * Use stable_sort in lod_rank_table
      
      It is easy to debug and test when use `stable_sort`and the time
      complexity is not changed.
      
      * Add LoDTensorArray
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Complete array read/write op unittests
      c9b57dcc
  22. 06 11月, 2017 3 次提交
  23. 05 11月, 2017 1 次提交
  24. 04 11月, 2017 2 次提交
    • Q
      Add acc test to image classification (#5336) · 906e2565
      Qiao Longfei 提交于
      * add acc layer
      * memory log level change from 3 to 10
      * use gaussian random to init conv parameters
      * use initializer
      * fix import
      * batch_norm use helper to create persistable var
      * refine code
      * train only 2 batches for test
      * use g_program and g_init_program
      * use XavierInitializer to init fc parameter
      906e2565
    • Y
      Add LoDRankTable (#5349) · 74849158
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add InferVarType
      74849158
  25. 03 11月, 2017 1 次提交
  26. 02 11月, 2017 1 次提交
    • Y
      Rewrite StaticRNN with Executor (#5224) · 0a32e74d
      Yu Yang 提交于
      * Init commit
      
      * Make executor use ProgramDescBind
      
      * Change Attribute from BlockDesc to BlockDescBind
      
      * Since we will get the program desc in RNN, just BlockDesc is not
        enough.
      
      * Add DeviceContext to Executor API
      
      * Rewrite RNN
      
      * Pass Python
      
      * AddBiasOp does not care num_flatten_dims
      
      * Stash
      
      * Fix MacOS Compile
      
      * Pass RNN forward
      
      * add python test
      
      * refactor test
      
      * Make compile pass
      
      * add gradopmaker
      
      * First draft done
      
      * Polish code
      
      * add grad op maker and grad infershape
      
      * Polish code
      
      * Fix backward.cc bug
      
      * Fix infershape
      
      * Rename function
      
      * add backward test
      
      * simplify recurrent test
      
      * Update
      
      * Pass unittest
      
      * Add comments & refine test
      
      * Add comments
      
      * refactor test
      
      * Complete Unittest
      
      * fix StepScopes enforce
      
      * Remove unused unittest
      
      * no type error
      
      * Update
      
      * Make RNN Pass unittest
      0a32e74d