1. 22 12月, 2017 1 次提交
    • D
      "remove GPU Sync Interface" (#6793) · abde3130
      dzhwinter 提交于
      * "remove GPU Sync Interface"
      
      * "fix typo"
      
      * "fix type cast error"
      
      * "fix related Copy with stream"
      
      * "fix failed tests with DevicePool"
      
      * "fix stupid removed position error"
      abde3130
  2. 21 12月, 2017 1 次提交
  3. 20 12月, 2017 1 次提交
  4. 19 12月, 2017 1 次提交
  5. 18 12月, 2017 1 次提交
    • D
      Feature/global context (#6537) · 24fda392
      dzhwinter 提交于
      * "add DeviceContextPool"
      
      * "add devicecontextpool in pybind"
      
      * "add comments in python side "
      
      * "fix static link error"
      
      * "fix CI error"
      
      * "add executor.py"
      
      * "fix CI error"
      
      * "add with gpu macro"
      
      * "remove comment out codes"
      
      * "add TODO items"
      
      * "update init devices"
      24fda392
  6. 14 12月, 2017 2 次提交
  7. 30 11月, 2017 1 次提交
  8. 27 11月, 2017 1 次提交
  9. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5
  10. 24 11月, 2017 2 次提交
  11. 14 11月, 2017 1 次提交
  12. 08 11月, 2017 1 次提交
  13. 07 11月, 2017 1 次提交
    • Y
      ReadFromArray/WriteToArray op (#5407) · c9b57dcc
      Yu Yang 提交于
      * Use stable_sort in lod_rank_table
      
      It is easy to debug and test when use `stable_sort`and the time
      complexity is not changed.
      
      * Add LoDTensorArray
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Complete array read/write op unittests
      c9b57dcc
  14. 06 11月, 2017 1 次提交
    • Y
      Feature/lod tensor array (#5379) · 2be4c3cb
      Yu Yang 提交于
      * Use stable_sort in lod_rank_table
      
      It is easy to debug and test when use `stable_sort`and the time
      complexity is not changed.
      
      * Add LoDTensorArray
      2be4c3cb
  15. 04 11月, 2017 1 次提交
    • Y
      Add LoDRankTable (#5349) · 74849158
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add InferVarType
      74849158
  16. 02 11月, 2017 1 次提交
    • Y
      Rewrite StaticRNN with Executor (#5224) · 0a32e74d
      Yu Yang 提交于
      * Init commit
      
      * Make executor use ProgramDescBind
      
      * Change Attribute from BlockDesc to BlockDescBind
      
      * Since we will get the program desc in RNN, just BlockDesc is not
        enough.
      
      * Add DeviceContext to Executor API
      
      * Rewrite RNN
      
      * Pass Python
      
      * AddBiasOp does not care num_flatten_dims
      
      * Stash
      
      * Fix MacOS Compile
      
      * Pass RNN forward
      
      * add python test
      
      * refactor test
      
      * Make compile pass
      
      * add gradopmaker
      
      * First draft done
      
      * Polish code
      
      * add grad op maker and grad infershape
      
      * Polish code
      
      * Fix backward.cc bug
      
      * Fix infershape
      
      * Rename function
      
      * add backward test
      
      * simplify recurrent test
      
      * Update
      
      * Pass unittest
      
      * Add comments & refine test
      
      * Add comments
      
      * refactor test
      
      * Complete Unittest
      
      * fix StepScopes enforce
      
      * Remove unused unittest
      
      * no type error
      
      * Update
      
      * Make RNN Pass unittest
      0a32e74d
  17. 01 11月, 2017 1 次提交
    • Y
      Feature/executor use program bind (#5196) · 1363ddb6
      Yu Yang 提交于
      * Init commit
      
      * Make executor use ProgramDescBind
      
      * Change Attribute from BlockDesc to BlockDescBind
      
      * Since we will get the program desc in RNN, just BlockDesc is not
        enough.
      1363ddb6
  18. 31 10月, 2017 2 次提交
  19. 29 10月, 2017 1 次提交
    • Q
      support sparse output for lookup table grad op (#5145) · 008f40ce
      QI JUN 提交于
      * add sparse support for sum op
      
      * typo fix
      
      * fix gpu build error
      
      * fix unittest error
      
      * typo fix
      
      * infer var type and shape in op_test
      
      * follow comments
      
      * fix build error
      
      * bypass some unittests depend on NetOp
      
      * support sparse output for lookup table grad op
      
      * refine codes
      
      * fix gpu build error
      
      * fix lookup table grad gpu kernel
      
      * fix ci
      
      * fix ci
      
      * fix ci
      
      * fix bug in lookup_table_grad op
      
      * fix bug in test_word2vec
      
      * register double kernel for some operators
      
      * set is_sparse=True in test_word2vec
      
      * fix lookup table grad op CUDA kernel bug
      
      * disable test_modified_huber_loss_op temporarily
      
      * disable test_lstm_unit_op temporarily
      008f40ce
  20. 28 10月, 2017 1 次提交
    • F
      Python API for inference model saving/load (#5020) · 6783dcee
      fengjiayi 提交于
      * Add `dump_to_file()` for ProgrameDescBind in pybind
      
      * Update
      
      * Add utility.py
      
      * typo
      
      * Fix bugs
      
      * Move add_feed/fetch_components to untility.py
      
      * Compelete dump
      
      * Follow comments
      
      * Change output of Prune() from inference to pointer
      
      * Expose Prune() to Python
      
      * Compelete save/load API of inference model
      
      * Fix errors
      
      * Debuging
      
      * Compelete unit tests
      
      * follow comments
      6783dcee
  21. 27 10月, 2017 4 次提交
  22. 25 10月, 2017 2 次提交
  23. 24 10月, 2017 2 次提交
  24. 21 10月, 2017 2 次提交
  25. 20 10月, 2017 4 次提交
    • Y
      Feature/remove global scope (#4950) · 102a5f34
      Yu Yang 提交于
      * Unify `set_feed_variable` to one method
      
      * Move global scope to python, not in C++
      102a5f34
    • Y
      Unify `set_feed_variable` to one method (#4949) · 42f2dd40
      Yu Yang 提交于
      42f2dd40
    • Y
      Remove template parameter for Tensor methods (#4937) · c532b967
      Yu Yang 提交于
      * Remove template parameter for Tensor methods
      
      * Also check the type is correct when data()
      * Simplize holder_
      
      * Fix accuracy_op
      
      * Register Code
      c532b967
    • Y
      Feature/py executor test (#4922) · 3db52783
      Yu Yang 提交于
      * Implement FC layer with helper
      
      * Update LayerHelper
      
      * Add debug string for Python ProtoBuf
      
      and Rename `Sync` to `Flush`
      
      * Add check of ProtoBuf initialization
      
      * Layer wrapper for FC
      
      * Fix unittest
      
      * Fix CI
      
      * Add code generator
      
      * AttributeChecker Better error log and speicalize bool
      
      Since lots of types can be cast to bool
      
      * Complete mlp, fit_a_line
      
      * Expose get global scope
      
      * Make global scope not thread-safe
      
      1. It is no need to make global scope thread-safe, since it will be
      invoked in Python main thread.
      2. Do not free the global scope when C++ exit. Let the OS free memories,
      otherwise, we need to handle the destroy dependencies.
      
      See
      https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables
      
      * Fix
      
      * Implementation of simple conv_2d layer
      
      * Stash
      
      * Remove private data members in OpRegister
      
      * Fix bugs
      
      * Stash
      
      * Expose FeedFetchList as VarType
      
      * Change ProgramDesc not a global variable
      
      * Polish code style
      
      * Stash
      
      * Correct implement BlockDesc destructor
      
      * Correct implement BlockDesc destructor
      
      * Unify program as parameter name
      
      * Fix bugs
      
      * Add unittest
      
      * Fix unit test error
      
      * Remove unused functions
      
      * Add clone for Python Program
      
      * Working on executor
      
      * Stash
      
      * Add glog as dependencies of ops
      
      * Use VLOG to logging some information is helpful when we debug Paddle
      
      * Expose VarDesc::persistable to Python
      
      * Test executor
      
      * Complete unittest
      
      * Polish code
      
      * Fix merge error
      
      * Follow comment
      
      * Polish Python Code
      3db52783
  26. 19 10月, 2017 3 次提交
    • Y
      Copy Constructor for ProgramDesc (#4895) · 47f773dd
      Yu Yang 提交于
      * Implement FC layer with helper
      
      * Update LayerHelper
      
      * Add debug string for Python ProtoBuf
      
      and Rename `Sync` to `Flush`
      
      * Add check of ProtoBuf initialization
      
      * Layer wrapper for FC
      
      * Fix unittest
      
      * Fix CI
      
      * Add code generator
      
      * AttributeChecker Better error log and speicalize bool
      
      Since lots of types can be cast to bool
      
      * Complete mlp, fit_a_line
      
      * Implementation of simple conv_2d layer
      
      * Fix bugs
      
      * Change ProgramDesc not a global variable
      
      * Polish code style
      
      * Stash
      
      * Correct implement BlockDesc destructor
      
      * Correct implement BlockDesc destructor
      
      * Unify program as parameter name
      
      * Fix bugs
      
      * Add unittest
      
      * Fix unit test error
      
      * Remove unused functions
      
      * Add clone for Python Program
      
      * Compare OpDescBind directly
      47f773dd
    • Y
      Expose VarDesc::persistable to Python (#4911) · f6e1d959
      Yu Yang 提交于
      f6e1d959
    • F
      Fix several bugs in compile time backward and Protobuf desc (#4894) · a204fefe
      fengjiayi 提交于
      * Implement FC layer with helper
      
      * Update LayerHelper
      
      * Add debug string for Python ProtoBuf
      
      and Rename `Sync` to `Flush`
      
      * Add check of ProtoBuf initialization
      
      * Layer wrapper for FC
      
      * Fix unittest
      
      * Fix CI
      
      * Add code generator
      
      * AttributeChecker Better error log and speicalize bool
      
      Since lots of types can be cast to bool
      
      * Complete mlp, fit_a_line
      
      * Implementation of simple conv_2d layer
      
      * Fix bugs
      
      * Correct implement BlockDesc destructor
      
      * Fix bugs
      
      * Fix unit test error
      
      * Follow comments
      a204fefe