1. 10 Feb, 2018 2 commits
  2. 30 Jan, 2018 1 commit
  3. 09 Jan, 2018 2 commits
  4. 28 Dec, 2017 2 commits
  5. 26 Dec, 2017 2 commits
  6. 12 Dec, 2017 1 commit
    • Refine device context (#6433) · 61ec0b95
      QI JUN committed
      The main changes are:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
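
The first change in the list above (templating math functors on `DeviceContext` rather than `Place`) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `CPUDeviceContext`, `ScaleFunctor`, and `ScaleOnCpu` are hypothetical stand-ins, not PaddlePaddle's actual classes or signatures.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a CPU device context; not PaddlePaddle's real class.
struct CPUDeviceContext {};

// Primary template: the functor is keyed on the device context type itself,
// so the call site does not need a separate Place-to-context lookup.
template <typename DeviceContext, typename T>
struct ScaleFunctor;

// CPU specialization: scales the buffer in place with a plain loop.
template <typename T>
struct ScaleFunctor<CPUDeviceContext, T> {
  void operator()(const CPUDeviceContext& ctx, T* data, std::size_t n,
                  T factor) const {
    assert(data != nullptr || n == 0);
    (void)ctx;  // a real kernel would take streams or handles from the context
    for (std::size_t i = 0; i < n; ++i) data[i] *= factor;
  }
};

// Hypothetical helper showing what a call site might look like.
std::vector<float> ScaleOnCpu(std::vector<float> values, float factor) {
  CPUDeviceContext ctx;
  ScaleFunctor<CPUDeviceContext, float>()(ctx, values.data(), values.size(),
                                          factor);
  return values;
}
```

A GPU variant would presumably add a second specialization for a CUDA context type, leaving call sites unchanged apart from the template argument.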
  7. 04 Dec, 2017 1 commit
    • While op forward for sentiment analysis (#6140) · d5e32794
      Yu Yang committed
      * Add DataFeeder
      
      A v2-API-like data feeder for book demos.
      We can feed data directly from a reader.
      
      * Fix CI
      
      * Add a unittest for while/rnn op forward
      
      * Add unittest for raw while op backward
      
      * Fix CI
  8. 01 Dec, 2017 1 commit
  9. 26 Nov, 2017 1 commit
    • Feature/copytensor (#5455) · 45062fe5
      dzhwinter committed
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
  10. 07 Nov, 2017 1 commit
    • Add unittest, backward of array read/write op (#5409) · 6cde889b
      Yu Yang committed
      * Use stable_sort in lod_rank_table
      
      Using `stable_sort` makes the result easier to debug and test, and the
      time complexity is not changed.
      
      * Add LoDTensorArray
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Stash
      
      * Better debug message for IsInitialized
      
      * Complete array read/write op unittests
      
      * Add unittest, Gradient of array read/write
      
      * Follow comments
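
The `stable_sort` point in the message above can be sketched as follows. This is only an illustration under assumptions: `RankItem` and `BuildRankTable` are hypothetical names, and the real `lod_rank_table` code may differ. Note that `std::stable_sort` runs in O(N log N) comparisons when extra memory is available and O(N log² N) otherwise.

```cpp
#include <algorithm>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical item type: (original sequence index, sequence length).
using RankItem = std::pair<std::size_t, std::size_t>;

// Orders sequences by length, longest first. Because std::stable_sort keeps
// the input order of equal elements, sequences with the same length always
// come out in the same relative order, which makes test output deterministic.
std::vector<RankItem> BuildRankTable(const std::vector<std::size_t>& lengths) {
  std::vector<RankItem> items;
  items.reserve(lengths.size());
  for (std::size_t i = 0; i < lengths.size(); ++i) {
    items.emplace_back(i, lengths[i]);
  }
  std::stable_sort(items.begin(), items.end(),
                   [](const RankItem& a, const RankItem& b) {
                     return a.second > b.second;  // compare lengths only
                   });
  return items;
}
```

With plain `std::sort`, the relative order of equal-length sequences is unspecified, so an expected-output test could pass on one standard library and fail on another.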
  11. 02 Nov, 2017 1 commit
    • Rewrite StaticRNN with Executor (#5224) · 0a32e74d
      Yu Yang committed
      * Init commit
      
      * Make executor use ProgramDescBind
      
      * Change Attribute from BlockDesc to BlockDescBind
      
      * Since we will get the program desc in RNN, just BlockDesc is not
        enough.
      
      * Add DeviceContext to Executor API
      
      * Rewrite RNN
      
      * Pass Python
      
      * AddBiasOp does not care num_flatten_dims
      
      * Stash
      
      * Fix MacOS Compile
      
      * Pass RNN forward
      
      * add python test
      
      * refactor test
      
      * Make compile pass
      
      * add gradopmaker
      
      * First draft done
      
      * Polish code
      
      * add grad op maker and grad infershape
      
      * Polish code
      
      * Fix backward.cc bug
      
      * Fix infershape
      
      * Rename function
      
      * add backward test
      
      * simplify recurrent test
      
      * Update
      
      * Pass unittest
      
      * Add comments & refine test
      
      * Add comments
      
      * refactor test
      
      * Complete Unittest
      
      * fix StepScopes enforce
      
      * Remove unused unittest
      
      * no type error
      
      * Update
      
      * Make RNN Pass unittest
  12. 29 Oct, 2017 1 commit
    • support sparse output for lookup table grad op (#5145) · 008f40ce
      QI JUN committed
      * add sparse support for sum op
      
      * typo fix
      
      * fix gpu build error
      
      * fix unittest error
      
      * typo fix
      
      * infer var type and shape in op_test
      
      * follow comments
      
      * fix build error
      
      * bypass some unittests that depend on NetOp
      
      * support sparse output for lookup table grad op
      
      * refine codes
      
      * fix gpu build error
      
      * fix lookup table grad gpu kernel
      
      * fix ci
      
      * fix ci
      
      * fix ci
      
      * fix bug in lookup_table_grad op
      
      * fix bug in test_word2vec
      
      * register double kernel for some operators
      
      * set is_sparse=True in test_word2vec
      
      * fix lookup table grad op CUDA kernel bug
      
      * disable test_modified_huber_loss_op temporarily
      
      * disable test_lstm_unit_op temporarily
  13. 27 Oct, 2017 1 commit
    • add sparse support for sum op (#5093) · 7f8574c0
      QI JUN committed
      * add sparse support for sum op
      
      * typo fix
      
      * fix gpu build error
      
      * fix unittest error
      
      * typo fix
      
      * infer var type and shape in op_test
      
      * follow comments
      
      * fix build error
      
      * bypass some unittests that depend on NetOp
  14. 03 Oct, 2017 1 commit
  15. 28 Sep, 2017 1 commit
  16. 05 Sep, 2017 1 commit
  17. 17 Aug, 2017 1 commit
  18. 11 Aug, 2017 1 commit
  19. 09 Aug, 2017 1 commit
  20. 07 Aug, 2017 1 commit
  21. 05 Aug, 2017 1 commit
  22. 04 Aug, 2017 2 commits
  23. 03 Aug, 2017 2 commits
  24. 02 Aug, 2017 3 commits
  25. 01 Aug, 2017 2 commits
  26. 31 Jul, 2017 1 commit
  27. 25 Jul, 2017 1 commit
  28. 19 Jul, 2017 2 commits
  29. 17 Jul, 2017 2 commits
    • set correct place for output tensor · 2a03e380
      qijun committed
    • Op variant inputs (#2901) · a0caf234
      Yan Chunwei committed
      * add inputs
      
      * add ut for multiple inputs
      
      * fix AddToLayer
      
      * op_desc -> op_proto
      
      * CreateArgumentOffsetMap -> CreateInOutOffsetMap
      
      * move CreateInOutOffsetMap from OperatorBase to op registry
      
      * arg_idxs_ -> in_out_idxs_