1. 14 11月, 2017 4 次提交
    • Q
      add split and merge lod tensor operator (#5537) · f07a226a
      QI JUN 提交于
      * add split lod tensor operator
      
      * add more test cast
      
      * clean code
      
      * add merge lod tensor operator
      
      * fix bug
      
      * clean code
      
      * add grad operator
      
      * make mask support GPU
      
      * add comments
      f07a226a
    • Y
      Assign Operator. (#5531) · 7c1755d9
      Yu Yang 提交于
      * Assign Operator.
      
      Out=X, when type in [LoDTensor/SelectedRows/LoDTensorArray]
      
      * Follow comments
      7c1755d9
    • X
      Fix sequence_pool_op in debug mode · 983502d2
      xuwei06 提交于
      The rank of the tensor from the chip() function is changed. In release mode, eigen_assert is not enabled and the dimenstion mismatch is not detected.
      983502d2
    • X
      Fix matmal_op for debug mode · 6a6e4d8d
      xuwei06 提交于
      The dimension is not set correctly and is not being checked in release mode because eigen_assert is not enabled.
      6a6e4d8d
  2. 13 11月, 2017 6 次提交
    • P
      refine var name · c5d71077
      peterzhang2029 提交于
      c5d71077
    • P
      fix warning · 0a6262d5
      peterzhang2029 提交于
      0a6262d5
    • Q
      BeamSearchDecodeOp (#5498) · a4106278
      Qiao Longfei 提交于
      * init trieconcat_op
      
      * add basic implementation
      
      * add test
      
      * add more test
      
      * update unit test
      
      * add PackAllSteps test
      
      * fix PackAllSteps
      
      * all test passed
      
      * clean code
      
      * remove state inside helper
      
      * rename prob to score
      
      * optimize RemoveFromEnd
      
      * use deconstructor to delete BeamNode recursively
      
      * optimize interface
      
      * add comment to interface
      
      * optimizer data structure
      
      * use template to define the type of score
      
      * use template parameter for BeamHelper
      
      * change father to parent
      
      * rename TrieConcat to BeamSearchOutConcat
      
      * use LoDTensorArray
      
      * rename BeamSearchOutConcat to BeamSearchDecode
      
      * refine code
      
      * remain all candidate sentence in beam_search_decode_op, do not consider endid
      
      * use unique_ptr
      
      * fix compare bug
      
      * fix lod compile problem
      a4106278
    • Y
      trigger ci for lod_reset_op · c6275eca
      Yibing Liu 提交于
      c6275eca
    • Y
      bug fix in lod_reset_op: cast int to size_t in LoD · 9bc71087
      Yibing Liu 提交于
      9bc71087
    • P
      refine notation in bilinear_tensor_product_op.h · 5f99ae90
      peterzhang2029 提交于
      5f99ae90
  3. 11 11月, 2017 4 次提交
  4. 10 11月, 2017 6 次提交
  5. 09 11月, 2017 10 次提交
  6. 08 11月, 2017 10 次提交
    • W
      fix CI · b3a86b6d
      wwhu 提交于
      b3a86b6d
    • D
    • Y
      Static lstm sanity check (#5365) · 870650d8
      Yang Yang(Tony) 提交于
      * add fill_constant_batch_size_like_op to rnn h_boot
      
      * first commit
      
      * merge develop; fix conflict
      
      * update to main_program
      870650d8
    • T
      update · 11ee50ce
      typhoonzero 提交于
      11ee50ce
    • P
      refine memory transform · 47269273
      peterzhang2029 提交于
      47269273
    • T
      fix accuracy cudamemset · 6308ccc2
      typhoonzero 提交于
      6308ccc2
    • Y
      CompareOp's kernel device type is decided by input tensor place · 3187451a
      Yang Yu 提交于
      CompareOp can run on CPU even other operators are running on GPU, since
      opeatations like comparing control flags should be performed only on CPU
      3187451a
    • Y
      Rename shrink_state -> shrink_rnn_memory · 01425309
      Yang Yu 提交于
      Follow comments
      01425309
    • C
      fix attr name · cdf5e871
      chengduoZH 提交于
      cdf5e871
    • Y
      Feature/rnn to array to lod tensor (#5411) · f72729d4
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add skeleton for array_to_lod_tensor and lod_tensor_to_array
      
      * Add VarType::LoDTensorArray
      
      * Add PyBind of LoDTensorArray
      
      * Add InferVarType
      
      * Add first unittest
      
      * Add ut
      
      * Add unittest
      
      * Add unittest
      
      * Add unittests
      
      * update
      
      * init
      
      * add infershape for lod_tensor_to_array_op
      
      * compelete array_to_lod_tensor_op
      
      * copy data
      
      * clean code
      
      * clean code
      
      * Fix unittest data
      
      * fix bugs
      
      * fix compile error
      
      * Refine TensorToArrayOp
      
      * refactor array_to_lod_tensor
      
      * Unittest
      
      * fix bugs
      
      * Fix unittest
      
      * Fix unittest
      
      * debug
      
      * Debug
      
      * Fix unittest
      
      * clean code
      
      * refactor
      
      * use ostream
      
      * update test
      
      * fix gpu build error
      
      * make gpu test pass
      f72729d4