- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 25 12月, 2017 3 次提交
-
-
由 dzhwinter 提交于
* "add data layout" * "need kernel registry support" * "fix data layout" * "reorder include headers" * "change enum to enum class" * "fix CI"
-
由 Qiao Longfei 提交于
* init kernel hint * fix typo * rm unused code * add include in op_kernel.h * restore op_kernel since it will be moved to op_kernel_type * change force_cpu to use_cpu * fix compilation
-
由 dzhwinter 提交于
-
- 24 12月, 2017 1 次提交
-
-
由 dzhwinter 提交于
* "change operator interface" * "move devicepool to device_context" * "fix operator test" * "fix op_registry Run interface" * "net op passed. Need to fix nccl multi-Context" * "add nccl group function" * "add nccl group function" * "fix gpu count exceed 32 error" * "fix recurrent op, nccl op" * "change the other operators interface with Place" * "fix typo" * "fix pybind" * "fix device in python side" * "fix pybind failed" * "add init for test" * "fix CI"
-
- 22 12月, 2017 3 次提交
- 21 12月, 2017 4 次提交
-
-
由 typhoonzero 提交于
-
由 typhoonzero 提交于
-
由 fengjiayi 提交于
-
由 Yu Yang 提交于
* Rename XXDescBind --> XXDesc * Fix Compile
-
- 20 12月, 2017 3 次提交
- 19 12月, 2017 3 次提交
-
-
由 fengjiayi 提交于
-
由 qiaolongfei 提交于
-
由 fengjiayi 提交于
-
- 18 12月, 2017 2 次提交
-
-
由 dzhwinter 提交于
* "add DeviceContextPool" * "add devicecontextpool in pybind" * "add comments in python side " * "fix static link error" * "fix CI error" * "add executor.py" * "fix CI error" * "add with gpu macro" * "remove comment out codes" * "add TODO items" * "update init devices"
-
由 fengjiayi 提交于
-
- 14 12月, 2017 2 次提交
- 12 12月, 2017 1 次提交
-
-
由 typhoonzero 提交于
-
- 30 11月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 27 11月, 2017 1 次提交
-
-
由 dangqingqing 提交于
-
- 26 11月, 2017 1 次提交
-
-
由 dzhwinter 提交于
* "make global tensor function independently" * "replace functor" * "fix inline template error" * "fix tensor array with CopyFrom" * "fix other case use CopyFrom" * "move the op interface hardly" * "fix operators" * "fix typo" * "delete dynamic recurrent rnn and fix gru_unit in debugmode" * "fix unique_ptr copy" * "fix cuda copy" * "fix namespace error" * "removed nccl python test" * "fix include error" * "fix typo" * fix copy util test
-
- 24 11月, 2017 2 次提交
-
-
由 QI JUN 提交于
* is_training to is_test in dropout op * handle dropout and batch_norm operator when prune pdesc in testing mode * handle dropout and batch_norm operator when prune pdesc in testing mode * add get_inference_program method * fix dropout op * fix ci * test data after each batch training * refine code * refine test_book3 * fix ci * follow comments
-
由 fengjiayi 提交于
* Change all `data_type` in Python to `dtype` * Change `date_type` in C++ to `dtype` * Refine
-
- 14 11月, 2017 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 08 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Compare Operator * Follow comments
-
- 07 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Use stable_sort in lod_rank_table It is easy to debug and test when use `stable_sort`and the time complexity is not changed. * Add LoDTensorArray * Stash * Better debug message for IsInitialized * Stash * Better debug message for IsInitialized * Complete array read/write op unittests
-
- 06 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Use stable_sort in lod_rank_table It is easy to debug and test when use `stable_sort`and the time complexity is not changed. * Add LoDTensorArray
-
- 04 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Add LoDRankTable LoD Rank Table stores the `level` of `lod` which is ordered by sequence length in descending order. It is useful when implement dynamic RNN and is shared by dynamic RNN memory, dynamic RNN slice input and dynamic RNN slice output operators. * Add InferVarType
-
- 02 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Init commit * Make executor use ProgramDescBind * Change Attribute from BlockDesc to BlockDescBind * Since we will get the program desc in RNN, just BlockDesc is not enough. * Add DeviceContext to Executor API * Rewrite RNN * Pass Python * AddBiasOp does not care num_flatten_dims * Stash * Fix MacOS Compile * Pass RNN forward * add python test * refactor test * Make compile pass * add gradopmaker * First draft done * Polish code * add grad op maker and grad infershape * Polish code * Fix backward.cc bug * Fix infershape * Rename function * add backward test * simplify recurrent test * Update * Pass unittest * Add comments & refine test * Add comments * refactor test * Complete Unittest * fix StepScopes enforce * Remove unused unittest * no type error * Update * Make RNN Pass unittest
-
- 01 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Init commit * Make executor use ProgramDescBind * Change Attribute from BlockDesc to BlockDescBind * Since we will get the program desc in RNN, just BlockDesc is not enough.
-
- 31 10月, 2017 2 次提交
-
-
由 Qiao Longfei 提交于
* improve unique_name, uniq id is related to prefix * fix join
-
由 QI JUN 提交于
* add init_gflags interface * refine code * follow comments
-
- 29 10月, 2017 1 次提交
-
-
由 QI JUN 提交于
* add sparse support for sum op * typo fix * fix gpu build error * fix unittest error * typo fix * infer var type and shape in op_test * follow comments * fix build error * bypass some unittests depend on NetOp * support sparse output for lookup table grad op * refine codes * fix gpu build error * fix lookup table grad gpu kernel * fix ci * fix ci * fix ci * fix bug in lookup_table_grad op * fix bug in test_word2vec * register double kernel for some operators * set is_sparse=True in test_word2vec * fix lookup table grad op CUDA kernel bug * disable test_modified_huber_loss_op temporarily * disable test_lstm_unit_op temporarily
-
- 28 10月, 2017 1 次提交
-
-
由 fengjiayi 提交于
* Add `dump_to_file()` for ProgrameDescBind in pybind * Update * Add utility.py * typo * Fix bugs * Move add_feed/fetch_components to untility.py * Compelete dump * Follow comments * Change output of Prune() from inference to pointer * Expose Prune() to Python * Compelete save/load API of inference model * Fix errors * Debuging * Compelete unit tests * follow comments
-
- 27 10月, 2017 1 次提交
-
-
由 Dong Zhihong 提交于
-