- 10 11月, 2017 5 次提交
-
-
由 Dong Zhihong 提交于
-
由 Dong Zhihong 提交于
-
由 Dong Zhihong 提交于
-
由 Dong Zhihong 提交于
-
由 Dong Zhihong 提交于
-
- 08 11月, 2017 12 次提交
-
-
由 Dong Zhihong 提交于
-
由 Dong Zhihong 提交于
-
由 Tao Luo 提交于
enable benchmark resnet with MKLDNN
-
由 Yu Yang 提交于
Polish OpWithKernel
-
由 Cao Ying 提交于
Update annotations of layers.py.
-
由 Yu Yang 提交于
* Add LoDRankTable LoD Rank Table stores the `level` of `lod` which is ordered by sequence length in descending order. It is useful when implement dynamic RNN and is shared by dynamic RNN memory, dynamic RNN slice input and dynamic RNN slice output operators. * Add skeleton for array_to_lod_tensor and lod_tensor_to_array * Add VarType::LoDTensorArray * Add PyBind of LoDTensorArray * Add InferVarType * Add first unittest * Add ut * Add unittest * Add unittest * Add unittests * update * init * add infershape for lod_tensor_to_array_op * compelete array_to_lod_tensor_op * copy data * clean code * clean code * Fix unittest data * fix bugs * fix compile error * Refine TensorToArrayOp * refactor array_to_lod_tensor * Unittest * fix bugs * Fix unittest * Fix unittest * debug * Debug * Fix unittest * clean code * refactor * use ostream * update test * fix gpu build error * make gpu test pass
-
由 Yang Yu 提交于
-
由 Yu Yang 提交于
-
由 Yu Yang 提交于
* Chage `IndicateDataType` to `GetKernelType`. Make it easier to understand. * Change `OpKernelKey` to `OpKernelType` * Make operator developers can customize which kernel the operator will use in runtime.
-
由 Yu Yang 提交于
* Compare Operator * Follow comments
-
由 qingqing01 提交于
-
由 Dong Zhihong 提交于
-
- 07 11月, 2017 23 次提交
-
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 ranqiu 提交于
-
由 Tao Luo 提交于
Fix ci not exit 1 when error
-
由 typhoonzero 提交于
-
由 typhoonzero 提交于
-
由 typhoonzero 提交于
-
由 typhoonzero 提交于
-
由 Dong Zhihong 提交于
-
由 kexinzhao 提交于
* small fix * fix comment * address comment * small fix
-
由 Dong Zhihong 提交于
-
由 Dong Zhihong 提交于
-
由 qingqing01 提交于
Fix cmake error when building with WITH_AVX=OFF.
-
-
由 dangqingqing 提交于
-
由 武毅 提交于
Refine get cuda context
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 Yang yaming 提交于
MemoryHandle* --> MemoryHandlePtr
-
由 Dong Zhihong 提交于
-
由 yangyaming 提交于
-
由 Yu Yang 提交于
* Use stable_sort in lod_rank_table It is easy to debug and test when use `stable_sort`and the time complexity is not changed. * Add LoDTensorArray * Stash * Better debug message for IsInitialized * Stash * Better debug message for IsInitialized * Complete array read/write op unittests * Add unittest, Gradient of array read/write * Follow comments
-