- 07 May, 2018 5 commits
-
-
Committed by typhoonzero
-
Committed by Kexin Zhao
-
Committed by Kexin Zhao
-
Committed by Yiqun Liu
Add the call of DropKids at the end of executor.Run to delete the local scopes created in operators (#10403) * Add the call of DeleteScope to delete the memory of scope created by NewScope. * Call DropKids at the end of executor.Run to delete all local scopes created in operators.
-
Committed by Darcy
* adding operator to blas deps list * use device_context instead to solve cycle deps
-
- 06 May, 2018 1 commit
-
-
Committed by dzhwinter
* "optimizer op support float64" * "fix ci" * "fix ftrl op"
-
- 05 May, 2018 9 commits
-
-
Committed by typhoonzero
-
Committed by typhoonzero
-
Committed by chengduoZH
-
Committed by chengduoZH
-
Committed by chengduoZH
-
Committed by fengjiayi
-
Committed by Lei Wang
-
Committed by Kexin Zhao
-
Committed by Siddharth Goyal
-
- 04 May, 2018 14 commits
-
-
Committed by chengduoZH
-
Committed by Yancey
* fix remove op * update
-
Committed by fengjiayi
-
Committed by fengjiayi
-
Committed by fengjiayi
-
Committed by fengjiayi
-
Committed by typhoonzero
-
Committed by chengduoZH
-
Committed by Qingsheng Li
* Added auto transform to beam_search_decode_op * Added some comment * Added unittest for beam_search_decode_op on GPU
-
Committed by Abhinav Arora
* Fix CPPLint issues in gru_kernel.h * Fix CPPLint issues in gru_kernel.h * Fix Compile error
-
Committed by Yu Yang
-
Committed by Lei Wang
-
Committed by Kexin Zhao
-
Committed by Kexin Zhao
* add test float16 inference accuracy example * complete the test * clean code * add argument parse and refine tests * add shell script * add float16 benchmark code * refine code * prepare for contrib/float16 * put things in contrib float16 folder * update benchmark result * further update benchmark report * add float16 inference report * update report
-
- 03 May, 2018 11 commits
-
-
Committed by Luo Tao
-
Committed by Yu Yang
-
Committed by chengduo
-
Committed by Yu Yang
-
Committed by xzl
-
Committed by chengduoZH
-
Committed by typhoonzero
-
Committed by dzhwinter
* "fix double type error" * "fix ci" * "softmax fp64" * "fix momentum" * "fix ci"
-
Committed by Yiqun Liu
* Fix the bug when an input variable of op is dispensable. * Add HasInputs/Outputs interfaces to OperatorBase. * Remove the unreferenced header file.
-
Committed by Tomasz Patejko
* Initial implementation of forward pass for MKLDNN batch norm * Added attributes for MKLDNN batch norm * MKLDNN batch norm forward pass passes unittest. Started working on backward * Backward pass for MKLDNN batch norm added * MKLDNN batch norm: scoring added to forward pass * MKLDNN batch norm: bias as input added; handling AnyLayout when kernel is looked up * MKLDNN batch norm: python unit tests added; mkldnn tests removed * MKLDNN batch norm: changes required by cpplint * MKLDNN batch norm: refactoring the operator * MKLDNN batch norm: saved variance inversed in backward pass for correct execution of MKLDNN unit tests * MKLDNN batch norm: refactoring, function for static/const cast to void* added * MKLDNN batch norm: remove AnyLayout from batch norm * MKLDNN batch norm: only NCHW format is supported. Unittests refactored * MKLDNN batch norm: use_mkldnn added to attributes * MKLDNN batch norm: AnyLayout removed from unittest * MKLDNN batch norm: added CUDNN defines to batch norm * MKLDNN batch norm: undefined data_format variable corrected * MKLDNN batch norm: use_cudnn added, use of setUp method for configuring attributes * MKLDNN batch norm: added use_cudnn attribute to batch norm operator * MKLDNN batch norm: correcting batch norm unit tests for MKLDNN * MKLDNN batch norm: MKLDNN tests moved to another file; reverting changes for saved variance not being inverted * Change default layout to NCHW * MKLDNN batch norm: init_kernel_type method added to unit tests * MKLDNN batch norm: style changes * MKLDNN batch norm: unit tests refactored * MKLDNN batch norm: added use_mkldnn attribute to batch norm python interface
-
Committed by chengduo
* fix __shfl_down_sync_ of cross_entropy * use reduceSum * "fix ci"
-