- 20 11月, 2018 6 次提交
- 19 11月, 2018 8 次提交
-
-
由 qingqing01 提交于
* Modify some infer-shape in compile-time.
-
由 Yihua Xu 提交于
* Optimize layer_norm operator with AVX intrinsic functions * Revert the wrong modifications * Implement the jit kernel for layer_norm operator * Add math headfile to fix the compile issue (test=develop) * Add math headfile to fix the compile issue (test=develop) * Fixed the intrinsic headfile issue (test=develop) * Fix the conflicts (test=develop) * Revert for CUDA compiler (test=develop) * Fixed the cuda depency (test=develop) * Fix the marco issues (test=develop)
-
由 hjchen2 提交于
-
由 Superjomn 提交于
test=develop
-
由 qingqing01 提交于
* Convolution fusion operator. * Clean code test=develop
-
由 Yu Yang 提交于
test=develop
-
由 Wu Yi 提交于
* fix dist deps test=develop * update test=develop * update test=develop * update test=develop * update test=develop
-
由 Yu Yang 提交于
test=develop
-
- 18 11月, 2018 1 次提交
-
-
由 Jacek Czaja 提交于
test=develop
-
- 17 11月, 2018 4 次提交
-
-
由 tensor-tang 提交于
test=develop
-
由 Jacek Czaja 提交于
test=develop
-
由 tensor-tang 提交于
test=develop
-
由 tensor-tang 提交于
test=develop
-
- 16 11月, 2018 21 次提交
-
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
test=develop
-
由 superjomn 提交于
the parameters will load from CPUPlace, that will keep copying data between CPU and GPU places. test=develop
-
由 Wu Yi 提交于
* wip simplify operator framework * wip * wip * done test=develop * clean test=develop * fix test=develop * fix deps test=develop * fix cpu build test=develop * fix tensorrt build test=develop * fix tests test=develop * fix test=develop * fix cpu build test=develop
-
由 Tomasz Patejko 提交于
test=develop
-
由 Jiabin Yang 提交于
* fix space_to_depth_op unicode problem * test=develop
-
由 Jacek Czaja 提交于
test=develop - Added profiling to softmax functors - MKL based softmax inference op - Fix to softmax compuation via MKL - cleaning - Cosmetic fixes to softmax MKL - Fix to ON_INFER lack of propagation
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
-
由 Tomasz Patejko 提交于
* implements reachability check between identity node and non-identity argument to elementwise_add * implements handling identity node as x and as y argument to elementwise_add
-
由 whs 提交于
* Fix truncated normal. * Fix. * Make nce support more distribution. * Fix API.spec. * Fix python API. * Fix. test=develop * Fix API.spec test=develop * Fix sampler. * Fix order of arguments in python API. test=develop
-
由 tensor-tang 提交于
test=develop
-
由 Yu Yang 提交于
test=develop
-
由 Yu Yang 提交于
test=develop
-
由 hjchen2 提交于
-
由 Wu Yi 提交于
* add cudnn ctc loss * wip add test test=develop * wip * wip * done test=develop * move include cudnn test=develop * test test=develop * fix build test=develop * fix build test=develop * fix build on cudnn5 test=develop * fix cudnn5 build test=develop * fix cudnn5 build test=develop * merge develop softmax functor change test=develop
-
由 tensor-tang 提交于
test=develop
-