- 19 Nov, 2018 6 commits
-
-
Committed by Yihua Xu
* Optimize layer_norm operator with AVX intrinsic functions
* Revert the wrong modifications
* Implement the jit kernel for layer_norm operator
* Add math header file to fix the compile issue (test=develop)
* Add math header file to fix the compile issue (test=develop)
* Fixed the intrinsic header file issue (test=develop)
* Fix the conflicts (test=develop)
* Revert for CUDA compiler (test=develop)
* Fixed the CUDA dependency (test=develop)
* Fix the macro issues (test=develop)
-
Committed by hjchen2
-
Committed by qingqing01
* Convolution fusion operator.
* Clean code test=develop
-
Committed by Yu Yang
test=develop
-
Committed by Wu Yi
* fix dist deps test=develop
* update test=develop
* update test=develop
* update test=develop
* update test=develop
-
Committed by Yu Yang
test=develop
-
- 16 Nov, 2018 14 commits
-
-
Committed by tensor-tang
test=develop
-
Committed by Wu Yi
* wip simplify operator framework
* wip
* wip
* done test=develop
* clean test=develop
* fix test=develop
* fix deps test=develop
* fix cpu build test=develop
* fix tensorrt build test=develop
* fix tests test=develop
* fix test=develop
* fix cpu build test=develop
-
Committed by Jiabin Yang
* fix space_to_depth_op unicode problem
* test=develop
-
Committed by whs
* Fix truncated normal.
* Fix.
* Make nce support more distributions.
* Fix API.spec.
* Fix python API.
* Fix. test=develop
* Fix API.spec test=develop
* Fix sampler.
* Fix order of arguments in python API. test=develop
-
Committed by tensor-tang
test=develop
-
Committed by Yu Yang
test=develop
-
Committed by Yu Yang
test=develop
-
Committed by hjchen2
-
Committed by Wu Yi
* add cudnn ctc loss
* wip add test test=develop
* wip
* wip
* done test=develop
* move include cudnn test=develop
* test test=develop
* fix build test=develop
* fix build test=develop
* fix build on cudnn5 test=develop
* fix cudnn5 build test=develop
* fix cudnn5 build test=develop
* merge develop softmax functor change test=develop
-
Committed by tensor-tang
test=develop
-
Committed by Yan Chunwei
* fix inference on GPU running out of memory
  the transfer logic in operator.cc will keep creating new scopes.
-
Committed by tensor-tang
* rename and fix blas vsqr test=develop
* update
-
Committed by hjchen2
-
Committed by hjchen2
-
- 15 Nov, 2018 16 commits
-
-
Committed by tensor-tang
-
Committed by tensor-tang
-
Committed by tensor-tang
-
Committed by minqiyang
test=develop
-
Committed by minqiyang
test=develop
-
Committed by minqiyang
test=develop
-
Committed by Sylwester Fraczek
* add is_test to pooling and activations
  add prop_kind support for layers activation, conv and pooling
  add a pass that sets is_test to true
  add transpiler version of is_test pass
  test=develop
* patch test and pass test=develop
* add pass to analyzer.h test=develop
* add is_test attr description & pass only on mkldnn in: activation_op.cc batch_norm_op.cc conv_op.cc dropout_op.cc lrn_op.cc pool_op.cc sequence_pool_op.cc softmax_op.cc
* fix is_test handling for activation pool and conv
* change description of is_test for all layers again
* remove GetAttr(use_mkldnn) from pass
* rename correct_mkldnn_test_phase to is_test and remove dependency on MKLDNN test=develop
* review fix magic number
* two if(..)s into one
* Check is_test once and pass mkldnn forward prop kind
* dereference shared_ptr with * (without get()) test=develop
* add is_test_pass back test=develop
-
Committed by chengduo
* add selu
* use for range test=develop
* add API test=develop
* follow comment test=develop
* update API.spec test=develop
-
Committed by minqiyang
test=develop
-
Committed by Sylwester Fraczek
test=develop
-
Committed by Yu Yang
Clean allocation->Deleter test=develop
-
Committed by tensor-tang
test=develop
-
Committed by tensor-tang
test=develop
-
Committed by Yu Yang
-
Committed by Tao Luo
test=develop
-
Committed by Yiqun Liu
* Refine the tester for MixedRTPredictor. test=develop
* Enable the profiler in TensorRT engine.
* Support the use of combined inference model in TensorRT unittest, and print the shape of feed targets.
-
- 14 Nov, 2018 4 commits
-
-
Committed by peizhilin
-
Committed by Jacek Czaja
test=develop
-
Committed by Tao Luo
test=develop
-
Committed by nhzlx
test=develop
-