1. 22 11月, 2018 1 次提交
  2. 19 11月, 2018 1 次提交
    • Y
      Optimize the layer_norm operator with AVX intrinsic function (#14417) · f4c869d8
      Yihua Xu 提交于
      * Optimize layer_norm operator with AVX intrinsic functions
      
      * Revert the wrong modifications
      
      * Implement the jit kernel for layer_norm operator
      
      * Add math headfile to fix the compile issue (test=develop)
      
      * Add math headfile to fix the compile issue (test=develop)
      
      * Fixed the intrinsic headfile issue (test=develop)
      
      * Fix the conflicts (test=develop)
      
      * Revert for CUDA compiler (test=develop)
      
      * Fixed the cuda depency (test=develop)
      
      * Fix the marco issues (test=develop)
      f4c869d8
  3. 18 11月, 2018 2 次提交
  4. 17 11月, 2018 3 次提交
  5. 16 11月, 2018 12 次提交
  6. 15 11月, 2018 5 次提交
  7. 14 11月, 2018 3 次提交
  8. 13 11月, 2018 1 次提交
  9. 12 11月, 2018 1 次提交
  10. 09 11月, 2018 4 次提交
  11. 08 11月, 2018 7 次提交
    • P
      remove duplicate · 41b423d4
      peizhilin 提交于
      41b423d4
    • P
      merge from develop · dcfab111
      peizhilin 提交于
      dcfab111
    • C
      Fix input<tensor> (#14208) · c5b6573a
      chengduo 提交于
      * fix input<tensor>
      test=develop
      
      * fix split_ids
      test=develop
      
      * ElementwiseMul should not support SelectedRows
      
      * fix scale op
      test=develop
      
      * change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
      
      * fix operator
      
      * refine MultiOutput
      
      * fix MultiOutput
      test=develop
      
      * disable test_dist_save_load
      test=develop
      
      * fix elementwise_op
      test=develop
      
      * add get_sparse_as_op
      test=develop
      
      * add info for check
      test=develop
      
      * rename get_sparse_as_op with extract_rows_as_op.
      test=develop
      
      * elementwise doesn't support selected_rows
      
      * fix regularizer
      
      * remove extract_rows_as
      test=develop
      
      * fix ci
      test=develop
      
      * add test for sum_op
      
      * fix regularizer
      test=develop
      
      *  test=develop
      
      * fix pserver weight decay multi inputs test=develop
      c5b6573a
    • Z
      Revert "cherry picked windows patches." · ba8b5619
      Zhaolong Xing 提交于
      ba8b5619
    • M
      Change the origin VLOG level to 10 times · 0c3227a5
      minqiyang 提交于
      Fix code to support cpplint syntax check
      
      test=develop
      0c3227a5
    • T
      add vaddbias jitcode · 5e64244f
      tensor-tang 提交于
      test=develop
      5e64244f
    • T
      combine jitcode of vscal · 3d950a81
      tensor-tang 提交于
      3d950a81