1. 16 2月, 2022 1 次提交
  2. 15 2月, 2022 24 次提交
  3. 14 2月, 2022 15 次提交
    • C
      9c2cee1c
    • H
      Add Inplace addto pass and unittest. (#39433) · 52af0a60
      hlygit66666 提交于
      * add fuse_relu_depthwise_conv_pass unittest
      
      * fix atol and rtol
      
      * fix according to review
      
      * Update test_dist_fuse_relu_depthwise_conv_pass.py
      
      * add inplace_addto pass and unittest
      52af0a60
    • C
    • S
      [UT] mish op, conv+mish, fc+mish fuse passes (#39340) · 02938b3d
      Sławomir Siwek 提交于
      * mish unit tests
      
      * code format
      
      * remove unused imports
      
      * code format
      
      * remove hard-coded shape values
      
      * remove timeouts
      
      * remove timeouts v2
      
      * restore timeouts
      02938b3d
    • S
      [Bug fix] prevent squashing pair u8 dequantize -> s8 quantize (#39346) · 66b5348e
      Sylwester Fraczek 提交于
      * prevent squashing pair u8 dequantize -> s8 quantize
      
      * add relu op to check for uint8
      
      * fix ptq fc attr name fuse_activation->activation_type
      
      * fix
      
      * add unit test
      
      * remove unused variable
      
      * test fix unsuccessful
      
      * fix test and logic
      
      * multiline comment
      
      * remove cout
      
      * Revert "fix ptq fc attr name fuse_activation->activation_type"
      
      This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.
      
      * fix ptq fc attr name fuse_activation->activation_type
      66b5348e
    • W
      context add generator (#39475) · 463e31f4
      Wilber 提交于
      * context add generator
      
      * update
      463e31f4
    • L
      Add cuda tracer (#39488) · 0790f949
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * add log for Executor
      
      * Add CudaTracer to trace CUDA events
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      0790f949
    • Z
      统一ps:heter ps 二阶段单测通过 (#39468) · 765a2ada
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      
      * cpu-async-ps minimize test ok & gpu minimize test ok
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      765a2ada
    • L
      [NewExe] Ignore eof Exception(#39487) · 2f642159
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * add log for Executor
      
      * Avoid thread reconsruction when EOF
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      2f642159
    • Q
      55da9344
    • C
      [PTen] Add HasAttr for ArgumentMappingContext (#39464) · ddb1e23f
      Chen Weihang 提交于
      * add has_attr for arg map context
      
      * skip useless attr now
      
      * skip attr if not exists
      
      * fix typo
      ddb1e23f
    • z8hanghuan's avatar
      new may of test cases, *test=kunlun (#39444) · e07420b9
      z8hanghuan 提交于
      * new may of test cases, *test=kunlun
      
      * new may of test cases, *test=kunlun
      
      * new may of test cases, *test=kunlun
      e07420b9
    • C
      [pten] add split kernel (#39060) · d0df5632
      chentianyu03 提交于
      * add split kernel
      
      * add split kernel signature
      
      * fix split bug
      
      * modify MakePtenScalarArrayFromVarList
      
      * modify MakePtenScalarArrayFromVarList
      
      * fix split windows register error
      
      * add test case for split kernel
      
      * replace raw split kernel with pten kernel
      
      * fix makeScalar/ScalarArray bug
      
      * remove debug log
      
      * remove int64_t type in buildPtcontext
      
      * update by code review
      
      * fix split dev test failed
      
      * change DenseTensorMeta to MetaTensor
      
      * change split api code from auto gen to manual
      
      * split cuda kernel support bfloat16 type
      
      * fix conflict
      
      * rm raw split kernel
      
      * merge develop branch
      
      * change to pten::errors
      d0df5632
    • T
      fix gather_nd, *test=kunlun (#39283) · d12c3636
      TTerror 提交于
      d12c3636
    • T