1. 10 Feb, 2022 13 commits
  2. 09 Feb, 2022 19 commits
  3. 08 Feb, 2022 8 commits
    • S
      Make Embedding layer support more int ids type (#39381) · 60f1461a
      sneaxiy authored
      * add more int id type support for embedding
      
      * add ut
      
      * add more ut
      
      * fix ci error
    • H
      Add FuseOptimizerPass and test_dist_fuse_adam_pass unittest. (#39208) · ccdcfa2d
      hlygit66666 authored
      * add fuse_relu_depthwise_conv_pass unittest
      
      * fix atol and rtol
      
      * fix according to review
      
      * Add FuseOptimizerPass and fuse_adam_pass unittest
      
      * add sgd and momentum unittest
      
      * add fuse_optimizer_pass
      
      * close amp
      
      * close amp
      
      * update
      
      * fix run on two cards
      
      * Update test_dist_fuse_adam_pass.py
      
      * Update test_dist_fuse_momentum_pass.py
      
      * Update test_dist_fuse_sgd_pass.py
      
      * Create test_dist_fuse_sgd_pass.py
      
      * Create test_dist_fuse_sgd_pass.py
      
      * Create test_dist_fuse_sgd_pass.py
      
      * Update test_dist_fuse_adam_pass.py
      
      * Update test_dist_fuse_momentum_pass.py
      
      * Update test_dist_fuse_sgd_pass.py
    • J
      [Bug fix] Fixed handling of one of the cases in the quantization process (#39342) · e4d475ea
      joanna.wozna.intel authored
      * Fix quantization next op findings
      
      * Corrections according to the review
    • J
      Fix to #38126 (#39097) · f884edb9
      Jacek Czaja authored
      * - 38126 potential fix
      
      * - fix
      
      * - build fix
      
      * - another candidate fix
      
      * - compilation fix
      
      * - another fix
      
      * - Fix to activation of NHWC being first oneDNN op in chain on oneDNN ops
      
      * - compilation fix
      
      * - added NHWC rotating for elementwise being first op
      
      * - compilation fix
      
      * - compilation fix
      
      * - Added UT
      
      * - cosmetic fixes
    • H
      Update op support gpu impl (#39386) · ba882657
      hong authored
      * find gpu kernel in pten factory; test=develop
      
      * check in functional kernel first; test=develop
    • Z
      ps optimize refactor (#38982) · 196dbfc2
      ziyoujiyi authored
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      Co-authored-by: zkh2016 <zhangkaihuo@baidu.com>
    • Z
      [bf16] add bf16 cuda kernel: concat and split (#39380) · de0bad2a
      zhangbo9674 authored
      * add concat & split
      
      * add concat kernel
      
      * add concat unittest
      
      * add split unittest