1. 11 2月, 2022 9 次提交
    • C
      [PTen] Move grad GetExpectedPtenKernelArgs into pten (#39418) · 667bd962
      Chen Weihang 提交于
      * move grad get expected pten kernel args
      
      * fix reduce sum error
      
      * fix element_sub_grad failed
      
      * revert kernel judge change
      667bd962
    • Z
      统一 ps 开发 - python (#39431) · 22c67d14
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      
      * cpu-async-ps minimize test ok & gpu minimize test ok
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      22c67d14
    • W
      [Paddle Inference] support ernie quant model with interleaved (#39424) · 1c44d3e2
      Wangzheee 提交于
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      
      * support ernie quant model with interleaved
      1c44d3e2
    • L
      Add log for executor (#39459) · 7e52beae
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * add log for Executor
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      7e52beae
    • L
      [new-exec] set type of op-kernel op by place (#39458) · 7392578d
      Leo Chen 提交于
      7392578d
    • S
      add print pten kernel tool (#39371) · 8803f6bb
      Shang Zhizhou 提交于
      * test=document_fix;add print pten kernel tool
      
      * test=document_fix
      
      * test=document_fix
      
      * test=document_fix
      
      * test=document_fix
      
      * add print_pten_kernels tool
      
      * add print_pten_kernels tool
      
      * fix windows complie
      
      * notest,test=rocm_ci
      
      * add merge tool
      
      * add comments
      8803f6bb
    • C
      Add profiler node tree implementation (#39316) · f38c2e5c
      chenjian 提交于
      * add event node implementation
      
      * modify profiler.stop interface
      
      * fix according to review
      
      * fix file mode
      
      * modify class method name in event_node.cc
      
      * modify LLONG_MAX to ULLONG_MAX
      
      * fix ci error
      
      * fix ci error
      f38c2e5c
    • Z
      Support different dtypes of inputs for elementwise ops (#38859) · bf305033
      Zhang Ting 提交于
      * improve backward performance
      
      * support different dtypes for elementwise ops
      bf305033
    • Z
      【Pten】Auto-Generate InterMeta register (#39436) · 7d6096ff
      zyfncg 提交于
      * fix code conflict
      
      * generate inter_meta register
      
      * clear cache
      
      * just try
      
      * add sign c++ api
      
      * polish some code
      7d6096ff
  2. 10 2月, 2022 21 次提交
  3. 09 2月, 2022 10 次提交