1. 03 3月, 2022 5 次提交
    • T
      cinn_launch_op: switch to execution by PE (#39911) · 167d511f
      TeFeng Chen 提交于
      * swith to PE execution in cinn launch
      
      * fix outer variables erased
      
      * skip the map bug temporarily for test
      
      * temporary solution for batch_norm bug
      
      * update comment
      
      * fix compile error
      
      * cinn_instruction_run_op_test: update code to skip external alloc/free instructions generated
      167d511f
    • R
      [CustomRuntime] migrate CustomRuntime into phi (#39908) · b4665d23
      ronnywang 提交于
      b4665d23
    • C
      2ffa6436
    • L
      Workqueue threadnames (#40035) · b8a16911
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * Set thread name for WorkQueue
      
      * Add thread names
      
      * fix ut
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      b8a16911
    • H
      Move bn to pten (#39347) · ebd0f512
      hong 提交于
      * add bn cpu version; test=develop
      
      * move batch norm to pten
      
      * move batch norm to pten; test=develop
      
      * fix bug; test=develop
      
      * fix func::tranpose depend bug; test=develop
      
      * fix compile bugs; test=develop
      
      * fix use_op batch_norm bug; test=develop
      
      * fix cudnn bn add relu test; test=develop
      
      * fix pten context build and double grad bug; test= develop
      
      * remve useless code; test=develop
      
      * add batch norm gpu fp16 support; test=develop
      
      * fix test bn op bug; test=develop
      
      * remove output dtype set; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix applay pass to program bug; test=develop
      
      * revert to develop; test=develop
      
      * fix rocm bug; test=develop
      
      * revert operator to develop; test=develop
      
      * fix pre_commit; test=develop
      
      * fix statci check error; test=develop
      
      * resolve conflict; test=develop
      
      * ana batch norm bug;
      
      * revert batch norm op
      
      * resolve conlict
      
      * fix nan inf and speed bug; test=develop
      
      * fix bug; test=develop
      
      * fix error; test=develop
      
      * test expand op; test=develop
      
      * fix bug; test=develop
      
      * resolve confilct
      
      * resolve confilct; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
      
      * change mutable data to ctx alloc; test=develop
      
      * make format same with ci; test=develop
      
      * fix format error with ci; test=develop
      ebd0f512
  2. 02 3月, 2022 4 次提交
    • H
      Move sgd to phi (#40045) · f3d54e2e
      hong 提交于
      * move sgd to phi; test=develop
      
      * update
      
      * add sgd kernel; test=develop
      f3d54e2e
    • A
      [XPU] Fix Phi Kernel cache problem in operator.cc (#40044) · 66196573
      Aurelius84 提交于
      * [XPU] Fix Phi Kernel cache problem in operator.cc
      
      * fix typo
      66196573
    • Z
      new fleet_desc builder (#39948) · 1c4e3e5d
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      
      * cpu-async-ps minimize test ok & gpu minimize test ok
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * sync/geo test ok & fix heter_worker program ok
      
      * .
      
      * new fleet desc generator
      
      * new fleet_desc builder
      
      * new fleet_desc builder
      
      * .
      
      * .
      
      * correct ps.proto compile
      
      * .
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      1c4e3e5d
    • C
      Upgrade new profiler (#39984) · 0c3f7fbc
      chenjian 提交于
      * add new profiler components
      
      * fix bug
      
      * upgrade new profiler
      
      * fix operator.cc
      
      * fix operator.cc
      
      * fix cmakelists.txt
      
      * fix bug
      
      * fix according to pr
      
      * fix bug
      
      * fix cmake
      
      * fix bug
      
      * fix a bug
      
      * fix bug
      
      * fix bug
      0c3f7fbc
  3. 01 3月, 2022 7 次提交
  4. 28 2月, 2022 5 次提交
  5. 25 2月, 2022 2 次提交
  6. 24 2月, 2022 6 次提交
  7. 23 2月, 2022 4 次提交
  8. 22 2月, 2022 7 次提交