1. 31 3月, 2022 6 次提交
    • C
      Add time range duration display (#41029) · 6744754f
      chenjian 提交于
      * no
      
      * fix bugs
      
      * fix doc according to review
      
      * fix api doc format
      
      * fix api doc according to review
      
      * fix bug and add unit test
      
      * fix record event bug
      
      * optimize chrome tracing display
      
      * fix bug
      
      * add comment
      
      * add unit test
      
      * fix a bug
      
      * fix
      
      * fix
      
      * fix format
      6744754f
    • Z
      Opt the compilation of sparse kernel (#41086) · b9da48da
      zhangkaihuo 提交于
      b9da48da
    • Y
      update elementwise unittest style, *test=kunlun (#40779) · 23a69bc7
      ykkk2333 提交于
      23a69bc7
    • P
      support view strategy in eager_fluid state (#40830) · 2f1c1ae5
      pangyoki 提交于
      * support view strategy in eager_fluid state
      
      * little change
      
      * little change
      
      * optimize unittest
      
      * fix
      2f1c1ae5
    • P
      fix eager_gen node bug (#41165) · 56493c9e
      pangyoki 提交于
      56493c9e
    • P
      Support inplace strategy for pylayer (#41043) · 11d1a51a
      pangyoki 提交于
      * Supported Complex2Real Conversion for Eager Dygraph
      
      * Supported Complex2Real Conversion for Eager Dygraph
      
      * Enabled complex type promotion test for matmul_v2
      
      * pylayer, test=develop
      
      * Fix CI issues
      
      * Support initializing specific grad tensors to zero for selected operators
      
      * finish forward, test=develop
      
      * create grad node finish, test=develop
      
      * Merged adj_edges_ with GradSlotMeta
      
      * Fixed monir issue
      
      * backward finish, start dbg, test=develop
      
      * Adjusted num runs
      
      * Recovered Eager performance tests configurations
      
      * Recovered Eager performance tests configurations
      
      * finish, test=develop
      
      * polish, test=develop
      
      * polish, test=develop
      
      * refine, test=develop
      
      * eager, test=develop
      
      * Adjusted performance tests configurations
      
      * Fixed Minor Issues with performance tests
      
      * [Phi] Fix macro name typo
      
      * support set_materialize_grads, test=develop
      
      * suppotr mark_non_differentiable, test=develop
      
      * support once_differentiable, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * Moved out Edge from GradSlotMeta
      
      * Fixed issues from merge
      
      * Fixed typo
      
      * Addressed review comments
      
      * Fixed merge issues
      
      * Fixed minor issues
      
      * Fixed minor issue
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * Fixed major issues and enabled auto_prune test cases
      
      * Fixed issues from merge
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support inplace for pylayer
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
      11d1a51a
  2. 30 3月, 2022 18 次提交
  3. 29 3月, 2022 8 次提交
    • J
      Update of oneDNN to 2.5 (#39426) · 35b96d48
      Jacek Czaja 提交于
      * - update of oneDNN to 2.5
      
      * - changes to UT testing onednn verbose
      
      * - Update of oneDNN to 2.5.3
      
      * - update onednn to 2.5.4
      35b96d48
    • R
      [MoE] Moe apis (#40895) · aeade538
      Roc 提交于
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * add op about moe gate
      
      update utils
      
      add limit by capacity op
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      * fix for win
      
      * fix bugs in test_limit_by_capacity_op
      
      * update ut
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * update(fix) ut for win
      
      * moe apis in incubate
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * fix for win
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * fix ut for number count
      
      * add apis and utils
      
      * add gate apis
      
      * add moe and grad clip apis
      
      * update moe apis
      
      * add ops for moe gate
      
      * fix
      
      * update for base moe layer api
      
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * fix for dygraph
      
      * update with ranodm routing
      
      * update
      
      * fix ut for limit by capacity
      
      * update
      Co-authored-by: Nhlygit66666 <2570058140@qq.com>
      aeade538
    • W
      add elementwise sub and elementwise div in tensorrt op teller (#40806) · f3022dfa
      wangxinxin08 提交于
      * add elementwise sub and elementwise div in tensorrt op teller
      
      * add unittest of elementwise mul, sub and div
      f3022dfa
    • Z
      Add Sparse op sparse_relu (#40959) · c544a181
      zhangkaihuo 提交于
      c544a181
    • T
      Revert "Move some activation to phi (#40727)" (#41056) · 05f3d48e
      tianshuo78520a 提交于
      This reverts commit e77a947e.
      05f3d48e
    • Z
    • Z
      [MLU]add reduce op mlu kernel (#41028) · d1c1d731
      zn 提交于
      d1c1d731
    • A
      [Eager]Switch new Eager mode (#40990) · 55f9b71a
      Aurelius84 提交于
      * [Eager]Switch new Eager mode
      
      * switch into eager
      
      * fix typo
      55f9b71a
  4. 28 3月, 2022 8 次提交
    • H
      Move meshgrid to phi (#40994) · ca871957
      hong 提交于
      * move momentum, rmsprop to phi; test=develop
      
      * update
      
      * update
      
      * update
      
      * update
      
      * udpate; test=develop
      
      * fix xpu npu bugs; test=develop
      
      * fix npu bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows compile error; test=develop
      
      * polish code; test=develop
      
      * fix conflict; test=develop
      
      * add meshgrid;
      
      * update
      
      * polish code
      
      * polish code;
      
      * fix bug
      
      * format; remove useless code
      
      * fix npu bug
      
      * fix bug
      ca871957
    • H
      Move some activation to phi (#40727) · e77a947e
      hong 提交于
      * update
      
      * add forward case
      
      * update
      
      * update; test=develop
      
      * add some grad kernel; test=develop
      
      * move gpu kernel; test=develop
      
      * update
      
      * update;
      
      * update test;
      
      * fix selected rows bug;
      
      * add mix vector include ;
      
      * add mixed vector depen; test=develop
      
      * add logit grad signature;
      
      * polish code
      
      * fix bug;
      
      * add namespace for abs
      
      * revert code
      
      * not move softsign
      
      * revmove duplate register;
      
      * fix softsign bug
      
      * polish code
      
      * format
      
      * format
      
      * fix bug
      
      * remove cmake dep
      
      * add square sqrt selected rows support
      
      * update
      
      * remove clip norm
      
      * add standalone executor sqrt dep
      
      * standalone exec denp sqrt
      
      * remove sqrt op in cmkaelist
      
      * open some case
      e77a947e
    • H
      [Dygraph] Add unittests for DataParallel in eager mode (#40709) · 62af5903
      Haohongxiang 提交于
      * add uts for EagerReducer
      
      * add more uts
      
      * fix bugs
      
      * fix bugs
      
      * modify
      
      * modify uts
      
      * fix bugs
      
      * update
      
      * update
      
      * update
      
      * solve conflicts and merge
      
      * add some other uts
      
      * modify time of uts
      
      * update
      
      * update
      
      * update
      
      * remove uts of resnet
      62af5903
    • J
      [Auto parallel] Mixed Precision FP16 Pass (#40615) · b99c1d07
      JZ-LIANG 提交于
      *  add FP16 Pass 
      
      * Support the auto completion of while_op
      
      *  acc aligned
      b99c1d07
    • C
      Fix profiler package bug (#40888) · 77a455c7
      chenjian 提交于
      * no
      
      * fix bugs
      
      * fix doc according to review
      
      * fix api doc format
      
      * fix api doc according to review
      
      * fix bug and add unit test
      
      * fix record event bug
      77a455c7
    • K
      Add window computation in stft op. (#40987) · c049a6b4
      KP 提交于
      c049a6b4
    • C
      [Auto Parallel] Update reshard (#40865) · d101334c
      caozhou 提交于
      * fix code stype
      
      * update unitest
      d101334c
    • W
      Update ResNet test cases (#40953) · 023d8771
      Weilong Wu 提交于
      023d8771