1. 31 3月, 2022 11 次提交
    • L
      [KP] fix bug in phi kp (#41069) · ac5548a2
      Liu-xiandong 提交于
      * [KP] fix bug in phi kp
      
      * delete useless comment
      
      * update
      
      * update
      
      * choose the xpu kp kernel in phi
      ac5548a2
    • L
      add_autotune_kernel_tool (#40658) · 7c5dca9f
      limingshu 提交于
      * for 1st time interface combine.
      
      * modification with kernel factory
      
      * first auto_tune version.
      
      * first version.
      
      * basic version
      
      * add warm up step.
      
      * a debug version.
      
      * optimize the functionality of class auto_tuner.
      
      * add some quotes for optimized auto_tuner class.
      
      * add some quotes for optimized auto_tuner class.
      
      * add namespace.
      
      * modification according to the advices
      
      * replace fluid header with phi header.
      
      * replace fluid header with phi header.
      7c5dca9f
    • Z
      Restrict compilation conditions of optimized topk kernel (#41153) · dea24544
      Zhang Zheng 提交于
      * Restrict compilation conditions of optimized topk kernel
      
      * fix
      dea24544
    • W
      add weight unfold pass and handle trt fc op (#41088) · bdef57cd
      Wilber 提交于
      * add weight unfold pass and handle trt fc op
      
      * update
      
      * add kernel
      
      * update
      
      * update
      bdef57cd
    • W
      remove shape check (#41143) · 4b9e748a
      wenbin 提交于
      4b9e748a
    • P
      support view strategy in eager_fluid state (#40830) · 2f1c1ae5
      pangyoki 提交于
      * support view strategy in eager_fluid state
      
      * little change
      
      * little change
      
      * optimize unittest
      
      * fix
      2f1c1ae5
    • P
      fix eager_gen node bug (#41165) · 56493c9e
      pangyoki 提交于
      56493c9e
    • P
      Support inplace strategy for pylayer (#41043) · 11d1a51a
      pangyoki 提交于
      * Supported Complex2Real Conversion for Eager Dygraph
      
      * Supported Complex2Real Conversion for Eager Dygraph
      
      * Enabled complex type promotion test for matmul_v2
      
      * pylayer, test=develop
      
      * Fix CI issues
      
      * Support initializing specific grad tensors to zero for selected operators
      
      * finish forward, test=develop
      
      * create grad node finish, test=develop
      
      * Merged adj_edges_ with GradSlotMeta
      
      * Fixed monir issue
      
      * backward finish, start dbg, test=develop
      
      * Adjusted num runs
      
      * Recovered Eager performance tests configurations
      
      * Recovered Eager performance tests configurations
      
      * finish, test=develop
      
      * polish, test=develop
      
      * polish, test=develop
      
      * refine, test=develop
      
      * eager, test=develop
      
      * Adjusted performance tests configurations
      
      * Fixed Minor Issues with performance tests
      
      * [Phi] Fix macro name typo
      
      * support set_materialize_grads, test=develop
      
      * suppotr mark_non_differentiable, test=develop
      
      * support once_differentiable, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * Moved out Edge from GradSlotMeta
      
      * Fixed issues from merge
      
      * Fixed typo
      
      * Addressed review comments
      
      * Fixed merge issues
      
      * Fixed minor issues
      
      * Fixed minor issue
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * Fixed major issues and enabled auto_prune test cases
      
      * Fixed issues from merge
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support inplace for pylayer
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
      11d1a51a
    • A
      move inplace_version_counter_ location (#41146) · a09058b2
      Aganlengzi 提交于
      a09058b2
    • L
      Pg heter cloud (#40911) · 92faeedf
      lilong12 提交于
      92faeedf
    • [Infrt] add result check for some infrt op. (#41167) · ec510bfd
      王明冬 提交于
      ec510bfd
  2. 30 3月, 2022 29 次提交