1. 13 2月, 2023 2 次提交
    • Y
      [Auto Parallel] Fix a bug of dist_scale (#50288) · 7f7e9320
      Yulong Ao 提交于
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Merge dist attrs of Python into C++
      
      * [Auto Parallel] Add back deleted importing
      
      * [Auto Parallel] Add back removed unittest
      
      * [Auto Parallel] Remove type qualifiers of return types
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix a bug of the quant pass
      
      * [Auto Parallel] Fix the code style
      
      * [Auto Parallel] Clear some fluid APIs
      
      * [Auto Parallel] Fix a bug of dist_scale
      7f7e9320
    • R
      optimize setup.py under conda virtual environment (#48211) · 5143b0eb
      risemeup1 提交于
      * optimize setup.py for conda envir
      
      * check python dependency
      
      * optimize code after reviewed
      5143b0eb
  2. 12 2月, 2023 1 次提交
  3. 11 2月, 2023 2 次提交
    • H
      [Tensor Operator] Overload Tensor Operator (#50098) · 14e45f6b
      HongyuJia 提交于
      * init commit
      
      * fix tensor operator*
      
      * fix compile bug
      
      * bug reproduce
      
      * update commit
      
      * polish codes
      
      * fix compile bug
      
      * test begin
      
      * test begin
      
      * compile finish
      
      * restore origin composite_backward_api
      
      * pass local CI
      
      * fix merge error
      
      * fix merge error
      
      * change py_test from GPU->CPU, test custom op
      
      * polish codes, modify prim unittest
      
      * modify prim unittest
      
      * determine phi_tensor_operants location
      
      * polish codes
      
      * add header file
      
      * solve windows unresolved symbol
      
      * fix some CI error
      
      * add overload defination
      
      * fix CI inference and Windows
      
      * polish codes according to reviewers' opinion
      
      * polish codes according to reviewers' opinion
      14e45f6b
    • W
      [TRT] elementwise_add+transpose fusion (#50081) · fd0d4fa4
      Wang Bojun 提交于
      * eleadd_trans first version
      
      log fix
      
      * refine code for linear format, add pass check
      
      * linear format refine and ut fix
      
      * fix ut
      
      * windows ut
      
      * windows ut 2
      
      * move tensorMeta and alloc to configure
      fd0d4fa4
  4. 10 2月, 2023 21 次提交
  5. 09 2月, 2023 14 次提交