1. 18 Jul, 2023 5 commits
  2. 17 Jul, 2023 1 commit
  3. 14 Jul, 2023 1 commit
    • C
      [AutoTuner] Distribute best cfg (#54834) · 7f6d222f
      caozhou committed
      * distribute best cfg
      
      * adapt to multi args transmission
      
      * update metric extracting
      
      * fix bugs of prune and reading log
      
      * fix time default value
      
      * remove time record
      
      * adjust the order of searching dim
      
      * fix prune bugs
      
      * fix adding cfg bug
      
      * fix multi nodes bug
      
      * reset status
      
      * remove alarm and set logdir
      
      * deepcopy ctx
      
      * change alarm
      
      * fix restart bug
      
      * add exit
      
      * best no need alarm
      
      * add warmup time
  4. 13 Jul, 2023 7 commits
  5. 12 Jul, 2023 1 commit
    • Y
      [ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7
      YangQun committed
      * squash pick the poc code
      * fix build after rebase
      * fix int8 conv and fc uts
      * Fix and clean-up Get_SRC_Scale_Memory
      * fix floating point fc uts
      * fix test_analyzer_int8_googlenet
      * test_analyzer_int8_mobilenetv1
      * fix int8 mobilenet v2 and v3
      * fix build error after rebase
      * [oneDNN] rename library version
      * fix conv bias datatype
      * try to fix import error
      * fix rebase error
      * [oneDNN] pack library into python wheel
      * add MKLDNN_SHARED_LIB_3 to env_dict
      * fix test_analyzer_bert
      * fix fill_constant op kernel
      * fix ernie and matmul op ut
      * fix softplus ut
      * fix conv+relu6 fusion ut
      * fix hardswish fusion
      * fix quant+transpose fusion ut
      * fix sgd ut
      * fix int8 matmul with flatten
      * fix fc+scale fusion
      * fix conv/matmul+gelu fusion uts
      * fix rebase error
      * Revert "fix conv/matmul+gelu fusion uts"
      This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
      * upgrade to onednn v3.1
      * remove older version onednn
      * use densetensor::data() for achieving mean and var in layernorm impl
      * comments for atol of integer tests
      * fix clang-format
      * Revert "remove older version onednn"
      This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
      * improve binary handle
      * fix expand kernel
      * Revert "use densetensor::data() for achieving mean and var in layernorm impl"
      * always use forward_inference for conv
      * remove activation scales
      * rollback changes to mkldnn.cmake
      * address comments
      * port changes to dequantize kernel
      * fix merge error
      * fix fused_elementwise_kernel
      * upgrade onednn version to v3.1.1
      * fix some approval error
      * fix error msg format
      * remove old onednn libs
      * try to fix symbolic link issue
      * fix cinn test case segfault
      * do not explicit link test with onednn
      * remove unnecessary changes
      * integrate CINN with onednn v3
      * link with mkldnn project
      * fix cinn build file
      
      ---------
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
      Co-authored-by: Chen, Xinyu1 <xinyu1.chen@intel.com>
      Co-authored-by: tianshuo78520a <707759223@qq.com>
  6. 11 Jul, 2023 7 commits
    • P
      support sharding parallel (#54634) · b7a05057
      pangengzheng committed
      * support sharding parallel
      
      * fix name
      
      * fix
      
      * update
      
      * test amp for sharding
      
      ---------
      
      Co-authored-by: pangengzheng <pangengzheng.baidu.com>
    • M
      DOCS: Adding information about datatype in math.py (#55297) · ab73b8c6
      Muhammad Ishaque Nizamani committed
      * DOCS: Adding information about datatype in math.py
      
      * replaced uint16 with bfloat16.
    • W
      Pipeline pass base (#55174) · 5434560a
      Wennie396 committed
      * format correction
      
      * variable names adjustment
      
      * variable names adjustment, name-->type, value-->sub_program
    • L
      replace the AdagradOptimizer... · 94365855
      LoneRanger committed
      replace the AdagradOptimizer, AdamaxOptimizer, AdadeltaOptimizer, RMSPropOptimizer, LambOptimizer and Momentum (#54152)
      
      * replace the AdadeltaOptimizer with Adadelta
      
      * replace the RMSPropOptimizer with RMSProp
      
      * replace the LambOptimizer with Lamb
      
      * replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug of Lamb
      
      * fix bug of Lamb
      
      * fix bug of import
      
      * replace the AdamaxOptimizer with Adamax and change the optimizer base for AdagradOptimizer
      
      * fix bug
      
      * fix bug
      
      * Update optimizer.py
      
      * fix bug
      
      * fix bug
    • M
      Integrate rmsnorm kernel (#54998) · 97d3d6ee
      MarDino committed
      * add rmsnorm kernel
      * add static graph test
      * fix round type
      * use alignas to avoid msvc compile error
      * remove redundant headerfile to avoid rocm compile error
      * fix rocm compile not found cub
      * Add document
    • F
      Linear compress (#55128) · f4290a92
      FormlessUnit committed
      * rename weight_only/llm.int8
    • Q
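      Competition task 7: develop the grad_fn and next_functions APIs and expose them to the Python side, v1 (#54838) · ab46b14c
      qiuwenbo committed
      * [Experiment] add an attribute to tensor whose value is the constant 1

      * expose gradnode and add a new gradnode method (for testing) that is exposed to Python and accessible from the Python side

      * develop the grad_fn and next_functions APIs and expose them to the Python side; apply some standardization cleanup

      * add a unit test

      * improve code style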
  7. 10 Jul, 2023 3 commits
  8. 08 Jul, 2023 1 commit
  9. 07 Jul, 2023 3 commits
  10. 06 Jul, 2023 8 commits
  11. 05 Jul, 2023 3 commits