1. 24 4月, 2023 5 次提交
  2. 23 4月, 2023 4 次提交
  3. 22 4月, 2023 1 次提交
    • W
      [Zero-Dim] support output 0D for... · b406a7db
      wangfengsheng1999 提交于
      [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850)
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * add test case
      
      * modify dot/metric.accuracy/static.accuracy/static.auc
      
      * modfiy inner/tensordot bug
      
      * test 9 api
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase
      
      * fix bug
      
      * support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy
      
      * code style
      
      * fix bug
      
      * fix test_dot_op bug
      
      * fix accuracy bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * codestyle
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * code style
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * modify code
      b406a7db
  4. 21 4月, 2023 7 次提交
  5. 20 4月, 2023 5 次提交
  6. 19 4月, 2023 5 次提交
  7. 18 4月, 2023 9 次提交
  8. 17 4月, 2023 4 次提交
    • Z
      [Paddle-Inference] Add cutlass conv2d_depthwise (#51792) · bd3b096a
      zhoutianzi666 提交于
      * initial commit for cutlass_teller
      
      * second commit for cutlass_teller
      
      * add conv2d_depthwise python template
      
      * add conv2d_depthwise cutlass template
      
      * /zhoukangkang/paddle_cutlass/Paddle/paddle/fluid/framework/ir/cutlass_teller.h
      
      * refine code in Conv2dFusionCanSupport
      
      * add macro in cutlass_teller.h
      
      * add 3x3 5x5 teller
      
      * add groups not 1 or conv2d_depthwise teller
      
      * 只生成ic是8的倍数的conv2d_depthwise 的kernel
      
      * add EXPLICIT in cutlass_teller.h
      
      * final commit
      
      * add split_k_slices in conv2d_depthwise
      
      * make stages == 2
      
      * 重构部分代码
      
      * add CutlassFusionType
      
      * solve illegal memory
      
      * make stride_h=stride_w && make dilation==1
      
      * must check HasAttr(use_cutlass) before GetAttrIfExists
      
      * add CONV2D_DEPTHWISE_BIAS_SILU to OpType2String
      
      * modify decl.h and util.cu
      bd3b096a
    • C
      [Fused] controlled randomness for fused dropout add (#52903) · e36f80c6
      Chitsing KUI 提交于
      * add random control for fused dropout add
      
      * add __init__
      e36f80c6
    • V
      [AMP OP&Test]Add BF16 implementation and unit tests of multinomial (#52898) · d19d2486
      Vvsmile 提交于
      * fix multinomial
      
      * fix test_elementwise
      
      * fix convert_float_to_uint16
      
      * aadd test_multimial_op
      
      * fix code style
      d19d2486
    • T
      【PaddlePaddle Hackathon 4 No.49】:为 Paddle bce_loss 支持 float16 数据类型 (#50930) · 44e6de98
      thunder95 提交于
      * untracked files
      
      * bce_loss_fp16
      
      * remove unused files
      
      * back max_rel_erro still big
      
      * simplify code
      
      * upd
      
      * fix max_relative_error
      
      * restart ci
      
      * Update test_bce_loss.py
      
      * Update test_bce_loss.py
      
      * Update test_bce_loss.py
      
      * Update test_bce_loss.py
      
      * try to pass test
      
      * restore file
      
      * remove error value
      
      * fix bug
      
      ---------
      Co-authored-by: NZhang Ting <Douyaer2020@qq.com>
      44e6de98