1. 17 4月, 2023 2 次提交
    • Z
      rename_SliceKernel (#52863) · d2b0d63f
      zhangyuqin1998 提交于
      d2b0d63f
    • S
      Add output defs for some kernelsPhi register (#52941) · 23f87442
      Sonder 提交于
      * add register info for eigh and eig_gard
      
      * add sync_batch_norm_op.cu register info
      
      * add lamb output register info
      
      * add unique register info
      
      * change type name
      
      * change type name
      
      * add output register info for check_finite_and_unscale
      
      * update cmake and config file
      
      * add register info for adagrad
      
      * fix build error
      
      * add sync to run_unittests.sh
      
      * add register info for unique_consecutive
      
      * fix build error
      
      * add eigh to STATIC_BUILD_TESTS
      
      * update eig_kernel.cc
      
      * update eig_kernel.cc
      
      * fix infer mate error
      
      * fix unique register error
      
      * fix lamb register info error
      
      * fix lamb register info
      
      * update lamb register info
      
      * fix lamb
      
      * remove one Output Register
      
      * update static build file
      
      * add eigh op to disable_wingpu_test
      
      * update run_unittests
      23f87442
  2. 14 4月, 2023 11 次提交
  3. 13 4月, 2023 12 次提交
  4. 12 4月, 2023 4 次提交
    • Z
      Optimize performance of unique kernel (#52736) · 8cbeefea
      Zhang Zheng 提交于
      * Optimize performance of unique kernel
      
      * fix ci
      8cbeefea
    • W
      [AMP OP&Test] add fp16/bf16 unittest for pool2d op (#52288) · f9b155f9
      Wei Shengyu 提交于
      * add bf16 support and bf16/fp16 unittest for pool2d
      
      * add include files
      
      * dbg
      
      * reformat
      
      * reformat
      
      * modify code according to review comment
      
      * remove duplicate code
      
      * remove dup code
      
      * remove useless include
      
      * dbg
      f9b155f9
    • W
      Patch del (#52754) · 189e0d44
      wangzhen38 提交于
      * [DO NOT MERGE] adadelta lr support
      
      * [DO NOT MERGE] gpu support
      
      * [test] follow torch
      
      * fix acc update order
      
      * for ci
      
      * [bug fix] update master para
      
      * [bug fix] update test
      
      * [bug fix] for ci test
      
      * for ci
      
      * fix xpu
      
      * [adadelta fix] del fluid head file
      
      * for ci
      
      * del notes
      189e0d44
    • G
      [AMP OP&Test] support bf16 for batch norm (#52407) · 523f8a26
      Guoxia Wang 提交于
      * [AMP OP&Test] support bf16 for batchnorm
      
      * codestyle
      
      * Update batch_norm_grad_kernel.cu
      
      * Update batch_norm_kernel.cu
      
      * fix codestyle
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * Update batch_norm_kernel.cc
      523f8a26
  5. 11 4月, 2023 7 次提交
  6. 10 4月, 2023 4 次提交