1. 05 Apr, 2022 1 commit
    • Z
      Implement AutoTuneStatus class for Kernel Auto Tune (#41218) · b0f8000e
      Zhang Ting committed
      * switch autotune
      
      * implement AutoTuneCache
      
      * implement AutoTuneCache class
      
      * add pybind api
      
      * add dygraph test
      
      * support static mode and eager mode and improve unittests
      
      * rename the SwitchAutoTune Class and improve tests
      
      * improve AutoTuneStatus and reduce the cost of tests
  2. 04 Apr, 2022 4 commits
  3. 03 Apr, 2022 4 commits
    • C
      [Phi]Concat grad (#41112) · 3f57ef7a
      chentianyu03 committed
      * add concat_grad kernel
      
      * fix error
      
      * remove comment code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
    • F
      add maximum limit for grid of index_select (#41127) · af8d2482
      FlyingQianMM committed
      * limit grid dim for index select
      
      * mv LimitGridDim into gpu_launch_config.h
      
      * fix conflicts
      
      * fix conflicts
      
      * fix code style
      
      * set block to 256
      
      * fix grid setting
      
      * set dtype of block_dim to unsigned int
    • Z
      Add randperm and range yaml (#41265) · fd1ecfc5
      zyfncg committed
      * add randperm and range yaml
      
      * add eager test for randperm
    • F
      Add some yaml config (#41053) · e4914734
      From00 committed
      * Add yaml config
      
      * Add yaml for flatten_contiguous_range_op
      
      * Remove h_sigmoid yaml
      
      * Fix CI errors
      
      * Fix code format
      
      * Fix flatten OP errors
      
      * Fix conflicts
      
      * Fix CI errors
      
      * Remove flatten_contiguous_range OP
      
      * Remove redundant code
      
      * Fix typos
  4. 02 Apr, 2022 9 commits
  5. 01 Apr, 2022 7 commits
    • H
      update (#41245) · 99029dc9
      hong committed
    • W
      [Eager] Support pinned (#41035) · f3270fc8
      wanghuancoder committed
      * support pinned, test=develop
      
      * support async_write, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine,test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
    • C
      [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Chen Weihang committed
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeat cmake item
      
      * fix unittest error
    • C
      [Phi]Interploatd kernels into phi (#40855) · d65a7a46
      chentianyu03 committed
      * add interpolate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for npu
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
    • Z
      3a29e4f8
    • C
      [Phi] Add shape and strided_slice yaml & Adapt eager mode (#41131) · 9b6a02d4
      Chen Weihang committed
      * add several yaml
      
      * polish strided slice kernel & add yaml
      
      * reorder yaml
      
      * add several yaml
      
      * revert yaml config change
      
      * resolve conflict
      
      * Update test_strided_slice_op.py
    • H
      Add basic yaml backward (#40751) · 98303291
      hong committed
      * fix error; test=develop
      
      * update
      
      * close some yaml
      
      * fix backward attribute error; test=develop
      
      * add div test
      
      * polish code; test=develop
      
      * update
      
      * update
      
      * fix bug
      
      * update bitwise code; test=develop
      
      * update
      
      * update
      
      * fix some bug
      
      * update
      
      * revert cmakelist
      
      * fix optional bug;
      
      * fix bug
      
      * fix bug;
      
      * add backward test
      
      * open bn
      
      * update
      
      * update
      
      * revert eager_gen
      
      * polish code
      
      * fix topk error
      
      * update
      
      * update
      
      * fix bug;
      
      * move label smooth, nll loss
      
      * revert topk
      
      * fix topk label smooth bug;
      
      * remove batch_norm
      
      * remove topk
      
      * change flip infer meta
      
      * fix flip bug
      
      * update yaml
      
      * close abs
      
      * fix histogram bug
      
      * fix histogram bug
      
      * add abs
      
      * fix histogram kernel
      
      * remove expand
  6. 31 Mar, 2022 6 commits
  7. 30 Mar, 2022 9 commits