1. 16 3月, 2022 24 次提交
  2. 15 3月, 2022 16 次提交
    • C
      [Phi] Move determinant op kernel into phi (#40539) · a04a6bd5
      Chen Weihang 提交于
      * add determinant phi kernel
      
      * remove original determinant op kernel
      
      * add determinant grad [hi kernel
      
      * fix determinant test failed
      
      * remove original determinant grad op kernel
      a04a6bd5
    • C
      remove cmake kernel print info (#40550) · 0c0acbd7
      Chen Weihang 提交于
      0c0acbd7
    • L
      [phi] modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot (#40506) · 31729a62
      Liu-xiandong 提交于
      * [phi] move matrix_power op
      
      * MatrixInverse fluid -> phi
      
      * modify the CMake to fix compile bug
      
      * delete useless comment
      
      * mutable memory -> phi Alloc
      
      * modify the include file
      
      * modify the include file
      
      * fix bug in CI compiler
      
      * [phi]modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot
      
      * delete useless comment
      
      * fix bug in CI
      
      * modify after review
      31729a62
    • R
      add number count op (#39224) · 9bdee437
      Roc 提交于
      * add expert count op
      
      add ut for expert_count
      
      * update UT only for cuda
      
      * fix for rocm
      
      * update ut
      
      * add moe module
      
      * add expert count op
      
      add ut for expert_count
      
      * update UT only for cuda
      
      * update ut
      
      * add moe module
      
      * make expert count private
      
      * rename expert count op
      Co-authored-by: Nhlygit66666 <2570058140@qq.com>
      9bdee437
    • X
      run python api in eager model and filter the out in argument list (#40523) · 4d886f75
      xiongkun 提交于
      * run python api in eager model and filter the out in argument list
      
      * fix code
      4d886f75
    • Z
      Fixed issues with generated scale operator (#40482) · 30417999
      Zhanlue Yang 提交于
      * Fixed issues with generated scale operator
      
      * Fixed minor issues
      30417999
    • F
      [NPU] add AMP O1 support (#40362) · 69dd43d1
      furnace 提交于
      * [NPU] add AMP O1 support
      
      * [NPU] fix NOTE and warnings
      69dd43d1
    • C
      [Phi] Move gather op kernel into phi (#40500) · 0c703fe7
      Chen Weihang 提交于
      * add phi gather kernel
      
      * update year
      
      * remove original gather opkernel
      
      * add gather grad phi kernels
      
      * remove origin gather grad kernel
      
      * fix failed npu and xpu
      
      * fix xpu compile failed
      0c703fe7
    • J
      oneDNN NHWC fixes (#40049) · dde9cec0
      Jacek Czaja 提交于
      * - Prototype of third solution
      
      - fix
      
      - compilation fixes
      
      - fix
      
      - fixe
      
      - fix
      
      - fix
      
      - compilation fix
      
      - comment fix
      
      - lint
      
      update mkldnn conv_elementwise_add_fuse_pass ut
      
      - NHWC changes to prelu
      
      - alhpa dims
      
      - UT fix
      
      - fix to UT
      
      - lint
      
      - Some fixes
      
      - added to BWD of prelu NHWC support
      
      - reverted removal of resetting cu_layout in clearing of caching
      
      * - Small changes
      
      * - compilation fix
      
      * - fix
      
      * - fix
      
      * lint
      
      * - fixes after internal review
      
      * - compilation fix
      
      * - lint
      dde9cec0
    • zhouweiwei2014's avatar
      change CUDA implementation of randperm OP (#40464) · 813f61d2
      zhouweiwei2014 提交于
      813f61d2
    • T
      add shard_id (#40261) · 6b7d4845
      Thunderbrook 提交于
      * shard_id
      
      * format
      6b7d4845
    • X
      [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass... · 64223620
      xiongkun 提交于
      [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)
      
      * tranfer and pass the lgamma unittest
      
      * merge and pass the test
      
      * transfer kldiv_loss and kldiv_loss_grad; pass the unitest
      
      * trafer the isclose and cumprod kernel
      
      * change PT_REGISTER -> PD_REGISTER
      
      * fix by code review
      
      * fix by code review
      
      * fix
      
      * remove enforce include dependence from scalar
      
      * fix
      
      * fix by code review
      
      * fix by code review
      64223620
    • C
      add softmax yaml and add_raw infermeta (#40534) · 7039f61e
      Chen Weihang 提交于
      7039f61e
    • C
      [Phi]move reduce_min/any/all kernel (#40374) · c46e661d
      chentianyu03 提交于
      * add reduce_min kernel
      
      * remove raw reduce_min kernel
      
      * add reduce min
      
      * add reduce any all impl
      
      * add bool reduce Kernel
      
      * remove raw any/all kernel
      
      * add any all kernel
      
      * rm comment
      c46e661d
    • Z
      Added more profile signposts to dygraph (#40201) · 36db75b4
      Zhanlue Yang 提交于
      * Added more signposts to dygraph profiling
      
      * Fixed minor issues
      
      * Refactored signpost names
      
      * Fixed typo
      
      * Removed debug codes
      
      * Fixed typo
      
      * Adjusted signpost names
      
      * Fixed issues from branch merge
      36db75b4
    • H
      Move one hot to phi (#39876) · 7701db37
      hong 提交于
      * move one hot to phi; test=develop
      
      * fix bugs; test=develop
      
      * fix bugs; test=develop
      
      * add infer meta; test=develop
      
      * fix bugs; test=develop
      
      * resolve confilct
      
      * resolve confilct
      
      * fix bug;
      
      * fix error; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * add one api in eager mode; test=develop
      
      * add one hot test; test=develop
      
      * remove use less code; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
      7701db37