1. 07 9月, 2021 1 次提交
  2. 06 9月, 2021 8 次提交
  3. 05 9月, 2021 1 次提交
  4. 04 9月, 2021 1 次提交
  5. 03 9月, 2021 12 次提交
  6. 02 9月, 2021 7 次提交
    • J
      [NPU] Support npu kernel for gather_nd op (#34800) · bb633965
      JingZhuangzhuang 提交于
      * [NPU] Support npu kernel for gather_ng op
      
      * [NPU] Support npu kernel for gather_nd op
      
      * [NPU] Support npu kernel for gather_nd and gather_nd_grad op
      
      * update py format error.
      
      * modify gather_nd_op_npu
      
      * modify gather_nd 910 test
      
      * modify gather_nd 910 test
      Co-authored-by: Nxiaoxiaohehe001 <hiteezsf@163.com>
      bb633965
    • X
      Add SVD Op and it's GPU and CPU kernel (#34953) · 7e5fb462
      xiongkun 提交于
      * Add SVD Op and it's GPU and CPU kernel
      
      * Remove CUDAPlace in test_svd_op, make the test available in CPU package
      
      * modfity the file
      
      * fix windows bug/ fix ROCM / fix test timeout
      
      * for pass the CIs
      
      * improve error report
      
      * for code review
      
      * some modification to test_svd_op
      
      * change python code style
      
      * expose the svd interface for document
      7e5fb462
    • Z
      [NPU] Add label_smooth_op (#34828) · e57a88b3
      zhulei 提交于
      * [NPU] Add label_smooth_op
      
      * [NPU] Add label_smooth_op
      e57a88b3
    • Y
      [hybrid] [npu] fit npu nan/inf check (#35171) · 67ed7e12
      Yuang Liu 提交于
      67ed7e12
    • W
      fix static error in summary (#35303) · b28cc734
      wangna11BD 提交于
      b28cc734
    • J
      [Auto Parallel] Logical Partition & Dist Op (#35117) · a622b701
      JZ-LIANG 提交于
      * support shard reader
      
      * support shard reader
      
      * add parallel mode
      
      * update process mesh
      
      * add method to compute comm_group
      
      * implement dist_embedding forward func
      
      * implement dist matmul forward func
      
      * implement dist reshape forward func
      
      * add transpiler framework
      
      * add transpiler forward
      
      * implement transpiler forward
      
      * implement transpiler backward & update
      
      * add process
      
      * add unitest
      
      * chmod
      
      * chmod
      
      * chmod
      
      * update unitest
      
      * add unitest for gpt
      
      * remove unused print
      
      * rename transpiler --> partitioner
      
      * rename transpiler --> partitioner
      
      * chmod
      
      * chmod
      
      * bug fixed
      
      * remove amp function
      
      * update case for dp mode
      
      * update case for dp mode
      a622b701
    • B
      [npu] add update_loss_scaling npu min value (#35270) · 280d7421
      Baibaifan 提交于
      280d7421
  7. 01 9月, 2021 10 次提交