1. 01 4月, 2022 8 次提交
    • X
      add yaml for ele_max ele_min. (#41161) · 0d28edfa
      xiongkun 提交于
      * add yaml for ele_max ele_min
      
      * fig
      
      * push
      
      * xxx
      0d28edfa
    • Z
      [DoubleGrad #4] Bug Fixes to Double Grad Node Generation (#41121) · 01724b1a
      Zhanlue Yang 提交于
      * [Refactor] refactored eager_gen.py PR #2
      
      * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes
      
      * Fixed minor issue
      
      * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition
      
      * Fixed issues
      
      * Supported higher-order grad node generation
      
      * [DoubleGrad PR #4] Supported higher-order GradNode generation
      
      * [DoubleGrad #4] Bug Fixes to Double Grad Node Generation
      
      * Fixed yaml typo
      
      * Fixed yaml typo
      
      * fixed minor issues
      
      * Fixed minor issue
      01724b1a
    • Z
      fix py36 import as error (#41236) · 6ed6f9fe
      ziyoujiyi 提交于
      * back fl
      
      * delete ssl cert
      
      * .
      
      * make warning
      
      * .
      
      * unittest paral degree
      
      * solve unittest
      
      * heter & multi cloud commm ready
      
      * .
      
      * .
      
      * correct py36 import error
      
      * correct py36 import error
      
      * correct py36 import error
      
      * correct py36 import error
      6ed6f9fe
    • Z
      3a29e4f8
    • S
      fix bug of bfgs example code;test=document_fix (#41195) · db948373
      Sing_chan 提交于
      db948373
    • Z
    • C
      [Phi] Add shape and strided_slice yaml & Adapt eager mode (#41131) · 9b6a02d4
      Chen Weihang 提交于
      * add several yaml
      
      * polish strided slice kernel & add yaml
      
      * reorder yaml
      
      * add several yaml
      
      * revert yaml config change
      
      * resolve conflict
      
      * Update test_strided_slice_op.py
      9b6a02d4
    • H
      Add basic yaml backward (#40751) · 98303291
      hong 提交于
      * fix error; test=develop
      
      * update
      
      * close some yaml
      
      * fix backward attrite error; test=develop
      
      * add div test
      
      * polish code; test=develop
      
      * update
      
      * update
      
      * fix bug
      
      * update bitwise code; test=develop
      
      * update
      
      * update
      
      * fix some bug
      
      * update
      
      * revert cmakelist
      
      * fix optional bug;
      
      * fix bug
      
      * fix bug;
      
      * add backward test
      
      * open bn
      
      * update
      
      * update
      
      * revert eager_gen
      
      * polish code
      
      * fix topk error
      
      * update
      
      * update
      
      * fix bug;
      
      * move label smooth, nll loss
      
      * revert topk
      
      * fix topk label smooth bug;
      
      * remove batch_norm
      
      * remove topk
      
      * change flip infer meta
      
      * fix flip bug
      
      * update yaml
      
      * close abs
      
      * fix histogram bug
      
      * fix histogram bug
      
      * add abs
      
      * fix histogram kernel
      
      * remove expand
      98303291
  2. 31 3月, 2022 22 次提交
  3. 30 3月, 2022 10 次提交
    • Z
      py36 Import error bug fix (#41135) · d006c7ff
      ziyoujiyi 提交于
      * lazy import
      
      * log error
      d006c7ff
    • F
      Fix bug for UT test_calc_gradient (#41130) · 4d6a3b9f
      From00 提交于
      4d6a3b9f
    • 0
      Fix test_jit_save_load (#41114) · 4b61918d
      0x45f 提交于
      4b61918d
    • Z
      [AutoParallel] fix converter when sliced_shape is 1 (#41103) · 59c4fdac
      zhaoyingli 提交于
      * fix converter when sliced_shape is 1
      
      * update unittest
      59c4fdac
    • Z
      delete ps env (#41079) · a0e961c0
      ziyoujiyi 提交于
      * back fl
      
      * delete ssl cert
      
      * .
      
      * make warning
      
      * .
      
      * unittest paral degree
      
      * solve unittest
      
      * heter & multi cloud commm ready
      
      * correct pass not regisiter
      
      * back
      
      * back
      
      * .
      
      * .
      a0e961c0
    • P
      5f7d129a
    • R
      [MoE] Moe apis (#41092) · aac7879a
      Roc 提交于
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * add op about moe gate
      
      update utils
      
      add limit by capacity op
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      * fix for win
      
      * fix bugs in test_limit_by_capacity_op
      
      * update ut
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * update(fix) ut for win
      
      * moe apis in incubate
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * fix for win
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * fix ut for number count
      
      * add apis and utils
      
      * add gate apis
      
      * add moe and grad clip apis
      
      * update moe apis
      
      * add ops for moe gate
      
      * fix
      
      * update for base moe layer api
      
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * fix for dygraph
      
      * update with ranodm routing
      
      * update
      
      * fix ut for limit by capacity
      
      * update
      
      * update limit by capacity for easily to switch to single thread mode
      
      * update api docs
      Co-authored-by: Nhlygit66666 <2570058140@qq.com>
      aac7879a
    • H
      [Op] Fix uncontrolled randomness of index_select op (#41078) · 8f7c02f2
      Haohongxiang 提交于
      * fix uncontrolled randomness of op
      
      * fix bugs
      8f7c02f2
    • F
      Add new APIs for GPU memory monitoring (max_memory_allocated,... · afe02e9d
      From00 提交于
      Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657)
      
      * Add new API memory_reserved
      
      * Add memory_allocated, max_memory_reserved and max_memory_allocater
      
      * Fix CI error
      
      * Fix CI error
      
      * Enhance UT
      
      * Add FLAGS_memory_stats_opt
      
      * Add STATS macro functions
      
      * Add StatAllocator
      
      * Fix CI errors
      
      * Add UT
      
      * Fix CI errors
      afe02e9d
    • C
      fix reshard bug (#41106) · e494b73b
      caozhou 提交于
      e494b73b