1. 22 2月, 2022 3 次提交
  2. 21 2月, 2022 9 次提交
  3. 20 2月, 2022 1 次提交
  4. 19 2月, 2022 2 次提交
    • Z
      Enabled test_matmul_v2_op for final state Eager Dygraph (#39504) · 77625d7d
      Zhanlue Yang 提交于
      * Enabled test_matmul_v2_op for final state Eager Dygraph
      
      * Fixed minor issue
      
      * Fixed format issue
      77625d7d
    • S
      Add the DistributedFusedLamb optimizer (#39148) · 5df3cd61
      sneaxiy 提交于
      * add DistributedFusedLamb op
      
      * polish code
      
      * fix compile error
      
      * compatible with pten changement
      
      * fix rocm compile error
      
      * improve converage
      
      * update upstream/develop
      
      * fix cast_with_ptr.h
      
      * add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1
      
      * fix clip before allreduce
      
      * add use_master_param_norm
      
      * code polish
      
      * fix bug
      
      * fix ROCM ci
      5df3cd61
  5. 18 2月, 2022 7 次提交
  6. 17 2月, 2022 7 次提交
  7. 16 2月, 2022 11 次提交
    • W
      [Eager] Support eager hook_for_layer (#39531) · a909bdf1
      Weilong Wu 提交于
      * Update comment
      
      * [Eager] Support test_imperative_hook_for_layer with _test_eager_guard()
      
      * Polish code name style
      
      * Fix a error name
      
      * Polish code, make it clear and simple
      a909bdf1
    • T
      optimize prior_box for kunlun, *test=kunlun (#39477) · e254e7c6
      TTerror 提交于
      e254e7c6
    • F
      [MLU] support adative pooling (#39500) · f138371c
      fwenguang 提交于
      f138371c
    • 0
      [Dy2St]Refine AnnAssign in static_analysis (#39572) · eb3c7d00
      0x45f 提交于
      eb3c7d00
    • A
      Add ConditionalBlockGradInferVarType (#39585) · ff7e3590
      Aurelius84 提交于
      ff7e3590
    • L
      [bf16] pten matmul cuda kernel support bf16 (#39485) · d5a0d31a
      Leo Chen 提交于
      * pten matmul cuda kernel support bf16
      
      * fix pten kernel name
      
      * add matmul_grad bf16 kernel
      
      * add emptylike bf16 kernel
      
      * fix compile
      
      * suppport rocm
      
      * fix error
      
      * fix rocm
      
      * add bf16 header file
      
      * fix compile
      d5a0d31a
    • J
      EagerTensor to EagerVariable (#39447) · 831fd86e
      Jiabin Yang 提交于
      * merge legacy to fluid
      
      * Remove legacy code
      
      * Remove legacy code
      
      * Remove DataType test
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * add more test
      
      * merge develop and refine code
      831fd86e
    • T
      f21d7957
    • Z
      Test only trt group norm (#39561) · ac894ced
      zlsh80826 提交于
      ac894ced
    • Z
      sync/geo test ok & fix heter_worker program ok (#39511) · b2986bab
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      
      * cpu-async-ps minimize test ok & gpu minimize test ok
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * sync/geo test ok & fix heter_worker program ok
      
      * .
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      b2986bab
    • A
      fix ut for pinv (#39566) · 0bcf1365
      andyjpaddle 提交于
      0bcf1365