1. 15 2月, 2022 2 次提交
  2. 14 2月, 2022 7 次提交
    • H
      Add Inplace addto pass and unittest. (#39433) · 52af0a60
      hlygit66666 提交于
      * add fuse_relu_depthwise_conv_pass unittest
      
      * fix atol and rtol
      
      * fix according to review
      
      * Update test_dist_fuse_relu_depthwise_conv_pass.py
      
      * add inplace_addto pass and unittest
      52af0a60
    • S
      [UT] mish op, conv+mish, fc+mish fuse passes (#39340) · 02938b3d
      Sławomir Siwek 提交于
      * mish unit tests
      
      * code format
      
      * remove unused imports
      
      * code format
      
      * remove hard-coded shape values
      
      * remove timeouts
      
      * remove timeouts v2
      
      * restore timeouts
      02938b3d
    • Z
      统一ps:heter ps 二阶段单测通过 (#39468) · 765a2ada
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      
      * cpu-async-ps minimize test ok & gpu minimize test ok
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      765a2ada
    • z8hanghuan's avatar
      new may of test cases, *test=kunlun (#39444) · e07420b9
      z8hanghuan 提交于
      * new may of test cases, *test=kunlun
      
      * new may of test cases, *test=kunlun
      
      * new may of test cases, *test=kunlun
      e07420b9
    • T
      fix gather_nd, *test=kunlun (#39283) · d12c3636
      TTerror 提交于
      d12c3636
    • T
    • Z
      Fixed get_tensor method for EagerTensor (#39414) · 97229944
      Zhanlue Yang 提交于
      * Enabled Eager OpTest #1
      
      * Enabled Eager OpTest #1
      
      * Fixed get_tensor method for EagerTensor
      97229944
  3. 11 2月, 2022 7 次提交
    • L
      Add TensorRT inspector into Paddle-TRT (#38362) · 69793a27
      Leo Chen 提交于
      69793a27
    • J
      Added shape (U)INT8/BF16/FP32 oneDNN kernel (#36033) · 52bbaae9
      jakpiase 提交于
      * added shape oneDNN kernel
      
      * removed unnecessary import from test
      
      * added skipping tests for GPU
      
      * refactoring
      
      * refactored shape kernel
      
      * added tests in new framework
      
      * removed one line
      
      * minor change
      
      * added newline at EOF
      
      * added formatting
      
      * added attributes as extra
      52bbaae9
    • F
      [MLU] add pool2d pytest (#39454) · 2db25f0d
      fwenguang 提交于
      2db25f0d
    • J
      uniform_random op for mlu (#39450) · 02f06708
      joeqiao12 提交于
      02f06708
    • Z
      [bf16] add bf16 kernel: transpose & unbind (#39457) · 1e6047f1
      zhangbo9674 提交于
      * add transpose unbind
      
      * add unittest
      
      * refine transpose unittest
      1e6047f1
    • J
      fix prelu trt convert (#39389) · c86765ed
      JingZhuangzhuang 提交于
      c86765ed
    • Z
      统一 ps 开发 - python (#39431) · 22c67d14
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      
      * cpu-async-ps minimize test ok & gpu minimize test ok
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      22c67d14
  4. 10 2月, 2022 8 次提交
  5. 09 2月, 2022 8 次提交
  6. 08 2月, 2022 7 次提交
    • S
      Make Embedding layer support more int ids type (#39381) · 60f1461a
      sneaxiy 提交于
      * add more int id type support for embedding
      
      * add ut
      
      * add more ut
      
      * fix ci error
      60f1461a
    • H
      Add FuseOptimizerPass and test_dist_fuse_adam_pass unittest. (#39208) · ccdcfa2d
      hlygit66666 提交于
      * add fuse_relu_depthwise_conv_pass unittest
      
      * fix atol and rtol
      
      * fix according to review
      
      * Add FuseOptimizerPass and fuse_adam_pass unittest
      
      * add sgd and momentum unittest
      
      * add fuse_optimizer_pass
      
      * close amp
      
      * close amp
      
      * update
      
      * fix run on two cards
      
      * Update test_dist_fuse_adam_pass.py
      
      * Update test_dist_fuse_momentum_pass.py
      
      * Update test_dist_fuse_sgd_pass.py
      
      * Create test_dist_fuse_sgd_pass.py
      
      * Create test_dist_fuse_sgd_pass.py
      
      * Create test_dist_fuse_sgd_pass.py
      
      * Update test_dist_fuse_adam_pass.py
      
      * Update test_dist_fuse_momentum_pass.py
      
      * Update test_dist_fuse_sgd_pass.py
      ccdcfa2d
    • Z
      ps optimize refactor (#38982) · 196dbfc2
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      196dbfc2
    • Z
      [bf16] add bf16 cuda kernel: concat and split (#39380) · de0bad2a
      zhangbo9674 提交于
      * add concat & split
      
      * add concat kernel
      
      * add concat unittest
      
      * add split unittest
      de0bad2a
    • B
      optimize sharding stage3 (#39334) · 23d559dd
      Baibaifan 提交于
      23d559dd
    • C
      Fix reduce_sum dtype dispatch bug on gpu (#39349) · 4d7ad277
      Chen Weihang 提交于
      * fix pten reduce dispatch bug
      
      * add cast beforce reduce
      
      * fix test failed
      4d7ad277
    • L
      [bf16] support printing bf16 tensor (#39375) · f57b21e6
      Leo Chen 提交于
      f57b21e6
  7. 07 2月, 2022 1 次提交