1. 04 3月, 2022 4 次提交
    • C
      fix warning (#40133) · 14e98a0f
      chenjian 提交于
      14e98a0f
    • Z
      Generate forward-only operators (#39962) · a6947991
      Zhanlue Yang 提交于
      * [Eager][Yaml]Supported Scalar and ScalarArray for AutoCodeGen
      
      * Generate forward-only operators
      
      * [Yaml]Support parsing fwd & bwd returns with name
      
      * Fixed issues
      
      * Fixed minor issues
      a6947991
    • Z
      d2a911b4
    • H
      Move conv to pten (#39354) · d50fb43e
      hong 提交于
      * move conv to pten
      
      * move conv to pten; test=develop
      
      * fix bug;
      
      * add conv cudnn impl; test=develop
      
      * update
      
      * update operator; test=develop
      
      * fix bug; test=develop
      
      * move operator and prepared_operator to develop; test=develop
      
      * resolve conflict; test=develop
      
      * remove useless code;test=develop
      
      * add depency ; test=develop
      
      * fix bug;
      
      * add sig.cc ; test=develop
      
      * fix use_op error; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * add conv3d register; test=develop
      
      * fix star gan and conv_nn_grad test failed; test=develop
      
      * add header; test=develop
      
      * manul to recover to develop;
      
      * resolve confilct; test=develop
      
      * remove useless code
      
      * fix bug;
      
      * remove conv2d_cudnn; test=develop
      
      * fix bugs; test=develop
      
      * fix cpu rocm compile bugs; test=develop
      
      * fix blas error; test=develop
      
      * fix compile bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows error; test=develop
      
      * resolve confilct; test=develop
      d50fb43e
  2. 03 3月, 2022 24 次提交
  3. 02 3月, 2022 12 次提交
    • L
      Replacing dropout eval eigen usage by cuda kernel (#40053) · 272b32fd
      Li Min 提交于
      * Replacing dropout eval eigen usage by cuda kernel
      272b32fd
    • F
      [MLU] add mlu ci script (#39805) · a8e02ef1
      fwenguang 提交于
      * [MLU] add mlu ci script
      
      * Update CMakeLists.txt
      a8e02ef1
    • H
      Move sgd to phi (#40045) · f3d54e2e
      hong 提交于
      * move sgd to phi; test=develop
      
      * update
      
      * add sgd kernel; test=develop
      f3d54e2e
    • W
      modify infershape of yolo_box (#40056) · ebc6959c
      wangxinxin08 提交于
      * modify infershape of yolo_box
      ebc6959c
    • L
      add check for backward hook (#40041) · 1980e33a
      Leo Chen 提交于
      * add check for backward hook
      
      * refine ut
      1980e33a
    • S
      Move gather.h/gather.cu.h/scatter.h/scatter.cu.h to the phi library (#40043) · 09258040
      sneaxiy 提交于
      * move gather.h gather.cu.h scatter.h scatter.cu.h to phi library
      
      * fix CI
      
      * fix rocm ci
      09258040
    • S
      vec scale kernel (#40011) · 2e6548a9
      sneaxiy 提交于
      2e6548a9
    • Y
      [Phi]Move elementwise function to funcs directory (#39986) · 5898e9ab
      YuanRisheng 提交于
      * move elementwise function to funcs directory
      
      * fix compile bugs
      
      * modify according to comment
      5898e9ab
    • A
      [XPU] Fix Phi Kernel cache problem in operator.cc (#40044) · 66196573
      Aurelius84 提交于
      * [XPU] Fix Phi Kernel cache problem in operator.cc
      
      * fix typo
      66196573
    • H
      Move transpose to pten (#39327) · 7a857924
      hong 提交于
      * immigrate_transpose_to_pten cpu kernel only; test=develop
      
      * fix bug; test=develop
      
      * add transpose cuda api
      
      * bug fix;
      
      * fix bugs
      
      * fix bugs; test=develop
      
      * bug fix;
      
      * move transepose to pten; test=develop
      
      * fix bug; test=develop
      
      * fix bugs; test=develop
      
      * add transpose grad fp16 support; test=develop
      
      * fix bug; test=develop
      
      * fix npu bug; test=develop
      
      * fix nemul = 0 bug; test=develop
      
      * add fp16 support; test=develop
      
      * fix data type register bug; test=develop
      
      * fix transpose bug; test=develop
      
      * update transpose
      
      * fix transpose bug; test=develop
      
      * remove useless code; test=develop
      
      * remove useless code; test=develop
      
      * fix transpose alias bug; test=develop
      
      * polish code; test=develop
      
      * resolve confict; test=develop
      
      * resolve confilct; test=develop
      
      * recover prepared operator; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      7a857924
    • F
      Move BroadcastTensors OP to phi (#40047) · 2a5590a1
      From00 提交于
      * Move BroadcastTensors OP to phi
      
      * Remove mutable_data in impl
      
      * Move BilinearTensorProductInferMeta to multiary.h/cc
      2a5590a1
    • Z
      new fleet_desc builder (#39948) · 1c4e3e5d
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * refactor ps optimize
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * refactor theoneps
      
      * the_one_ps
      
      * add ps pass unittest
      
      * add ps pass unittest
      
      * ps unitest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * ps unittest frame
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * add cpu_async_ps_mode test
      
      * ps unittest ready
      
      * ps unittest ready
      
      * solve dist_pass init conflict
      
      * solve import CommContext error
      
      * unittest ok
      
      * implement AllocateFrom
      
      * solve setup.py.in conflict
      
      * solve conflict
      
      * solve conflict
      
      * solve conflict
      
      * .
      
      * .
      
      * cpu-async-ps minimize test ok & gpu minimize test ok
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * add heter 2stage unittest
      
      * sync/geo test ok & fix heter_worker program ok
      
      * .
      
      * new fleet desc generator
      
      * new fleet_desc builder
      
      * new fleet_desc builder
      
      * .
      
      * .
      
      * correct ps.proto compile
      
      * .
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      1c4e3e5d