1. 04 3月, 2022 7 次提交
    • Z
      Fix bug caused by split infershape (#40116) · 45385371
      zyfncg 提交于
      * fix bug caused by split infershape
      
      * revert infer_shape of split
      
      * revert split
      45385371
    • C
      [Phi] Remove cholsky solve deps with svd helper (#40119) · 28fd30cd
      Chen Weihang 提交于
      * remove cholsky solve deps with svd helper
      
      * fix shape infer bug
      28fd30cd
    • zhouweiwei2014's avatar
      【Phi】Migrate bitwise_and/bitwise_or/bitwise_xor/bitwise_not op into phi (#40031) · 03eb792d
      zhouweiwei2014 提交于
      * Migrate bitwise_and/or/xor/not op into phi
      
      * fix CI
      03eb792d
    • L
      clean distribution_helper, index_impl, aligned_vector code in fluid (#40071) · b9672a1e
      Leo Chen 提交于
      * clean distribution_helper, index_impl, aligned_vector code in fluid
      
      * fix conflicts
      b9672a1e
    • C
      [phi]move reduce gpu impl funcs into pten/kernels/funcs (#39990) · e2e2d531
      chentianyu03 提交于
      * move reduce gpu impl funcs into pten/kernels/funcs
      
      * change reduce header name and namespace
      
      * fix spell word error
      
      * change mutable_data to dev_ctx.Alloc
      
      * modify place to devcontex
      
      * format code style
      
      * fix build error
      
      * fix build error
      
      * fix conflict
      e2e2d531
    • X
      transfer selu infershape (#40137) · abacc4cb
      xiongkun 提交于
      abacc4cb
    • H
      Move conv to pten (#39354) · d50fb43e
      hong 提交于
      * move conv to pten
      
      * move conv to pten; test=develop
      
      * fix bug;
      
      * add conv cudnn impl; test=develop
      
      * update
      
      * update operator; test=develop
      
      * fix bug; test=develop
      
      * move operator and prepared_operator to develop; test=develop
      
      * resolve conflict; test=develop
      
      * remove useless code;test=develop
      
      * add depency ; test=develop
      
      * fix bug;
      
      * add sig.cc ; test=develop
      
      * fix use_op error; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * add conv3d register; test=develop
      
      * fix star gan and conv_nn_grad test failed; test=develop
      
      * add header; test=develop
      
      * manul to recover to develop;
      
      * resolve confilct; test=develop
      
      * remove useless code
      
      * fix bug;
      
      * remove conv2d_cudnn; test=develop
      
      * fix bugs; test=develop
      
      * fix cpu rocm compile bugs; test=develop
      
      * fix blas error; test=develop
      
      * fix compile bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows error; test=develop
      
      * resolve confilct; test=develop
      d50fb43e
  2. 03 3月, 2022 13 次提交
    • Y
      fix save_vars bugs (#40062) · eaacf8bf
      YuanRisheng 提交于
      eaacf8bf
    • 0
      move eye, lerp infershape to phi (#40105) · 1c205883
      0x45f 提交于
      1c205883
    • T
      cinn_launch_op: switch to execution by PE (#39911) · 167d511f
      TeFeng Chen 提交于
      * swith to PE execution in cinn launch
      
      * fix outer variables erased
      
      * skip the map bug temporarily for test
      
      * temporary solution for batch_norm bug
      
      * update comment
      
      * fix compile error
      
      * cinn_instruction_run_op_test: update code to skip external alloc/free instructions generated
      167d511f
    • F
      Move compare OPs to phi (#39970) · 0969a4eb
      From00 提交于
      * Move compare OPs to phi
      
      * Fix bug
      
      * Use BroadcastKernel and ElementwiseKernel in phi
      0969a4eb
    • W
      modify infershape of multiclass nms (#40059) · 756af9ff
      wangxinxin08 提交于
      * modify infershape of multiclass nms
      756af9ff
    • Y
      [Phi]Delete kernel registry of elementwise_sub op in Fluid (#40039) · cac00e0b
      YuanRisheng 提交于
      * delete elementwise_sub kernel registry
      
      * fix compile bugs in xpu ci
      
      * fix bugs when run inference ci
      cac00e0b
    • W
      EmbEltwiseLayernorm fix (#40015) · c3f3643b
      wenbin 提交于
      * emb fix
      
      * fix trt6 compile
      
      * fix half
      
      * absolute error fix
      c3f3643b
    • H
      Modified sigmoid by the elementwise interface. (#39898) · 5d9e11a4
      huangxu96 提交于
      * Modified sigmoid by elementwise interface.
      
      * using TensorReduceImpl to repalce Sum function
      
      * using reduceimpl to calculate the norm variable
      
      * Removed useless code
      5d9e11a4
    • L
      Add support of int16 for gather op. (#40052) · 3e56e816
      Li Min 提交于
      * add support of int16 for gather op.
      
      * Recover formats.
      
      * Recover formats.
      
      * fix.
      
      * Fix format.
      
      * Fix format.
      3e56e816
    • X
      [phi] transfer pad kernel into phi and pass the test_pad_op (#40012) · 9f74b84e
      xiongkun 提交于
      * add pad forward
      
      * fix error
      
      * transfer pad and pass the test_pad_op
      9f74b84e
    • C
      move gather_tree infer shape (#40082) · 3779e807
      crystal 提交于
      3779e807
    • F
      [Phi] move gaussian_random (#39932) · 00bbb8c5
      furnace 提交于
      [Phi] move gaussian_random kernel
      00bbb8c5
    • H
      Move bn to pten (#39347) · ebd0f512
      hong 提交于
      * add bn cpu version; test=develop
      
      * move batch norm to pten
      
      * move batch norm to pten; test=develop
      
      * fix bug; test=develop
      
      * fix func::tranpose depend bug; test=develop
      
      * fix compile bugs; test=develop
      
      * fix use_op batch_norm bug; test=develop
      
      * fix cudnn bn add relu test; test=develop
      
      * fix pten context build and double grad bug; test= develop
      
      * remve useless code; test=develop
      
      * add batch norm gpu fp16 support; test=develop
      
      * fix test bn op bug; test=develop
      
      * remove output dtype set; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix applay pass to program bug; test=develop
      
      * revert to develop; test=develop
      
      * fix rocm bug; test=develop
      
      * revert operator to develop; test=develop
      
      * fix pre_commit; test=develop
      
      * fix statci check error; test=develop
      
      * resolve conflict; test=develop
      
      * ana batch norm bug;
      
      * revert batch norm op
      
      * resolve conlict
      
      * fix nan inf and speed bug; test=develop
      
      * fix bug; test=develop
      
      * fix error; test=develop
      
      * test expand op; test=develop
      
      * fix bug; test=develop
      
      * resolve confilct
      
      * resolve confilct; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
      
      * change mutable data to ctx alloc; test=develop
      
      * make format same with ci; test=develop
      
      * fix format error with ci; test=develop
      ebd0f512
  3. 02 3月, 2022 17 次提交
  4. 01 3月, 2022 3 次提交
    • C
      [Phi]rm reduce infershape (#39820) · 09039636
      chentianyu03 提交于
      * modify infershape utils and rm reduce infershape
      
      * merge develop
      
      * fix infermete bug
      
      * add IsForInferShape func in ArgumentMappingContext
      
      * add reduce_mean infermeta
      
      * modify annotation
      
      * add default dims
      09039636
    • X
      [phi] tranfer the selu_op and pass the CI (#39819) · 197da15a
      xiongkun 提交于
      * tranfer the selu_op and pass the CI
      
      * add sig files
      
      * fix code
      
      * fix by code review
      
      * remove TOOD
      
      * change the include position
      
      * change the head position
      197da15a
    • Z
      [bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843) · ce8ed978
      zhangbo9674 提交于
      * add layer norm
      
      * add p norm
      
      * add reduce sum
      
      * refine layer norm register bf16 for cudnn811
      
      * add bf16 cast for hip
      
      * add unittest
      
      * refine rocm
      
      * refine layer_norm unittest
      
      * refine reduce op
      
      * refine unittest
      
      * enhance atol for reduce unittest
      ce8ed978