1. 14 4月, 2022 1 次提交
  2. 11 4月, 2022 1 次提交
  3. 30 3月, 2022 1 次提交
  4. 27 3月, 2022 2 次提交
  5. 24 3月, 2022 2 次提交
  6. 23 3月, 2022 1 次提交
  7. 17 3月, 2022 1 次提交
    • H
      Move layer norm to phi (#40193) · 681a6865
      hong 提交于
      * update
      
      * fix bugs; test=develop
      
      * update; test=develop
      
      * fix test compile error; test=develop
      
      * fix cpu compile error; test=develop
      
      * fix test error; test=develo
      
      * fix layer_norm_op plugin error; test=develop
      
      * fix error; test=develop
      
      * fix test bug; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * fix bugs; test=develop
      
      * remove unused depency; test=develop
      
      * polish code; test=develop
      681a6865
  8. 15 3月, 2022 1 次提交
    • J
      oneDNN NHWC fixes (#40049) · dde9cec0
      Jacek Czaja 提交于
      * - Prototype of third solution
      
      - fix
      
      - compilation fixes
      
      - fix
      
      - fixe
      
      - fix
      
      - fix
      
      - compilation fix
      
      - comment fix
      
      - lint
      
      update mkldnn conv_elementwise_add_fuse_pass ut
      
      - NHWC changes to prelu
      
      - alhpa dims
      
      - UT fix
      
      - fix to UT
      
      - lint
      
      - Some fixes
      
      - added to BWD of prelu NHWC support
      
      - reverted removal of resetting cu_layout in clearing of caching
      
      * - Small changes
      
      * - compilation fix
      
      * - fix
      
      * - fix
      
      * lint
      
      * - fixes after internal review
      
      * - compilation fix
      
      * - lint
      dde9cec0
  9. 14 3月, 2022 1 次提交
  10. 12 3月, 2022 1 次提交
  11. 11 3月, 2022 2 次提交
  12. 10 3月, 2022 1 次提交
    • L
      [phi] move shape op (#40248) · 575dea8f
      Liu-xiandong 提交于
      * add selected row op and fix bug in ctest
      
      * modify the date
      
      * fix bug in npu and xpu
      
      * modfiy the include file
      575dea8f
  13. 08 3月, 2022 1 次提交
  14. 04 3月, 2022 1 次提交
    • H
      Move conv to pten (#39354) · d50fb43e
      hong 提交于
      * move conv to pten
      
      * move conv to pten; test=develop
      
      * fix bug;
      
      * add conv cudnn impl; test=develop
      
      * update
      
      * update operator; test=develop
      
      * fix bug; test=develop
      
      * move operator and prepared_operator to develop; test=develop
      
      * resolve conflict; test=develop
      
      * remove useless code;test=develop
      
      * add depency ; test=develop
      
      * fix bug;
      
      * add sig.cc ; test=develop
      
      * fix use_op error; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * add conv3d register; test=develop
      
      * fix star gan and conv_nn_grad test failed; test=develop
      
      * add header; test=develop
      
      * manul to recover to develop;
      
      * resolve confilct; test=develop
      
      * remove useless code
      
      * fix bug;
      
      * remove conv2d_cudnn; test=develop
      
      * fix bugs; test=develop
      
      * fix cpu rocm compile bugs; test=develop
      
      * fix blas error; test=develop
      
      * fix compile bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows error; test=develop
      
      * resolve confilct; test=develop
      d50fb43e
  15. 02 3月, 2022 1 次提交
    • H
      Move transpose to pten (#39327) · 7a857924
      hong 提交于
      * immigrate_transpose_to_pten cpu kernel only; test=develop
      
      * fix bug; test=develop
      
      * add transpose cuda api
      
      * bug fix;
      
      * fix bugs
      
      * fix bugs; test=develop
      
      * bug fix;
      
      * move transepose to pten; test=develop
      
      * fix bug; test=develop
      
      * fix bugs; test=develop
      
      * add transpose grad fp16 support; test=develop
      
      * fix bug; test=develop
      
      * fix npu bug; test=develop
      
      * fix nemul = 0 bug; test=develop
      
      * add fp16 support; test=develop
      
      * fix data type register bug; test=develop
      
      * fix transpose bug; test=develop
      
      * update transpose
      
      * fix transpose bug; test=develop
      
      * remove useless code; test=develop
      
      * remove useless code; test=develop
      
      * fix transpose alias bug; test=develop
      
      * polish code; test=develop
      
      * resolve confict; test=develop
      
      * resolve confilct; test=develop
      
      * recover prepared operator; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      7a857924
  16. 26 2月, 2022 1 次提交
  17. 25 2月, 2022 2 次提交
  18. 24 2月, 2022 1 次提交
    • J
      Added nearest interp v2 BF16 FWD kernel (#39490) · 2ec943a7
      jakpiase 提交于
      * added nearest interp v2 bf16
      
      * disabled bilinear interp nhwc test
      
      * added skipping UT for gpu
      
      * added NHWC support
      
      * removed unnecessary statements
      
      * minor change
      
      * CI fix
      
      * added appropriate changes to interpolate_v1
      
      * fix after review
      
      * minor change
      
      * minor change
      
      * revert unwanted deletions
      
      * CI fix
      2ec943a7
  19. 22 2月, 2022 1 次提交
  20. 20 2月, 2022 1 次提交
  21. 19 2月, 2022 2 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
    • C
      fix RecordEvent interface (#39675) · 019a552b
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update operator.cc
      019a552b
  22. 18 2月, 2022 1 次提交
  23. 16 2月, 2022 1 次提交
    • Y
      [Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270
      YuanRisheng 提交于
      * remove reshape and elementwise_add registry
      
      * delete code
      
      * fix bugs when run ci ut
      
      * remove log
      
      * fix bugs when run unit test
      
      * fix bugs when run unit test
      
      * fix bugs when run cinn
      
      * fix bugs when run ci-mac-python3
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix bugs when run kunlun
      
      * fix bugs when compile
      
      * update code according comment
      c6478270
  24. 15 2月, 2022 2 次提交
    • J
      disabled unnecessary int reorders profiling (#39498) · 3581c075
      jakpiase 提交于
      3581c075
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  25. 11 2月, 2022 1 次提交
    • J
      Added shape (U)INT8/BF16/FP32 oneDNN kernel (#36033) · 52bbaae9
      jakpiase 提交于
      * added shape oneDNN kernel
      
      * removed unnecessary import from test
      
      * added skipping tests for GPU
      
      * refactoring
      
      * refactored shape kernel
      
      * added tests in new framework
      
      * removed one line
      
      * minor change
      
      * added newline at EOF
      
      * added formatting
      
      * added attributes as extra
      52bbaae9
  26. 08 2月, 2022 1 次提交
    • J
      Fix to #38126 (#39097) · f884edb9
      Jacek Czaja 提交于
      * - 38126 potential fix
      
      * - fix
      
      * - build fix
      
      * - another candidate fix
      
      * - compilation fix
      
      * - another fix
      
      * - Fix to activation of NHWC being first oneDNN op in chain on oneDNN ops
      
      * - compilation fix
      
      * - added NHWC reotating for elementwise being first op
      
      * - compilation fix
      
      * - compilation fix
      
      * - Added UT
      
      * - cosmetic fixes
      f884edb9
  27. 24 1月, 2022 1 次提交
  28. 18 1月, 2022 2 次提交
    • S
      Mish FP32/BF16 kernel, conv and fc fuse passes (#38623) · 1d18bc2c
      Sławomir Siwek 提交于
      * Mish
      
      * Change exp() library
      
      * mish fuse pass
      
      * mish attrs
      
      * fixes
      
      * mishop maker
      
      * remove attrs
      
      * mish kernal for bf16
      
      * fc+mish fuse
      
      * fix code format error
      
      * Resolve merge conflicts
      
      * Update mish operator version
      
      * update mish variable to new naming convention
      1d18bc2c
    • Z
      [Unify Tensors PR #8] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3
      Zhanlue Yang 提交于
      * Merged LoDTensor with Tensor,test=allcases
      
      * Patched python level LoDTensor
      
      * Patched python level LoDTensor
      
      * Merge Tensor into DenseTensor
      
      * Fixed namespace issues,test=allcases
      
      * Fixed merge issues
      
      * Fixed inference issues
      
      * Fixed NPU test issues
      
      * Fixed merge issues
      2052f1e3
  29. 17 1月, 2022 1 次提交
  30. 15 1月, 2022 1 次提交
  31. 13 1月, 2022 1 次提交
    • J
      Added mul BF16/FP32 FWD/BWD oneDNN kernel (#38552) · fc6eed5b
      jakpiase 提交于
      * base changes for mul reimplementation
      
      * empty commit
      
      * tmp save
      
      * full implementation of mul bf16/fp32 fwd bwd
      
      * CI fix
      
      * CI rerun
      
      * changed unity build cmake to avoid gpu issues
      
      * removed mul mkldnn from unity build
      
      * added skipping tests if not cpu_bf16
      
      * CI fix
      
      * CI fix
      
      * CI fix
      fc6eed5b
  32. 12 1月, 2022 1 次提交
  33. 06 1月, 2022 1 次提交