1. 28 11月, 2022 4 次提交
  2. 26 11月, 2022 1 次提交
  3. 25 11月, 2022 2 次提交
    • C
      [PROFILER] add flops for Profiler (#47766) · 3d1981ad
      Chitsing KUI 提交于
      * attr ready
      
      * op ip ready
      
      * start dynamic
      
      * end2end ok
      
      * input shape to map, stat by op
      
      * layer wip
      
      * first version ready
      
      * fix proto depds
      
      * fix profiler deps
      
      * fix flops typo, rm tuple shape
      3d1981ad
    • R
      Refactor stream anayzer (#48158) · 889318d8
      Ruibiao Chen 提交于
      * Move stream_anayzer to interpreter
      
      * Refactor StreamAnalyzer
      
      * Refactor RunNextInstructionList
      
      * Remove no_data_transform_index
      
      * Fix typos
      
      * Fix data_transfer OpFuncType error
      
      * Add event for depend_op
      
      * Update transfer OpFuncType for heter place
      889318d8
  4. 24 11月, 2022 3 次提交
  5. 23 11月, 2022 3 次提交
  6. 22 11月, 2022 2 次提交
  7. 21 11月, 2022 3 次提交
  8. 18 11月, 2022 2 次提交
  9. 17 11月, 2022 3 次提交
  10. 16 11月, 2022 3 次提交
  11. 15 11月, 2022 2 次提交
    • J
      Added optimization pass for oneDNN layernorm kernel (#47782) · 519e7426
      jakpiase 提交于
      * optimization for ln
      
      * fix
      
      * added output to gpd
      
      * added formatting
      
      * fix
      519e7426
    • S
      mkldnn directory cleanup (#47779) · 8a339d24
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      8a339d24
  12. 14 11月, 2022 2 次提交
  13. 11 11月, 2022 3 次提交
  14. 10 11月, 2022 4 次提交
  15. 09 11月, 2022 2 次提交
    • J
      Final changes to introduce mem_desc to be hold in Tensor (#46768) · 14f261ad
      Jacek Czaja 提交于
      * first commit
      
      - more fixes
      
      - compilation fix
      
      - compilation fix
      
      - fix
      
      - another fix
      
      - yet another fix
      
      - Fix
      
      - fix to fused ops
      
      - compilation fix
      
      - compilation fix
      
      - another compilation fix
      
      - another fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix
      
      - yet another fix
      
      - fix
      
      - fix
      
      - cosmetic fix
      
      :- lint
      
      - Revert some changes (to be brought back later)
      
      - fix to build
      
      - Added prototype of slice
      
      - fix
      
      compilation fix
      
      - compilation fix
      
      - fix
      
      - fix
      
      - Fix
      
      - fix
      
       fix
      	modified:   cmake/flags.cmake
      
      * lint
      
      * rerun of CI
      
      * - Fix
      
      * - lint
      
      * - lint2
      14f261ad
    • C
      [PHI decoupling] Move fluid op generator into fluid (#47714) · f369b2b1
      Chen Weihang 提交于
      * move fluid op generator into fluid
      
      * remove parsed op
      
      * resolve sig undef error
      
      * append python interp find logic
      
      * remove dup code
      f369b2b1
  16. 08 11月, 2022 1 次提交
    • S
      Migrate old C++ unit tests to Python framework (#47006) · 0c9f09b8
      Sławomir Siwek 提交于
      * softplus+activation
      
      * fc + elementwise_add test refactored
      
      * rename MKLDNN to OneDNN
      
      * fc+activation tests refactored
      
      * remove softplus ut
      
      * whitespace
      
      * whitespace
      
      * codestyle
      
      * codestyle
      
      * add more cases to fc+act
      
      * remove softplus+hard_sigmoid pass
      
      * remove softplus + hard_sigmoid UT
      
      * add approximate for gelu
      
      * swish beta range
      
      * new codestyle
      
      * reduce number of tests
      0c9f09b8