1. 08 2月, 2023 1 次提交
  2. 07 2月, 2023 1 次提交
  3. 03 2月, 2023 2 次提交
    • S
      Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645
      Sławomir Siwek 提交于
      * replace matmul with matmul_v2 in fuse passes
      
      * Remove fusion logic from matmul
      
      * removing fusion methods
      
      * add proper name
      
      * adjust namespaces
      
      * clean attrs in python tests
      
      * delete checkpoint and restore matmul version
      
      * remove unused code
      
      * matmul and reshape/transpose fuses migrated
      
      * split MatmulOneDNN headers
      
      * fuse activation and eltwise_add
      
      * add fuse_activation
      
      * matmul_transpose_reshape/reshape_transpose_matmul
      
      * matmul + elementwise_add (fused)
      
      * activation temporary modifciation
      
      * merge newest develop
      
      * remove depedency from other PR
      
      * revert pbtxt
      
      * remove placeholders from matmul_v2
      
      * add description in OPMaker
      
      * remove matmul_v2_op.h and all depedencies
      
      * remove dims changing in base op
      
      * add possibility to fuse already fused_matmul
      
      * restart broken CI
      
      * Empty-Commit
      
      * revert matmul_utils.h
      
      * codestyle
      
      * adjust imports
      
      * add pbtxt file
      
      * 100% matmul unit tests coverage
      
      * trigger CI with minimal changes to develop
      
      * adjust changes to develop
      
      * add fused_matmul op
      
      * inherit base ops
      
      * add "v2"
      
      * move OPMaker
      
      * Gradually add fused_matmul files
      
      * second batch of fused_matmul changes
      
      * split infershapes of matmul_v2 and fused_matmul
      
      * inherit fused_matmul from matmul_v2
      
      * Update paddle/phi/backends/onednn/onednn_reuse.h
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      ---------
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      5cfe1645
    • P
      Rewrite conv testers from cpp to python (#49582) · aa8cef4a
      Paulina Gacek 提交于
      * conv_bias_mkldnn_fuse_pass_tester rewritten
      
      * conv_concat_relu_mkldnn_fuse_pass_tester rewritten
      
      * conv_elementwise_add_fuse_pass_tester rewritten
      
      * mkldnn changed to onednn
      
      * tests added to cmakeLists, style fix
      
      * got rid of unnecessary UT, some style changes
      
      * changes in naming convention
      
      * max_examples reduced
      
      * time out added
      aa8cef4a
  4. 01 2月, 2023 2 次提交
    • Z
      remove fluid.initializer.UniformInitializer, ConstantInitializer,... · 6edc7bba
      zqw_1997 提交于
      remove fluid.initializer.UniformInitializer, ConstantInitializer, NormalInitializer, TruncatedNormalInitializer, XavierInitializer, BilinearInitializer, MSRAInitializer, NumpyArrayInitializer and calculate_gain.. (#49498)
      
      * move UniformInitializer and ConstantInitializer
      
      * more modify
      
      * circular import resolved
      
      * another circular import resolved?
      
      * more circular import 2
      
      * circular import 3
      
      * change import paddle in metric.py
      
      * BuildStrategy import from fluid
      
      * modify the framework import path in common.py
      
      * change rnn.py import, from static to original framework
      
      * change import static in the nn folder
      
      * default_main_program should import from common_ops_import
      
      * add import paddle in param_attr.py
      
      * use core not paddle module for using VarDesc
      
      * another old uniform
      
      * mistake that use Uniform instead of UniformInitializer
      
      * modify UniformInitializer doc
      
      * move fluid.NormalInitializer to nn.initializer.NormalInitializer
      
      * remove import of Normal in fluid.layers.nn.py
      
      * remove more import of old Normal
      
      * remove more import of old Normal
      
      * sample code modify and tests modify import
      
      * is_listen_failed passing arg should be log file
      
      * problem solved
      
      * a mistake solved
      
      * comments resoleved and remove paddle.fluid.initializer.TruncatedNormalInitializer
      
      * remove paddle.fluid.initializer.XavierInitializer and paddle.fluid.initializer.MSRAInitializer
      
      * remove paddle.fluid.initializer.BilinearInitializer NumpyArrayInitializer and set_global_initializer
      
      * change fluid to static
      
      * change static to fluid to avoid circular import in distributed_strategy.py
      
      * fix example code and test_initializer
      
      * ValueType
      
      * sample code fix
      
      * change set_global_initializer back to fluid
      
      * put paddle.static.BuildStrategy.ReduceStrategy into the fuction to avoid circular import
      
      * remove calculate_gain, delete BilinearInitializer and revert set_global_initializer
      
      * change the time of using UniformInitializer, ConstantInitializer, NormalInitializer, TruncatedNormalInitializer, XavierInitializer, MSRAInitializer, NumpyArrayInitializer as few as possible
      
      * fix argument incampatible
      
      * fix more arg incompatible
      
      * fix test_prelu_op_xpu.py Constant
      
      * fix inaccurate doc
      
      * more doc fix: default value
      6edc7bba
    • W
      Preln fix (#49802) · e03718f5
      Wang Bojun 提交于
      * preln_residual 2 fused_bias_residual
      
      * skip layernorm fix and ut
      
      * code refine
      
      * code style refine
      
      * fix ut
      
      * fix output
      
      * add trt layer fall back info
      
      * refine op teller and ut
      
      * DropoutMaskOut output fix
      e03718f5
  5. 31 1月, 2023 4 次提交
  6. 30 1月, 2023 1 次提交
  7. 18 1月, 2023 1 次提交
  8. 17 1月, 2023 1 次提交
  9. 16 1月, 2023 1 次提交
  10. 13 1月, 2023 2 次提交
    • W
      add oss flash fmha and fmhca support (#49438) · a48b8e2c
      Wang Bojun 提交于
      * add fmha_flashattention oss plugin
      a48b8e2c
    • Z
      [inference][trt]set output data type of trt network (#49712) · 690d7a69
      Zhang Jun 提交于
      * update trt engine to set in/out data type
      
      * update
      
      * Update engine.cc
      
      * Update engine.cc
      
      * update
      
      * set engine output type before freeze the network
      
      * update
      
      * update trt autoscan ut
      
      * update
      
      * update ut
      
      * fix equal bug, update ut
      
      * fix cast and equal ut
      
      * update cast ut using TRT < 8.4
      
      * set datatype from scope
      
      * check output var is nullptr
      
      * Update op_converter.h
      
      * update tensorrt_engine_op_test ut
      
      * update
      690d7a69
  11. 12 1月, 2023 1 次提交
  12. 10 1月, 2023 2 次提交
  13. 09 1月, 2023 2 次提交
  14. 05 1月, 2023 3 次提交
    • Z
      [inference][trt]Upgrade expand cast nearestinterp for sd (#48998) · 5defefd6
      Zhang Jun 提交于
      * update nearest_interp, expand_v2, cast for stable diffusion
      
      * update nearest_interp, expand_v2, cast for stable diffusion
      
      * correct shape rank
      
      * Update expand_v2_op.cc
      5defefd6
    • Yj/rm core ops exp (#49490) · 70ea88bf
      姜永久 提交于
      * rm op_function_generator
      
      * rm op_func_generator.h
      
      * rm op_function
      
      * modify cmake
      
      * rm op_function.h
      
      * rm check for op_function_generator.cc
      
      * reset imperative
      
      * rm python part
      
      * fix imperative
      
      * lint
      
      * lint
      
      * modify legacy_c
      
      * review
      
      * modify
      
      * modify legacy
      
      * rm gen op_functions code
      
      * reset framework
      
      * rm core.ops for test
      
      * core.ops->core.eager.ops.legacy
      
      * not raiseError for xpu
      70ea88bf
    • W
      [Inference] inplace all reshape op (#49146) · 017af746
      Wilber 提交于
      017af746
  15. 04 1月, 2023 1 次提交
  16. 03 1月, 2023 5 次提交
  17. 02 1月, 2023 1 次提交
  18. 30 12月, 2022 1 次提交
  19. 21 12月, 2022 2 次提交
    • Z
      remove paddle.fluid.layers.layer_norm (#49174) · b333d7ed
      zqw_1997 提交于
      * remove paddle.fluid.layers.layer_norm
      
      * templatedoc import from paddle.fluid.layers.layer_function_generator
      
      * del import of fluid.layers.layer_norm in __init__.py
      
      * add import of ..common.layer_norm in __init__.py
      
      * fix bug in UT
      
      * fix doc
      b333d7ed
    • Z
      Refactor Pass for fused_conv (#48848) · 7f0eb2e3
      zyfncg 提交于
      * refactor conv_activation_mkldnn_fuse_pass
      
      * refactor conv_affine_channel_mkldnn_fuse_pass
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix mkldnn unittest
      
      * refactor int8_scale_calculation_mkldnn_pass and params_quantization_mkldnn_pass
      
      * refactor conv_elementwise_add_mkldnn_fuse_pass
      
      * fix quant
      
      * refactor conv_bn_fuse_pass
      
      * fix conv_bn_fuse_pass
      
      * refactor depthwise_conv_bn_fuse_pass
      
      * fix unittest
      
      * fix conv_bn_fuse_pass
      
      * remove redundant conv2d in params_quantization_mkldnn_pass
      
      * fix params_quantization_mkldnn_pass_tester
      7f0eb2e3
  20. 20 12月, 2022 1 次提交
  21. 19 12月, 2022 2 次提交
  22. 15 12月, 2022 3 次提交
    • H
    • [FluidAPI] remove fluid rnn apis (#49050) · 4672ea8e
      骑马小猫 提交于
      * remove lstm api
      
      * remove gru_unit api
      
      * remove lstm in all
      
      * remove beam-search
      
      * remove beam_search slot
      
      * remove lstm test code
      
      * remove fluid.layers.nn api
      
      * update gru-unit
      
      * revert gru_unit white list
      4672ea8e
    • H
      Remove/move 16 fluid APIs (#48377) · e577040e
      HydrogenSulfate 提交于
      * remove density_prior_box
      
      * remove anchor_generator
      
      * remove roi_perspective_transform
      
      * remove generate_proposal_labels
      
      * remove generate_mask_labels
      
      * remove generate_proposals
      
      * remove box_clip
      
      * remove retinanet_detection_output
      
      * remove multiclass_nms
      
      * remove locality_aware_nms
      
      * remove matrix_nms
      
      * remove distribute_fpn_proposals
      
      * remove box_decoder_and_assign
      
      * remove collect_fpn_proposals
      
      * remove 2 trt files
      
      * move prior_box to static/nn/common.py
      
      * move multi_box_head to static/nn/common.py
      
      * fix for CI/CE
      
      * remove retinanet_detection_output
      
      * restore compile_vs_runtime_white_list.py
      
      * restore test_retinanet_detection_output to white list
      
      * replace nn.flatten by paddle.flatten, and fix doc for retinanet_target_assign
      
      * add enable_static in demo and fix bug
      
      * remove roi_perspective_transform in test_layers
      
      * remove multi_box_head
      
      * change self.multiclass_nms to _legacy_C_ops.multiclass_nms
      
      * empty commit
      
      * empty commit
      
      * check code style
      
      * fix prior_box
      
      * fix CI
      
      * remove redundant prior_box in detection.py
      
      * fix docs
      
      * remove detection
      
      * fix prior_box en doc
      
      * delete prior_box in common
      
      * remote proir_box from __init__.py
      e577040e