1. 06 Feb, 2023 2 commits
  2. 04 Feb, 2023 1 commit
  3. 03 Feb, 2023 8 commits
    • Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645
      Sławomir Siwek committed
      * replace matmul with matmul_v2 in fuse passes
      
      * Remove fusion logic from matmul
      
      * removing fusion methods
      
      * add proper name
      
      * adjust namespaces
      
      * clean attrs in python tests
      
      * delete checkpoint and restore matmul version
      
      * remove unused code
      
      * matmul and reshape/transpose fuses migrated
      
      * split MatmulOneDNN headers
      
      * fuse activation and eltwise_add
      
      * add fuse_activation
      
      * matmul_transpose_reshape/reshape_transpose_matmul
      
      * matmul + elementwise_add (fused)
      
      * activation temporary modification
      
      * merge newest develop
      
      * remove dependency from other PR
      
      * revert pbtxt
      
      * remove placeholders from matmul_v2
      
      * add description in OPMaker
      
      * remove matmul_v2_op.h and all dependencies
      
      * remove dims changing in base op
      
      * add possibility to fuse already fused_matmul
      
      * restart broken CI
      
      * Empty-Commit
      
      * revert matmul_utils.h
      
      * codestyle
      
      * adjust imports
      
      * add pbtxt file
      
      * 100% matmul unit tests coverage
      
      * trigger CI with minimal changes to develop
      
      * adjust changes to develop
      
      * add fused_matmul op
      
      * inherit base ops
      
      * add "v2"
      
      * move OPMaker
      
      * Gradually add fused_matmul files
      
      * second batch of fused_matmul changes
      
      * split infershapes of matmul_v2 and fused_matmul
      
      * inherit fused_matmul from matmul_v2
      
      * Update paddle/phi/backends/onednn/onednn_reuse.h
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
      
      ---------
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
    • Rewrite conv testers from cpp to python (#49582) · aa8cef4a
      Paulina Gacek committed
      * conv_bias_mkldnn_fuse_pass_tester rewritten
      
      * conv_concat_relu_mkldnn_fuse_pass_tester rewritten
      
      * conv_elementwise_add_fuse_pass_tester rewritten
      
      * mkldnn changed to onednn
      
      * tests added to cmakeLists, style fix
      
      * got rid of unnecessary UT, some style changes
      
      * changes in naming convention
      
      * max_examples reduced
      
      * time out added
    • Fix div 0 error of case20: paddle.min (#50013) · 50c43dd3
      RedContritio committed
    • Generate some static graph ops (#49906) · 85490f70
      HappyHeavyRain committed
      * generate some static graph ops
      
      * fix the bug of pow
      
      * add REGISTER_ACTIVATION_OP in operators.cmake
      
      * modify the file operators.cmake
    • Fused attention pass backward op replace. (#50186) · 7e8ef328
      Yuang Liu committed
    • Reduce time cost of BuildOpHappensBefore (#50137) · 6b151c0e
      Ruibiao Chen committed
      * Reduce time cost of BuildOpHappensBefore
      
      * Update code
      
      * Update code
      
      * Improve data struct
    • 【Prim】optimize log (#50160) · 80310541
      Jiabin Yang committed
      * optimize log
      
      * fix type error
      
      * fix type error2
    • 【Prim】Blacklist bwd comp (#50148) · cc8a7858
      Jiabin Yang committed
      * refactor dir for prim
      
      * support blacklist for bwd comp
      
      * fix type error
      
      * remove additional file
      
      * fix git ignore
      
      * add more test
      
      * merge develop
  4. 02 Feb, 2023 3 commits
  5. 01 Feb, 2023 7 commits
  6. 31 Jan, 2023 15 commits
  7. 30 Jan, 2023 4 commits
    • ac84dce9
    • add phi tensor vector array api from fluid (#49885) · 094e3b8c
      engineer1109 committed
      replace all TensorFromVector & TensorToVector
      
      AssignKernel async copy
    • Support stream priority for standalone executor (#49939) · 172d1de6
      Ruibiao Chen committed
      * Support stream priority for standalone executor
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
    • [Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e
      zmxdream committed
      * add set slot_num for psgpuwrapper (#177)
      
      * add set slot_num_for_pull_feature for psgpuwrapper
      
      * Add get_epoch_finish python interface (#182)
      
      * add get_epoch_finish interface
      
      * add return
      
      * delete return
      
      * add unzip op (#183)
      
      * fix miss key for error dataset (#186)
      
      * fix miss key for error dataset
      
      * fix miss key for error dataset
      Co-authored-by: yangjunchao <yangjunchao@baidu.com>
      
      * add excluded_train_pair and infer_node_type (#187)
      
      * support return of degree (#188)
      
      * fix task stuck in barrier (#189)
      Co-authored-by: yangjunchao <yangjunchao@baidu.com>
      
      * check node/feature format when loading (#190)
      
      * check node&feature format when loading
      
      * check node&feature format when loading (2)
      
      * degrade log (#191)
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]replace LodTensor with phi::DenseTensor
      
      * [PGLBOX]fix gpu_primitives.h include path
      
      * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * fix code style
      
      * fix code style
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip
      
      * fix code style
      
      * add ut
      
      * add c++ ut & fix train_mode_ set
      
      * fix load into memory
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix code style
      
      * fix collective
      
      * fix unzip_op.cc
      
      * fix barrier
      
      * fix code style
      
      * fix barrier
      
      * fix barrier
      
      * fix code style
      
      * fix unzip
      
      * add unzip.py
      
      * add unzip.py
      
      * fix unzip.py
      
      ---------
      Co-authored-by: chao9527 <33347532+chao9527@users.noreply.github.com>
      Co-authored-by: Siming Dai <908660116@qq.com>
      Co-authored-by: huwei02 <53012141+huwei02@users.noreply.github.com>
      Co-authored-by: yangjunchao <yangjunchao@baidu.com>