1. 03 2月, 2023 6 次提交
    • S
      Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645
      Sławomir Siwek 提交于
      * replace matmul with matmul_v2 in fuse passes
      
      * Remove fusion logic from matmul
      
      * removing fusion methods
      
      * add proper name
      
      * adjust namespaces
      
      * clean attrs in python tests
      
      * delete checkpoint and restore matmul version
      
      * remove unused code
      
      * matmul and reshape/transpose fuses migrated
      
      * split MatmulOneDNN headers
      
      * fuse activation and eltwise_add
      
      * add fuse_activation
      
      * matmul_transpose_reshape/reshape_transpose_matmul
      
      * matmul + elementwise_add (fused)
      
      * activation temporary modifciation
      
      * merge newest develop
      
      * remove depedency from other PR
      
      * revert pbtxt
      
      * remove placeholders from matmul_v2
      
      * add description in OPMaker
      
      * remove matmul_v2_op.h and all depedencies
      
      * remove dims changing in base op
      
      * add possibility to fuse already fused_matmul
      
      * restart broken CI
      
      * Empty-Commit
      
      * revert matmul_utils.h
      
      * codestyle
      
      * adjust imports
      
      * add pbtxt file
      
      * 100% matmul unit tests coverage
      
      * trigger CI with minimal changes to develop
      
      * adjust changes to develop
      
      * add fused_matmul op
      
      * inherit base ops
      
      * add "v2"
      
      * move OPMaker
      
      * Gradually add fused_matmul files
      
      * second batch of fused_matmul changes
      
      * split infershapes of matmul_v2 and fused_matmul
      
      * inherit fused_matmul from matmul_v2
      
      * Update paddle/phi/backends/onednn/onednn_reuse.h
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      ---------
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      5cfe1645
    • R
      Fix div 0 error of case20: paddle.min (#50013) · 50c43dd3
      RedContritio 提交于
      50c43dd3
    • HappyHeavyRain's avatar
      Generate some static graph ops (#49906) · 85490f70
      HappyHeavyRain 提交于
      * generate some static graph ops
      
      * fix the bug of pow
      
      * add REGISTER_ACTIVATION_OP in operators.cmake
      
      * modify the file operators.cmake
      85490f70
    • Y
      Fused attention pass backward op replace. (#50186) · 7e8ef328
      Yuang Liu 提交于
      7e8ef328
    • J
      【Prim】optimize log (#50160) · 80310541
      Jiabin Yang 提交于
      * optimize log
      
      * fix type error
      
      * fix type error2
      80310541
    • J
      【Prim】Blacklist bwd comp (#50148) · cc8a7858
      Jiabin Yang 提交于
      * refactor dir for prim
      
      * support blacklist for bwd comp
      
      * fix type error
      
      * remove additional file
      
      * fix git ignore
      
      * add more test
      
      * merge develop
      cc8a7858
  2. 02 2月, 2023 1 次提交
  3. 01 2月, 2023 4 次提交
  4. 31 1月, 2023 2 次提交
    • C
      Integrate static code gen info (#49858) · 0e51f398
      Charles-hit 提交于
      * polish static grad op maker gen
      
      * fix some bugs
      
      * fix static code gen
      
      * solve conflict
      
      * modify composite grad maker name
      
      * integrate phi and fluid info in static code gen
      
      * rename some composite maker
      
      * modify static code gen format
      0e51f398
    • T
      support inplaced variable in cinn_launch (#49912) · 754ab705
      TeFeng Chen 提交于
      * support inplaced variable in cinn_launch
      
      * fix error hint when compiling
      
      * fix inplaced output variable of the subgraph
      
      * skip CinnCompiler check
      
      * using existed definition
      
      * fix namespace reference error
      
      * modify error message
      
      * update cinn tage
      
      * fix namespace
      
      * skip enforce check
      
      * fix unittest attribute throw
      754ab705
  5. 30 1月, 2023 1 次提交
    • Z
      [Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e
      zmxdream 提交于
      * add set slot_num for psgpuwraper (#177)
      
      * add set slot_num_for_pull_feature for psgpuwarper
      
      * Add get_epoch_finish python interface (#182)
      
      * add get_epoch_finish interface
      
      * add return
      
      * delete return
      
      * add unzip op (#183)
      
      * fix miss key for error dataset (#186)
      
      * fix miss key for error dataset
      
      * fix miss key for error dataset
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      
      * add excluded_train_pair and infer_node_type (#187)
      
      * support return of degree (#188)
      
      * fix task stuck in barrier (#189)
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      
      * check node/feature format when loading (#190)
      
      * check node&feature format when loading
      
      * check node&feature format when loading (2£ (2)
      
      * degrade log (#191)
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]replace LodTensor with phi::DenseTensor
      
      * [PGLBOX]fix gpu_primitives.h include path
      
      * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * fix code style
      
      * fix code style
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip
      
      * fix code stype
      
      * add ut
      
      * add c++ ut & fix train_mode_ set
      
      * fix load into memory
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix code style
      
      * fix collective
      
      * fix unzip_op.cc
      
      * fix barrier
      
      * fix code style
      
      * fix barrier
      
      * fix barrier
      
      * fix code styple
      
      * fix unzip
      
      * add unzip.py
      
      * add unzip.py
      
      * fix unzip.py
      
      ---------
      Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com>
      Co-authored-by: NSiming Dai <908660116@qq.com>
      Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      cb525d4e
  6. 29 1月, 2023 1 次提交
  7. 19 1月, 2023 1 次提交
  8. 18 1月, 2023 2 次提交
  9. 17 1月, 2023 4 次提交
    • J
      Add more dy2st ut2 (#49881) · 2242136a
      Jiabin Yang 提交于
      * add test for composite with dy2st
      
      * add more log
      2242136a
    • Y
      [PHI]Change feed_op to phi kernel (#49116) · f7f1dc03
      YuanRisheng 提交于
      * change feed_op to phi kernel
      
      * fix ci bugs
      
      * fix build bugs
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * perfect code
      
      * perfect comment code
      
      * fix install bugs
      
      * modify code according comment
      
      * remove visitor in feed_op
      
      * modify according comment
      
      * perfect code according comment
      
      * add infershape
      
      * fix py3 bugs
      
      * fix getexpected kernel type
      
      * fix getexpected kernel type
      
      * fix ci bugs
      
      * add registry for custom device
      
      * fix py3 bugs
      
      * fix floating point error
      
      * fix py3 test bugs
      f7f1dc03
    • W
      [Dy2St]Support call backward() without params in dy2st (#49812) · 2f24b2d8
      WangZhen 提交于
      * Support call backward() without params in dy2st
      2f24b2d8
    • X
      【Prim】Add multiply,expand,div vjp rules (#49831) · 39c6765a
      Xiaoxu Chen 提交于
      * support elementwise base func
      
      * fix compiling error and add test
      
      * support vjp for div using comp
      
      * remove additional change
      
      * fix dy2st error with magic num
      
      * fix dy magic num
      
      * another magic
      
      * another magic
      
      * another magic
      
      * add skip rename strategy
      
      * support add vjp
      
      * support add with new axis cal
      
      * support sub vjp
      
      * [prim] add multiply vjp rules
      
      * [prim] add multiply vjp rules
      
      * [prim] fix no infershape with composite in _append_backward_ops
      
      * [prim] add expand vjp rule
      
      * [prim] add exp vjp rule
      
      * uncomment infer shape for reshape/sum static prim api
      
      * [prim] fix tanh nullptr error
      
      * remove some print message
      
      * fix magic number in run_program relative tests @JiaBinYang
      
      * [prim] add expand,multiply,exp vjp rules
      
      * fix only support single direction reduce error
      
      * infer reduce dims using out dims
      Co-authored-by: NJiabinYang <360788950@qq.com>
      39c6765a
  10. 16 1月, 2023 5 次提交
  11. 13 1月, 2023 5 次提交
  12. 12 1月, 2023 1 次提交
  13. 11 1月, 2023 2 次提交
  14. 10 1月, 2023 4 次提交
    • W
      Use `CommContextManager` to init comm op using gloo backend (#49666) · 05df6973
      Wen Sun 提交于
      * refactor: gloo comm context migration
      
      * fix: headers & avoid mutable_data usage
      
      * fix: cmake gloo dep
      
      * style: rename funcs
      
      * refactor: move to new files
      
      * fix: gloo deps
      
      * refactor: simplify create device
      05df6973
    • R
      [PHI Decoupling] move sequence_scale from fluid to phi (#49668) · a36c5490
      Ryan 提交于
      * try sequence_padding
      
      * fix cant use mutable_data
      
      * fix mistake fluid_sequence_scale.hh/CMakeLists.t include
      
      * fix namespace bug
      
      * fix framework::ToAbsOffset not found
      
      * fix codestyle
      a36c5490
    • J
      Fix prim paddle c make (#49643) · a227ae2b
      Jiabin Yang 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      
      * remove duplicated cmake
      
      * fix cmake and codegen problem
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
      a227ae2b
    • MarDino's avatar
      Refine name style and MoeKernel (#49432) · 39210ed0
      MarDino 提交于
      39210ed0
  15. 09 1月, 2023 1 次提交
    • W
      Preln groupnorm (#49463) · 591be3bd
      wenbin 提交于
      * skip_groupnorm
      
      * init
      
      * preln
      
      * add ut
      
      * more assert
      
      * set timeout
      
      * fix windows ci issue
      591be3bd