1. 05 12月, 2022 3 次提交
  2. 04 12月, 2022 1 次提交
  3. 03 12月, 2022 1 次提交
  4. 02 12月, 2022 8 次提交
    • Y
    • P
      [PHI] Migrate elementwise_sub kernel (#48611) · 493825a5
      Piotr Paturej 提交于
      * Add migrations
      
      * Fix build errors
      
      * Remove elementwise_mul from migration
      493825a5
    • H
      Migrate mul_mkldnn_op to phi matmul_kernel (#48299) · e8edbb09
      Hulek 提交于
      * Migrate mul_mkldnn_op to matmul_kernel
      
      * Review fixes - changed mutable_data, changed ctx to dev_ctx, fixed namespaces
      
      * switched some funcs to phi
      
      * Deleted not needed phi:: and changed place checking according to standards
      e8edbb09
    • S
      Fix fuse_gemm_epilogue (#47805) · 6efc2888
      Shijie 提交于
      * Fix fuse_gemm_epilogue
      
      * update tests
      
      * Update CMakeLists.txt
      
      * Update CMakeLists.txt
      
      * Update CMakeLists.txt
      
      * fix random seed
      
      * use assert_allclose
      
      * Update test_dist_fuse_gemm_epilogue_pass.py
      
      * Update cpp_pass.py
      
      * Update test_dist_fuse_gemm_epilogue_pass.py
      
      * fix codestyle
      
      * update seed and atol
      6efc2888
    • G
      add some compare and logical trt converter (#48592) · 4c38b87e
      gem5 提交于
      4c38b87e
    • J
      [Eager] Optimize Grad by prune useless branch (#47827) · d1e93be1
      Jiabin Yang 提交于
      * [Eager] Fix paddle.grad interface
      
      * [Eager] Support minimum SubGraph for GeneralGrad
      
      * Add needed_nodes to prune grad graph more thoroughly
      
      * [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode
      
      * [Eager] Fix paddle.grad interface
      
      * Polish code
      
      * remove potential_stop_node
      
      * Add endding_nodes to enhance genSugraph logic
      
      * clear endding_nodes_
      
      * polish code
      
      * rename endding_nodes to endding_nades_
      
      * Refactor grad interface
      
      * Add register_hook case to fix coverage-ci
      
      * Fix code format
      
      * Refactor general_grad
      
      * Add more code comments
      
      * call clear directly to release GradSlotMeta
      
      * fix a mistake
      
      * fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on.
      
      * fix batch_norm_double_grad yaml optional config
      
      * fix tanh_triple_grad yaml and kernels
      
      * fix MultiplyTripleGradKernel optional logic
      
      * fix merge mistake
      
      * fix compile error
      
      * remove legacy attr for bn
      
      * polish code
      
      * fix some kernel
      
      * merge develop
      
      * fix error
      
      * remote log
      
      * fix kernel with full like
      
      * hide value log behind
      
      * hide value log behind
      
      * fix matmul_triple grad
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      d1e93be1
    • Y
      add silu, silu_grad, unfold and unfold_grad xpu kernels (#48325) · f71de378
      ykkk2333 提交于
      * add stat tool
      
      * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
      
      * add silu, unfold and their grads,test=kunlun
      f71de378
    • I
      fix boardcasting superlink (#48434) · c34812ac
      Infinity_lee 提交于
      * fix boardcasting superlink
      
      * Update bitwise_op.cc
      
      * fix typo errors(from 48186)
      
      * Update python/paddle/distribution/uniform.py
      Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
      
      * Update math.py
      
      * Update math.py
      
      * refix
      
      * Update logic.py
      
      * BaseTransform api doc; test=docs_preview
      
      * Update python/paddle/vision/transforms/transforms.py
      
      * for text block; test=docs_preview
      
      * Update transforms.py
      Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
      c34812ac
  5. 01 12月, 2022 9 次提交
  6. 30 11月, 2022 13 次提交
  7. 29 11月, 2022 5 次提交
    • L
      fix mma_tensorcore (#48386) · bf4d1792
      lzy 提交于
      * fix mma_tensorcore (__CUDA_ARCH__)
      
      * disable tensorcore by default.
      
      disable tensorcore by default, because the judgment of __CUDA_ARCH__ will cause undefined behavior in some environments, can manually enable it on a machine that supports tensorcore.
      bf4d1792
    • H
      rename use_cudnn to use_gpudnn in phi (#48443) · 41f15537
      HongyuJia 提交于
      41f15537
    • P
      [PHI] traspose2 kernel migration (#47748) · d86aa4ca
      Paulina Gacek 提交于
      * traspose2 kernel migrated
      
      * Got rid of mutable_data
      
      * x modification added
      
      * ops added in extra info file
      
      * Formatting fix
      
      * 2 fuse passes with tanpose2 commented
      
      * nr of outs changed in 2 passes, passes uncommented
      
      * Changes in passes reverted
      
      * transpose chnaged in operator.cc
      
      * MKLDNN check in operator.cc
      
      * Transpose fixes
      
      * Fix deleted from operato
      
      * template corrected
      Co-authored-by: NPaulina Gacek <paulinagacek@intel.com>
      d86aa4ca
    • Replace LoDTensor with phi::DenseTensor in fluid\operators (#48417) · 91dd8a2e
      张春乔 提交于
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * replace LoDTensor with phi::DenseTensor in fluid\operators
      
      * Update split_lod_tensor_op.cc
      
      * Update warpctc_op.cc
      
      * Update broadcast_tensors_op.cc
      
      * Update crf_decoding_op.cc
      
      * Update lstm_op.cc
      
      * Update lstm_op.cc
      
      * Update lod_reset_op.cc
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * resume 2 files for confilct
      
      * Update gru_op.cc
      
      * Update linear_chain_crf_op.cc
      
      * Update lstm_op.cc
      91dd8a2e
    • N
      [CodeStyle][isort] introduce isort (part4) (#48402) · f85def97
      Nyakku Shigure 提交于
      * isort all files
      
      * revert conflicting files
      
      * revert conflicting files
      
      * revert conflicting files
      f85def97