1. 15 Mar, 2022 4 commits
  2. 14 Mar, 2022 12 commits
    • [Phi]Add diag_v2 grad kernel (#40447) · e157f2af
      Siming Dai committed
      * Add diag grad kernel
      
      * fix unittest case
      
      * add float16, remove const &
      
      * delete diag_grad in op_utils.h
    • Add an elementwise + activation fusion pass. (#36541) · 3f219160
      Tomasz Socha committed
      * Add elementwise add and activation fuse pass
      
      * Fix copy elision
      
      * More flexible pattern detector
      
      * More flexible fusion pass
      
      * Update lists for pass
      
      * Add support for Pow operator
      
      * Add support for more activation types
      
      * Style
      
      * Rename fusion pass
      
      * First version of tests
      
      * Dirty version of pass
      
      * Polished version
      
      * Update pbtxt
      
      * Style
      
      * Update names
      
      * Style
      
      * Use PADDLE_ENFORCE_EQ
      
      * Save error message to variable
      
      * WO for error checks
      
      * CR
      
      * Static style check
      
      * Add missing 'activation_scale' attribute
      
      * Add relu6 and sigmoid activations
      
      * Style
      
      * Fix fuse list formatting
      
      * Sync filenames for fuse pass files
      
      * Fix cmake after move
      
      * Fix registration
      
      * Fix pass name in tests
      
      * Add missing activations to checker
      
      * WIPS
      
      * Working mul op
      
      * Working sub
      
      * Working Add
      
      * Remove pten includes
      
      * Remove some forward declarations
      
      * Remove Includes
      
      * Fixes
      
      * Remove default kernels
      
      * Add check if post_ops attributes are available
      
      * Style
      
      * Code adjustment
      
      * Register default kernels
      
      * We have year 2022 not 2021...
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Fast review fixes
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Review Fix
      
      * Rename one_dnn -> onednn
      
      * Style after review
      
      * Fast and dirty fix for quantization
      
      * Update tests
      
      * Style
      
      * Fix mkldnn_quantizer config
      
      * Add Joanna's suggestion.
      
      * Check if operator is explicitly disabled on OneDNN
      
      * Try to use unregistered attributes
      
      * Style
      
      * Test new framework
      
      * FXI
      
      * FXII
      
      * Update test
      
      * Style
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
    • [MLU] add merged_momentum mlu kernel (#40406) · 1f7b2516
      fwenguang committed
    • Support custom op and paddle.autograd.backward in eager (#40423) · 227fa408
      Jiabin Yang committed
      * eager, test=develop
      
      * fix bug, test=develop
      
      * eager, test=develop
      
      * merge legacy to fluid
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Refactor TensorAdd func by template and remove gradient_accumulation in eager
      
      * Remove needless target name
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Use overload instead of template
      
      * Remove legacy code
      
      * Remove legacy code
      
      * selectedrows, test=develop
      
      * Remove DataType test
      
      * eager, test=develop
      
      * eager, test=develop
      
      * support gan, test=develop
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * ptb, test=develop
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * add more test
      
      * eager, test=develop
      
      * Support copiable selected rows and merge develop
      
      * save load, eager, test=develop
      
      * save load, eager, test=develop
      
      * refine, test=develop
      
      * remove useless _set_value method
      
      * refine, test=develop
      
      * refine, test=develop
      
      * revert static_runner, test=develop
      
      * EagerTensor to Tensor, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * clear grad, test=develop
      
      * merge, develop
      
      * merge, develop
      
      * merge, test=develop
      
      * merge, test=develop
      
      * Support quant and part of slice
      
      * support legacy static save
      
      * extend slim tests time
      
      * remove imperative on inference
      
      * remove imperative on inference
      
      * merge develop
      
      * fix typo
      
      * fix typo
      
      * split slice related code into 2 parts for imperative and eager
      
      * split slice from inference
      
      * split slice from inference
      
      * fix test_tensor_register_hook
      
      * support custom op in eager mode
      
      * fix inference deps error
      
      * split eager utils from custom operator
      
      * fix type match
      
      * fix typo
      Co-authored-by: Wang Huan <wanghuan29@baidu.com>
      Co-authored-by: Weilong Wu <veyron_wu@163.com>
      Co-authored-by: wanghuancoder <wanghuancoder@163.com>
    • [Eager] [Bug Fix] fix eager trace op bug (#40402) · 65adfecf
      wanghuancoder committed
      * fix some slice bug, test=develop
      
      * refine, test=develop
    • adjust params order for eager.Tensor._copy_to (#40449) · c6ec8b9f
      0x45f committed
    • [KP] Add unittests for brelu,ceil,celu,elu,floor,hard_shrink,hard_sigmoid,log1p,logsigmoid,relu6,silu,soft_relu,softsign,swish (#40448) · f269ca3f
      Lijunhui committed
      * solve unexecuted UT
      
      * add 24 activation op UT
      
      * append swish&thresholded_relu to kpfirst_list
      
      * rm thresholded_relu
    • [AutoParallel] Converter (#40434) · 3881b6cb
      zhaoyingli committed
      * [AutoParallel] Converter
      Converter API
    • [hybrid fix] fix sharding save offload (#40477) · edd97f94
      WangXi committed
    • fix_group_sharded_note (#40488) · e14a6ec9
      Baibaifan committed
    • [multiprocessing] Add paddle.incubate.multiprocessing for sharing tensors between python processes. (#37302) · e553f758
      Zhong Hui committed
      * Add support for paddle.multiprocessing
      * move multiprocessing to incubate.
    • Refine partial_program for new run_program OP (#40355) · afafb1c3
      0x45f committed
      * refine partial_program
      
      * fix code for test_mnist.py train
      
      * support quantify UT
      
      * make __fake_vars and _double_grads lazy
      
      * fix comments
  3. 12 Mar, 2022 1 commit
  4. 11 Mar, 2022 8 commits
  5. 10 Mar, 2022 5 commits
    • Inference add ONNXRuntime back-end (#39988) · 431afc39
      heliqi committed
      * add onnxruntime predictor
      
      * Add code comments
      
      * support link paddle2onnx onnxruntime
      
      * support onnxruntime with python
      
      * support onnxruntime with python
      
      * support onnxruntime with windows
      
      * paddle2onnx compile with windows
      
      * support windows compile
      
      * support windows compile with onnxruntime
      
      * support windows compile with paddle2onnx
      
      * support mac compile
      
      * compile with mac
      
      * compile with mac
      
      * add code comments
      
      * fix remind word
      
      * code optimization
      
      * add test case
      
      * add test case
      
      * add inference demo_ci test case
      
      * fix compile paddle2onnx with no python
      
      * add inference demo_ci test case
      
      * add inference demo_ci test case
      
      * add inference infer_ut test case
      
      * support c go api and test cases
      
      * add coverage test case
      
      * add coverage test case
      
      * add capi test case
      
      * add capi test case
    • [Auto Parallel]Update reshard for while sub block (#40366) · 2747de2b
      caozhou committed
      * update reshard for while sub block
      
      * fix code format error
    • Supported auto code gen for sparse kernels (#40276) · 2b6da4de
      Zhanlue Yang committed
    • add tril_triu for xpu, *test=kunlun (#40246) · 1128db30
      z8hanghuan committed
      * add tril_triu for xpu, *test=kunlun
      
      * add tril_triu for xpu, *test=kunlun
      
      * add tril_triu for xpu, *test=kunlun
      
      * add tril_triu for xpu, *test=kunlun
      
      * add tril_triu for xpu, *test=kunlun
    • Move dropout to phi (#40148) · 99fc1b08
      hong committed
      * move dropout to phi; test=develop
      
      * fix xpu, npu compile error; test=develop
  6. 09 Mar, 2022 10 commits