1. 15 Mar, 2022 14 commits
    • oneDNN NHWC fixes (#40049) · dde9cec0
      Jacek Czaja authored
      * - Prototype of third solution
      
      - fix
      
      - compilation fixes
      
      - fix
      
      - fix
      
      - fix
      
      - fix
      
      - compilation fix
      
      - comment fix
      
      - lint
      
      update mkldnn conv_elementwise_add_fuse_pass ut
      
      - NHWC changes to prelu
      
      - alpha dims
      
      - UT fix
      
      - fix to UT
      
      - lint
      
      - Some fixes
      
      - added to BWD of prelu NHWC support
      
      - reverted removal of resetting cu_layout in clearing of caching
      
      * - Small changes
      
      * - compilation fix
      
      * - fix
      
      * - fix
      
      * lint
      
      * - fixes after internal review
      
      * - compilation fix
      
      * - lint
    • add shard_id (#40261) · 6b7d4845
      Thunderbrook authored
      * shard_id
      
      * format
    • [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass... · 64223620
      xiongkun authored
      [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)
      
      * transfer and pass the lgamma unittest
      
      * merge and pass the test
      
      * transfer kldiv_loss and kldiv_loss_grad; pass the unittest
      
      * transfer the isclose and cumprod kernel
      
      * change PT_REGISTER -> PD_REGISTER
      
      * fix by code review
      
      * fix by code review
      
      * fix
      
      * remove enforce include dependence from scalar
      
      * fix
      
      * fix by code review
      
      * fix by code review
    • [Phi]move reduce_min/any/all kernel (#40374) · c46e661d
      chentianyu03 authored
      * add reduce_min kernel
      
      * remove raw reduce_min kernel
      
      * add reduce min
      
      * add reduce any all impl
      
      * add bool reduce Kernel
      
      * remove raw any/all kernel
      
      * add any all kernel
      
      * rm comment
    • Added more profile signposts to dygraph (#40201) · 36db75b4
      Zhanlue Yang authored
      * Added more signposts to dygraph profiling
      
      * Fixed minor issues
      
      * Refactored signpost names
      
      * Fixed typo
      
      * Removed debug codes
      
      * Fixed typo
      
      * Adjusted signpost names
      
      * Fixed issues from branch merge
    • Move one hot to phi (#39876) · 7701db37
      hong authored
      * move one hot to phi; test=develop
      
      * fix bugs; test=develop
      
      * fix bugs; test=develop
      
      * add infer meta; test=develop
      
      * fix bugs; test=develop
      
      * resolve conflict
      
      * resolve conflict
      
      * fix bug;
      
      * fix error; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * add one api in eager mode; test=develop
      
      * add one hot test; test=develop
      
      * remove useless code; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
    • Fix truncated norm operator (#40287) · 0c333543
      Chang Xu authored
    • [Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180
      YuanRisheng authored
      * move activation op
      
      * adjust code format
      
      * fix compile bugs
      
      * fix ci bugs
      
      * code format adjust
      
      * code format adjust2
      
      * activate ci status
      
      * modify according to comment
      
      * move activation kernel
      
      * revert relu6
      
      * reduce add code
      
      * perfect use_phi_functor
      
      * completing func name
      
      * fix bugs when run ci
      
      * fix bugs when run infr
      
      * modify infrt get kernel signature
    • [MLU] add check_finite_and_unscale op for amp (#40458) · 42c7bb47
      qipengh authored
    • [Phi]Move searchsorted kernel to phi (#40520) · 85f8fd9b
      Zhang Zheng authored
    • [Dygraph] Refactoring of reducer in DataParallel (#40389) · 1a32391c
      Haohongxiang authored
      * refactor reducer
      
      * modify cmakelists
      
      * solve conflicts
      
      * rename group and update process_group
      
      * fix bugs of ProcessGroupNCCL
      
      * modify for CIs
      
      * refactoring reducer
    • Remove pybind index error (#40538) · 47d764a3
      zyfncg authored
      * change the exception of getitem from pybind type to PADDLE_ENFORCE
      
      * fix bug
      
      * remove pybind::index_error exception
    • [Phi]Move kron kernel to phi (#40427) · f181d47f
      Zhang Zheng authored
      * first commit
      
      * fix
      
      * fix
      
      * fix compile error
      
      * fix
      
      * fix complex
      
      * fix
      
      * fix
      
      * fix npu
      
      * fix
      
      * modify according to comments
      
      * fix
    • move allclose infershape (#40508) · 5d08a447
      Chen Weihang authored
  2. 14 Mar, 2022 19 commits
    • [Phi]Add diag_v2 grad kernel (#40447) · e157f2af
      Siming Dai authored
      * Add diag grad kernel
      
      * fix unittest case
      
      * add float16, remove const &
      
      * delete diag_grad in op_utils.h
    • [PHI] Move set_value_grad kernel from fluid to phi (#40478) · 3149e399
      zyfncg authored
      * move set_value_grad kernel from fluid to phi
      
      * add unittest for passing coverage ci
    • Add an elementwise + activation fusion pass. (#36541) · 3f219160
      Tomasz Socha authored
      * Add elementwise add and activation fuse pass
      
      * Fix copy elision
      
      * More flexible pattern detector
      
      * More flexible fusion pass
      
      * Update lists for pass
      
      * Add support for Pow operator
      
      * Add support for more activation types
      
      * Style
      
      * Rename fusion pass
      
      * First version of tests
      
      * Dirty version of pass
      
      * Polished version
      
      * Update pbtxt
      
      * Style
      
      * Update names
      
      * Style
      
      * Use PADDLE_ENFORCE_EQ
      
      * Save error message to variable
      
      * WO for error checks
      
      * CR
      
      * Static style check
      
      * Add missing 'activation_scale' attribute
      
      * Add relu6 and sigmoid activations
      
      * Style
      
      * Fix fuse list formatting
      
      * Sync filenames for fuse pass files
      
      * Fix cmake after move
      
      * Fix registration
      
      * Fix pass name in tests
      
      * Add missing activations to checker
      
      * WIPS
      
      * Working mul op
      
      * Working sub
      
      * Working Add
      
      * Remove pten includes
      
      * Remove some forward declarations
      
      * Remove Includes
      
      * Fixes
      
      * Remove default kernels
      
      * Add check if post_ops attributes are available
      
      * Style
      
      * Code adjustment
      
      * Register default kernels
      
      * We have year 2022 not 2021...
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Fast review fixes
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Review Fix
      
      * Rename one_dnn -> onednn
      
      * Style after review
      
      * Fast and dirty fix for quantization
      
      * Update tests
      
      * Style
      
      * Fix mkldnn_quantizer config
      
      * Add Joanna's suggestion.
      
      * Check if operator is explicitly disabled on OneDNN
      
      * Try to use unregistered attributes
      
      * Style
      
      * Test new framework
      
      * FXI
      
      * FXII
      
      * Update test
      
      * Style
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
    • [MLU] add merged_momentum mlu kernel (#40406) · 1f7b2516
      fwenguang authored
    • optimize group_norm op backward (#39944) · 5720537e
      crystal authored
      * optimize backward
      
      * optimize group_norm backward
      
      * Add vectorized code
      
      * move assignment code
      
      * merge function
      
      * move code
      
      * optimize code
      
      * Modify function name
    • Optimize bilinear_interp backward (#39423) · 9e1f762c
      Lijunhui authored
      * bilinear_bw init
      
      * optimize code
      
      * optimize
      
      * optimize 2
      
      * optimize functions
      
      * modify func name
    • [phi]migrate fmax,fmin kernel to phi (#40140) · bb801960
      Xiaoxu Chen authored
    • Support custom op and paddle.autograd.backward in eager (#40423) · 227fa408
      Jiabin Yang authored
      * eager, test=develop
      
      * fix bug, test=develop
      
      * eager, test=develop
      
      * merge legacy to fluid
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Refactor TensorAdd func by template and remove gradient_accumulation in eager
      
      * Remove needless target name
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Use overload instead of template
      
      * Remove legacy code
      
      * Remove legacy code
      
      * selectedrows, test=develop
      
      * Remove DataType test
      
      * eager, test=develop
      
      * eager, test=develop
      
      * support gan, test=develop
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * ptb, test=develop
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * add more test
      
      * eager, test=develop
      
      * Support copiable selected rows and merge develop
      
      * save load, eager, test=develop
      
      * save load, eager, test=develop
      
      * refine, test=develop
      
      * remove useless _set_value method
      
      * refine, test=develop
      
      * refine, test=develop
      
      * revert static_runner, test=develop
      
      * EagerTensor to Tensor, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * clear grad, test=develop
      
      * merge, develop
      
      * merge, develop
      
      * merge, test=develop
      
      * merge, test=develop
      
      * Support quant and part of slice
      
      * support legacy static save
      
      * extend slim tests time
      
      * remove imperative on inference
      
      * remove imperative on inference
      
      * merge develop
      
      * fix typo
      
      * fix typo
      
      * split slice related code into 2 part for imperative and eager
      
      * split slice from inference
      
      * split slice from inference
      
      * fix test_tensor_register_hook
      
      * support custom op in eager mode
      
      * fix inference deps error
      
      * split eager utils from custom operator
      
      * fix type match
      
      * fix typo
      Co-authored-by: Wang Huan <wanghuan29@baidu.com>
      Co-authored-by: Weilong Wu <veyron_wu@163.com>
      Co-authored-by: wanghuancoder <wanghuancoder@163.com>
    • Optimize performance of log_softmax (#38992) · 250e254f
      Zhang Zheng authored
      * Optimize performance of log_softmax
      
      * delete unity build
      
      * modify to phi
      
      * fix
      
      * fixfixfixfix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * simplify
      
      * fix
      
      * fix enforce
    • adjust params order for eager.Tensor._copy_to (#40449) · c6ec8b9f
      0x45f authored
    • [KP] Add unittests for... · f269ca3f
      Lijunhui authored
      [KP] Add unittests for brelu,ceil,celu,elu,floor,hard_shrink,hard_sigmoid,log1p,logsigmoid,relu6,silu,soft_relu,softsign,swish (#40448)
      
      * solve unexecuted UT
      
      * add 24 activation op UT
      
      * append swish&thresholded_relu to kpfirst_list
      
      * rm thresholded_relu
    • 【phi】migrate matrix_rank to phi (#40074) · b9d4285b
      crystal authored
      * migrate matrix_rank to phi
      
      * migrate eigh and matrix_rank to phi
      
      * fix matrix_rank
      
      * optimize code
      
      * move matrix_rank to phi
      
      * add max functor
      
      * migrate matrix_rank to phi
      
      * optimize code
    • zhouweiwei2014's avatar
      930a5136
    • Update profiler (#40460) · 89a70c76
      liutiexing authored
    • Z
      d6e99fe4
    • [GPUPS]fix instag lod information (#40483) · e5c59fc9
      zmxdream authored
    • [multiprocessing] Add paddle.incubate.multiprocessing for sharing tensors ... · e553f758
      Zhong Hui authored
      [multiprocessing] Add paddle.incubate.multiprocessing for sharing tensors between python processes. (#37302)
      
      * Add support for paddle.multiprocessing
      * move multiprocessing to incubate.
    • Move Pool OPs to phi (#40208) · 88ec08a7
      From00 authored
      * Move Pool OPs to phi
      
      * Fix CI error
      
      * Fix conflicts
    • Refine partial_program for new run_program OP (#40355) · afafb1c3
      0x45f authored
      * refine partial_program
      
      * fix code for test_mnist.py train
      
      * support quantify UT
      
      * make __fake_vars and _double_grads to lazy
      
      * fix comments
  3. 13 Mar, 2022 2 commits
  4. 12 Mar, 2022 5 commits