1. 19 8月, 2022 9 次提交
    • W
      fix layernormTrt meanVar alloc bug (#45255) · 6fb34e74
      Wang Bojun 提交于
      * fix layernormTrt meanVar alloc bug
      6fb34e74
    • R
      Fix random op dependency and lr_shedule bugs for standalone executor (#45265) · 6d4ae007
      Ruibiao Chen 提交于
      * Fix random op depenency and lr_shedule bugs for standalone executor
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix CI errors
      6d4ae007
    • W
      Trt groupnorm dynamic plugin (#44911) · 1aa6adb1
      Wang Bojun 提交于
      * add group_norm dyanmic plugin
      1aa6adb1
    • H
      polish default param of XXX_interp_test, the same default value with … (#45258) · 4528ed2a
      HongyuJia 提交于
      * polish default param of XXX_interp_test, the same default value with XXX_interp_np
      
      * set default value data_layout=NCHW, cause C++ end treat NCDHW the same way as NCHW
      4528ed2a
    • D
      [XPU] add merged_momentum unittest and change momentum (#45241) · e0f1c9f2
      dongfangshenzhu 提交于
      * add merged_momentum *test=kunlun
      
      * add merged_momentum *test=kunlun
      
      * add fp16 to merged_momentum,*test=kunlun
      
      * change dist_model.cc
      
      * add merged_momentum unittest and  change momentum,test=kunlun
      
      * add merged_momentum unittest and  change momentum,test=kunlun
      
      * add merged_momentum unittest and  change momentum,test=kunlun
      
      * add merged_momentum unittest and  change momentum,test=kunlun
      e0f1c9f2
    • N
      [CodeStyle] use np.testing.assert_allclose instead of... · 4e2a3c11
      Nyakku Shigure 提交于
      [CodeStyle] use np.testing.assert_allclose instead of self.assertTrue(np.allclose(...)) (part 3) (#45251)
      
      4e2a3c11
    • X
      [ Dy2Static ]Modify while interface[python] to fit onnx (#45034) · e654f1e7
      xiongkun 提交于
      * Make sure that the output of whilep must exist in the input
      
      * insert assign in block(0)
      
      * add unittest.
      e654f1e7
    • N
      [CodeStyle] use np.testing.assert_allclose instead of... · 9107b653
      Nyakku Shigure 提交于
      [CodeStyle] use np.testing.assert_allclose instead of self.assertTrue(np.allclose(...)) (part 2) (#45213)
      
      * autofix (get ci log)
      
      * retrigger ci
      
      * fix test_gather_nd_op, wrong expected in dygraph
      
      * fix test_activation_op, unpack static graph result
      
      * fix test_auc_op, unpack static graph result
      
      * fix test_bce_loss, unpack static graph result
      
      * fix test_bce_with_logits_loss, unpack static graph result
      
      * fix test_cond, unpack static graph result
      
      * fix test_dygraph_weight_norm, wrong numpy reference when `axis=None`
      
      * fix test_einsum, wrong matmul inputs
      
      * fix test_elementwise_heaviside_op, unpack static graph result
      
      * fix test_frac_api, unpack static graph result
      
      * skip test_group_norm_op_v2, probably the wrong numpy reference
      
      * fix test_imperative_double_grad, wrong subscript
      
      * skip test_imperative_tensor_clear_gradient, ???
      
      * skip test_layer_norm_op, probably the wrong numpy reference
      
      * fix test_math_op_patch, unpack static graph results
      
      * fix test_masked_select_op, unpack static graph results
      
      * fix test_mse_loss, unpack static graph results
      
      * fix test_multi_label_soft_margin_loss, unpack static graph results
      
      * fix test_multi_dot_op, unpack static graph results
      
      * fix test_nll_loss, unpack static graph results
      
      * fix test_normalization_wrapper, unpack static graph results
      
      * fix test_pass_builder, unpack static graph results
      
      * fix test_prelu_op, possibly an extra comma
      
      * fix test_psroi_pool_op, unpack static graph results
      
      * fix test_queue, unpack static graph results
      
      * fix test_reorder_lod_tensor, compare an item with a list
      
      * fix test_rrelu_op, unpack static graph results
      
      * fix test_searchsorted_op, unpack static graph results
      
      * fix test_sigmoid_focal_loss, unpack static graph results
      
      * fix test_smooth_l1_loss, unpack static graph results
      
      * fix test_soft_margin_loss, unpack static graph results
      
      * fix test_softmax2d, unpack static graph results
      
      * fix test_square_error_cost, unpack static graph results
      
      * fix test_tril_indices_op, unpack static graph results
      
      * fix test_unsqueeze_op, mismatch numpy reference (axis)
      
      * skip test_layers, `static_rlt` is missing an axis
      
      * fix test_mnist, unpack PredictorTools result (also a list)
      
      * fix test_build_strategy, unpack PredictorTools result
      
      * fix test_mobile_net, unpack PredictorTools result
      
      * fix test_resnet_v2, unpack PredictorTools result
      
      * revert some changes
      
      revert test_layers
      
      revert test_group_norm_op_v2
      
      revert test_layer_norm_op
      
      revert test_imperative_tensor_clear_gradient
      
      * fix test_normal, use flatten instead of reshape, (PR-CI-Windows-OPENBLAS)
      
      * empty commit, trigger CI
      9107b653
    • A
      [CustomDevice] support scalar (#45244) · dc331231
      Aganlengzi 提交于
      dc331231
  2. 18 8月, 2022 9 次提交
  3. 17 8月, 2022 7 次提交
  4. 16 8月, 2022 8 次提交
  5. 15 8月, 2022 7 次提交
    • R
      f30c7bd6
    • H
      [phi] change op name linear_interp_v2 to linear_interp (#45128) · 6de3bdb3
      HongyuJia 提交于
      * change name linear_interp_v2 to linear_interp
      
      * fix deprecated_op_names
      
      * deprecated_op_names add linear_interp_grad
      6de3bdb3
    • Z
      Refine TRT unit test (#45102) · 3512bf11
      zlsh80826 提交于
      * Reduce pool2d test configuration
      
      * Reduce depthwise_conv2d test configuration
      
      * Reduce trt_convert_conv2d_fusion test configuration
      
      * Reduce trt_convert_conv2d test configuration
      
      * Reduce trt_convert_conv2d_transpose test configuration
      
      * Reduce trt_convert_hard_swish test configuration
      
      * Enhance trt auto scan test error message and mechanism
      
      * Increase FP16 trt ut tolerance
      3512bf11
    • Z
      add mish and mish_grad for XPU, test=kunlun (#45098) · 6815c8ab
      zhangyikun02 提交于
      6815c8ab
    • Z
      [AutoParallel] add collate_fn for dist_loader (#45053) · 3649099f
      zhaoyingli 提交于
      * add collate_fn
      
      * fix number of inputs
      3649099f
    • H
      [jit] rm useless property pybind (#44962) · 8788513b
      Hui Zhang 提交于
      * rm useless pybind
      
      * rm useless ut
      8788513b
    • Y
      [Auto Parallel] Move the distributed info from python to c++ (#44510) · a52357fe
      Yulong Ao 提交于
      * [Auto Parallel] Move the distributed info from python to c++
      
      * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc
      
      * [Auto Parallel] Add the lost file
      
      * [Auto Parallel] Make the dist attr be unique_ptr
      
      * [Auto Parallel] Add the proto conversion
      
      * [Auto Parallel] Improve the proto support
      
      * [Auto Parallel] Fix the bugs for adding a device or a link
      
      * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper
      
      * [Auto Parallel] Improve the impl of these dist attrs
      
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto depedency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
      
      * [Auto Parallel] Remove unused codes
      
      * [Auto Parallel] Check whether the dist attr is null
      
      * [Auto Parallel] Implement the assign operator for OpDesc explicitly
      a52357fe