- 08 9月, 2023 1 次提交
-
-
由 xiaoguoguo626807 提交于
* add reference of lbfgs * add reference of lbfgs * modify concat kernel choose * modify ci
-
- 07 9月, 2023 3 次提交
-
-
由 Difer 提交于
* rename fluid to base * fix setup and rename paddle.fluid * fix protobuf generated paddle.fluid * fix some errors * fix cmake depends * fix some error * try to fix cache error * fix codestyle && some doc error * remove build change * fix some error --------- Co-authored-by: NXiangGao <jeff41404@gmail.com>
-
由 Ruibin Cheung 提交于
-
由 yangguohao 提交于
* complex op logsigmoid * fix 2023-08-31
-
- 06 9月, 2023 4 次提交
-
-
由 Chen Zhiyang 提交于
* div passed v1.0 * IrChange->IrGuard & optimize static input dtype * clean * remove IrChange and optimize IrGuard
-
由 akshatvishu 提交于
-
由 kangguangli 提交于
-
由 zhenhailiu 提交于
* sep group * add test * test ok * polish * test cmake script generated * add sep group * format * polish * polish
-
- 04 9月, 2023 9 次提交
-
-
由 tianhaodongbd 提交于
* add rotate_half in fused_rope * add position_ids in fused_rope * modified examples about fused_rope * add set_device in examples
-
由 Sonder 提交于
* remove has_structed_kerenl and has_fluid_kernel * add test_fused_layernorm_op to STATIC_BUILD_TESTS list * open static_build flag * remove distributed_fused_lamb_init from StaticBuildBlackList * use initialized replacing IsInitialized * recover codes * delete useless codes * close the flag
-
由 niuliling123 提交于
* add num_splist for flash_attn_bwd and FlashAttnUnpaddedGradKernel * Add assertTrue * Update submodule to a specific commit
-
由 wanghuancoder 提交于
* disable strided split
-
由 zhangbo9674 提交于
* add code * add code * refine code * add code * fix bug * fix bug * fix bug * add code * add ut * polish code * fix bug * refine code * fix bug * refine code * fix bug * refine code * fix bug * refine code * fix bug * refine code * add code * fix bug * fix bug * fix bug * fix bug * fix bug * refine code
-
由 WangZhen 提交于
-
由 duanyanhui 提交于
-
由 BiynXu 提交于
Change FlattenLoops in the elementwise schedule to Fuse
-
由 wanghuancoder 提交于
* fix contiguous
-
- 03 9月, 2023 1 次提交
-
-
由 Chen Zhiyang 提交于
* add reference of lbfgs * add reference of lbfgs * new ir op test v1.0 * fix new ir optest bug1.0 * modify two testcase bug * add new ir white list & pass test_mean_op.py * rename white list * add new_ir_guard * new ir sum op test all pass * rename backward.grad as ir_backward.grad * check place for new ir * fix test_build_model env bug * fix test_prim_program backward bug * change backward to ir_backward in check_appr * add check_new_ir flag for mkldnn * clean --------- Co-authored-by: Nwangruting <wangruting@baidu.com>
-
- 01 9月, 2023 1 次提交
-
-
由 Scotty 提交于
* support index_select op * index_sample in cpu * support index_sample in gpu * change data_transform * fix api gen and use skip_transform in yaml
-
- 31 8月, 2023 6 次提交
-
-
由 iSerendipity 提交于
* add complex support for isclose * add complex test for isclose * fix template complie issue * fix cuda compilation error * fix type typo * fix error for complex's abs * add complex dtype into input * fix ut
-
由 Chen Zhiyang 提交于
* add reference of lbfgs * add reference of lbfgs * new ir op test v1.0 * fix new ir optest bug1.0 * modify two testcase bug * add new ir white list & pass test_mean_op.py * rename white list * add new_ir_guard * rename backward.grad as ir_backward.grad * check place for new ir * fix test_build_model env bug * fix test_prim_program backward bug * change backward to ir_backward in check_appr --------- Co-authored-by: Nwangruting <wangruting@baidu.com>
-
由 Scotty 提交于
-
由 iSerendipity 提交于
-
由 Tian Zheng 提交于
* Add fused_scale_bias_relu_conv_bnstats op * Review changes * Fix no CUDNN Frontend build * Fix PADDLE_ENFORCE format * Fix PADDLE_ENFORCE CI error * Rename kernel filename * Refactor unittest to use paddle eager_op_test * Fix padding bugs * Review changes * test=cuda117 * test=cuda117
-
由 Yuang Liu 提交于
-
- 30 8月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* 【complex op】No.6 add complex support for logical_and/or/xor/not * fix dtype check * modify the docs * add special condition for not raise when x.dtype is complex * add random generate for complex dtype * fix generate for complex * fix * fix * add corner case for complex type * fix ut * fix ut
-
- 29 8月, 2023 1 次提交
-
-
由 gouzil 提交于
* [Fluid] move lars_momentum to phi * add sig * fix optional Output * off check_dygraph * fix input * fix operator[] * fix * try fix AllocateTmpTensor * fix * fix type * Update paddle/phi/kernels/gpu/lars_momentum_kernel.cu * fix type * rollback * Add Registration * try fix win * try fix win * try use double * try use operator *(float,const Derived &) * try auto * fix * fix * fix * fix dtype * fix type * fix index
-
- 28 8月, 2023 4 次提交
-
-
由 张春乔 提交于
[API Enhancement] No.6 support single `int` input in UpsamplingNearest2D and UpsamplingBilinear2D (#56470) * enhance single int input in UpsamplingNearest2D and UpsamplingBilinear2D * add unittest * add unittest
-
由 GGBond8488 提交于
【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509) * tmp commit * add atan2 * add inplace api * fix error * add inpalce divide * add inplace api * add more inplace * add more inpalce * fix logical_not error * support sinh and cosh in cpu * support asin, acos, atan, asinh, acosh, atanh in cpu * fix typro * fix typro * mv out atan2 ldexp * mv out atan2 ldexp * support sinh and cosh in gpu * support asin, acos, atan, asinh, acosh, atanh in gpu * fix ge error * fix dygraph commpare error * fix dygraph commpare error * check complex in python * fix cast inpalce error * open inplace test * fix ops.yaml error * mv cast inpalce to python * fix coverage ci * add last inplace * fix inplace error * fix cast error * fix error * add nan_to_num_ * fix typro * fix sparse cast error * remove gpu 4 * fix static cast error * tmp commit * add atan2 * add inplace api * fix error * add inpalce divide * add inplace api * add more inplace * add more inpalce * fix logical_not error * fix typro * fix typro * mv out atan2 ldexp * mv out atan2 ldexp * fix ge error * fix dygraph commpare error * fix dygraph commpare error * fix cast inpalce error * open inplace test * fix ops.yaml error * mv cast inpalce to python * fix coverage ci * add last inplace * fix inplace error * fix cast error * fix error * add nan_to_num_ * fix typro * fix sparse cast error * remove gpu 4 * fix static cast error * fix cast error * fix * Revert "check complex in python" This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05. * add renorm , fix error * add coverage * fix cumsum inpalce version error * add cast inpalce impl * rm test.log * fix multiply_dyfunction and add multiply_backward test * add and use is_same_tensor * fix typro * fix sone error * fix typro --------- Co-authored-by: NScotty <jmhgchn@gmail.com> Co-authored-by: NScotty <527407973@qq.com>
-
由 Xavier ZXY 提交于
* Fix: The number of times to recursively compute the difference supports any number * Fix: use pre-commit to format code
-
由 Sonder 提交于
* move shuffle_batch to phi * remove useless codes * add test_shuffle_batch_op to STATIC_BUILD_TESTS * move shuffle_batch_kernel.cc to cpu folder * move shuffle_batch_grad to phi * rm shuffle_batch_op.h * change year at file head
-
- 25 8月, 2023 1 次提交
-
-
由 xiaoxiaohehe001 提交于
* add_bias_and_simplify_mmha
-
- 22 8月, 2023 1 次提交
-
-
由 FormlessUnit 提交于
* Modify kernels to support quantized_matmul --------- Co-authored-by: Nsuperxf <1208713646@qq.com>
-
- 21 8月, 2023 3 次提交
-
-
由 feifei-111 提交于
-
由 Lu Qi 提交于
-
由 Ryan 提交于
* add complex numel * change test && add doc
-
- 18 8月, 2023 1 次提交
-
-
由 lzy 提交于
[Inference] Make share_external_data supports bf16 and bool; fix while_op cache_inference_while_scope when using fleet_executor. (#56055) * 1. make share_external_data supports bf16 and bool; 2. don't drop_kids when cache_inference_while_scope * fix FLAGS_cache_inference_while_scope * add unitest * add unitest * skip unitest when cudnn_version < 8100 * skip test share_external_data_bf16 when CUDA_ARCH < 80
-
- 17 8月, 2023 3 次提交
-
-
由 huangjiyi 提交于
* update * fix codestyle * update * update
-
由 Sonder 提交于
* remove execution_config.used_for_control_flow_op * update * update * open static build flag * close static build flag * open static build flag * add searchsorted to analyze dtype list * recover and add test_searchsorted_op to static build list * Update CMakeLists.txt * Update CMakeLists.txt
-
由 zhiboniu 提交于
* add lu_unpack data check * add error input api test * add error type info
-