- 28 Apr, 2023 4 commits
-
-
Committed by Bo Zhang
* change judgement for DropoutGradGPUKernelDriver
* add UnrollerWithoutVecSize and after this Loaddata to be refined
* pass unittest
* use same unroller with XPU
* BroadcastWithInt64Index
* BroadcastDataLoader template partial specialization
* fix compile errs in ROCms
* clean ElementwiseT and InT for BroadcastKernel
* default axis and clean inT
* remove redundant fast divmod computation
* optimize drop_nd & drop_nd_grad
* optimize BroadcastDataLoader bf16 fp16
* rm InT etc. after merge develop
* delete constexpr for windows ci
* fix conflict
* fix conflic with develop
* fix conflic
* new clean
* clean
-
Committed by iSerendipity
-
Committed by co63oc
-
Committed by superwinner1
* 'fmin'
* 'fix'
* 'fix'
-
- 27 Apr, 2023 4 commits
-
-
Committed by gouzil
* [phi] move sequence_pool kernel to phi
* mv kernels impl
* fix parameter error
* clean include
* fix compat filename
* [phi] move fluid sequence_pool_grad to phi
* [phi][compat] sig rm GradVarName
* [phi] fix sequence_pool out type
* [phi] rm impl, add const string
* [phi] fix const str
* fix sequence_pooling cmake
* [phi] mv sequence_pooling_test
* [phi] fix grad sig
* [phi] fix sequence_pool is_test error
* [phi] fix sequence_pooling gpu include
* [phi] mv to impl
* [phi] fix SequencePoolFunctor cu include
* [phi] modify out max_index int32_t
* [phi] add pooltype mapping determine
* [phi] fix sequence_pool_sig
* [phi] fix sequence_pool_sig sum
* [phi] try ci
* [phi] fix max_index optional
-
Committed by NetPunk
* support fp16 for maxout op
* format code
* change api
* add test for static float16
* format code
* formatting code
* atol alignment
* experiment-1
* experiment-2
* experiment-3
* format code
-
Committed by superwinner1
-
Committed by cyberslack_lee
-
- 26 Apr, 2023 3 commits
-
-
Committed by sneaxiy
* optimize embedding deterministic mode
* fix compile error
* change FLAGS_cudnn_deterministic to int64
* fix 700 error
* add ut
* fix ut
* fix ut
* fix win32 ci
* fix flags with PHI_DEFINE_EXPORTED_int64
-
Committed by denglianbin
-
Committed by denglianbin
-
- 25 Apr, 2023 4 commits
-
-
Committed by Zero Rains
* create KernelMinMax to optimize the performance of histogram op in GPU
* change to block and warp wise operation
* remove the time in DtoH
* fix a bug
-
Committed by cyberslack_lee
-
Committed by shaojie_wang
* fix shared memory over usage in embedding grad kernel on determistic mode
* use IdT as interger dtype
-
Committed by Difer
* add fp_bf for pool_max_withidx
* fix some error
* fix error
* codestyle error
* fix masktype
* fix input bf type
* input bf dtype convert error
* back to convert input to bf16 first
* fix convert error
* fix bf16 grad check
-
- 24 Apr, 2023 2 commits
-
-
Committed by GGBond8488
* add 0D support for trace, test=allcase
* fix trace gpu kernel 0d error, test=allcase
* fix windows error, test=allcase
-
Committed by Siming Dai
Add paddle.geometric.weighted_sample_neighbors API
-
- 23 Apr, 2023 2 commits
-
-
Committed by zhangyuqin1998
* delete overwrite from gather_grad
* fix
* Update gather_grad_kernel.cc
-
Committed by zhangyuqin1998
* remove axis from elementwise_grad
* Update elementwise_sig.cc
-
- 22 Apr, 2023 1 commit
-
-
Committed by wangfengsheng1999
[Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850)
* [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
* [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
* add test case
* modify dot/metric.accuracy/static.accuracy/static.auc
* modfiy inner/tensordot bug
* test 9 api
* [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase
* fix bug
* support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy
* code style
* fix bug
* fix test_dot_op bug
* fix accuracy bug
* fix bug
* fix bug
* fix bug
* fix bug
* codestyle
* fix dot bug
* fix dot bug
* fix dot bug
* code style
* fix dot bug
* fix dot bug
* fix dot bug
* fix dot bug
* fix dot bug
* fix dot bug
* modify code
-
- 21 Apr, 2023 3 commits
- 20 Apr, 2023 1 commit
-
-
Committed by Chitsing KUI
* add flash randomness control
* fix VLOG undefied
-
- 19 Apr, 2023 2 commits
-
-
Committed by limingshu
* first commit
* restruct c++ interface to divide linear from matmulwithcublaslt
* finish building in cublaslt impl
* fix code bugs
* fix host cost
* add some changes
-
Committed by zhangyuqin1998
* fix graph_reindex
* fix
* Update op_compat.yaml
-
- 18 Apr, 2023 7 commits
-
-
Committed by chenxujun
* Add prelu, clip_by_norm, multi_dot tests
* Fix code
* Fix code
-
Committed by Zhang Zheng
* [AMP OP&Test] Unique support float16&bfloat16
* add test
-
Committed by zhangyuqin1998
* reorder MatrixRank
* fix
* fix
* fix
* fix
* fix
-
Committed by chenxujun
-
Committed by chenxujun
* Add split, split_with_num tests
* Add randperm tests
* Fix code
-
Committed by chenxujun
-
Committed by zhangyuqin1998
* reorder_prior_box
* fix
-
- 17 Apr, 2023 5 commits
-
-
Committed by Vvsmile
* fix multinomial
* fix test_elementwise
* fix convert_float_to_uint16
* aadd test_multimial_op
* fix code style
-
Committed by thunder95
* untracked files
* bce_loss_fp16
* remove unused files
* back max_rel_erro still big
* simplify code
* upd
* fix max_relative_error
* restart ci
* Update test_bce_loss.py
* Update test_bce_loss.py
* Update test_bce_loss.py
* Update test_bce_loss.py
* try to pass test
* restore file
* remove error value
* fix bug
---------
Co-authored-by: Zhang Ting <Douyaer2020@qq.com>
-
Committed by Hanchiao
* Implement optimized kernel for OP-expand_as.
* Support fp16.
Co-authored-by: Timber-Ye <ye_hanqiao@163.com>
Co-authored-by: BrianQian1999 <brianqianhitsz@gmail.com>
* remove fp16 support
* remove MAX_RANK_SUPPORTED
---------
Co-authored-by: BrianQian1999 <brianqianhitsz@gmail.com>
-
Committed by zhangyuqin1998
-
Committed by Sonder
* add register info for eigh and eig_gard
* add sync_batch_norm_op.cu register info
* add lamb output register info
* add unique register info
* change type name
* change type name
* add output register info for check_finite_and_unscale
* update cmake and config file
* add register info for adagrad
* fix build error
* add sync to run_unittests.sh
* add register info for unique_consecutive
* fix build error
* add eigh to STATIC_BUILD_TESTS
* update eig_kernel.cc
* update eig_kernel.cc
* fix infer mate error
* fix unique register error
* fix lamb register info error
* fix lamb register info
* update lamb register info
* fix lamb
* remove one Output Register
* update static build file
* add eigh op to disable_wingpu_test
* update run_unittests
-
- 14 Apr, 2023 2 commits
-
-
Committed by Zhang Zheng
-
Committed by cyberslack_lee
-