- 22 3月, 2023 9 次提交
-
-
由 Sławomir Siwek 提交于
* extract common methods to reuse * add header for transpose ops * fused_transpose * Split big function * transpose2 tests * fused_transpose * Apply extra attributes * add pbtxt file * update pbtxt * Merge develop * add more strict op compats * code style * remove mkldnn_data_type * unify SetOutMemDescWithReshape2FuseSupport * adjust quantize-dequantize for transpose * remove appendact * transpose2 quantization * fix int8 tests * adjust transpose_op to current develop * delete fusion code from transpose_kernel * add fused transpose to NHWC unittest * change order
-
由 PuQing 提交于
* add nms3 register output defs * remove nms from set * remove nms from set
-
由 Bo Zhang 提交于
* test_logit_op * add cudaKernel to replace eigen impl * bf16 unit test CI
-
由 niuliling123 提交于
-
由 Zhang Zheng 提交于
This reverts commit 3b2cd23a.
-
由 sneaxiy 提交于
* add fused_linear_param_grad_add_kernel * fix compile error * remove flag * fix ci compile error * fix ci compile error * revert pylayer revision * fix ci ut * improve performance
-
由 Bo Zhang 提交于
* test_accuracy_op * add create_test_fp/bf16_class * cast after calculation * change convert_uint16_to_float_ifneed * delete TestAccuracyOpFp32 according to PR comment * fix the rtol setting rules in bfloat16 forward
-
由 Difer 提交于
-
由 Yuang Liu 提交于
-
- 21 3月, 2023 9 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
由 ShenLiang 提交于
* fix flash_attention * Update mp_layers.py
-
由 Zhang Zheng 提交于
-
由 zhouweiwei2014 提交于
* [Zero-Dim] Support output 0D for argmin/argmax/median/kthvalue/mode/equal_all/allclose * fix CI
-
由 Siming Dai 提交于
* add fp16 unittest * support bf16 and add unittest * fix according to review
-
由 houj04 提交于
* [XPU] add fp16 support for compare ops. * fix ci.
-
由 zhouweiwei2014 提交于
-
由 Bo Zhang 提交于
* with printf * add DropOutNdForwardKernel * PR comment
-
由 zhouweiwei2014 提交于
[Zero-Dim] Support 0D for numel/rank/size/optimizer/create_parameter/create_global_var, fix some usage to adapt 0D (#51566)
-
- 20 3月, 2023 9 次提交
-
-
由 201716010711 提交于
-
由 YuanRisheng 提交于
* remove init * delete fluid in context pool * fix custom op bugs * fix profiler bugs * fix ci bugs * fix window compile bugs * fix windows bugs * fix window bugs
-
由 limingshu 提交于
* optimization for fused linear op * fix code format * optimization for linear fused forward * merge with develop * fix bugs for gemm_ephilog * package of cublaslt ephilogue type with enmu * final fix before code reviewing * fix missed fusedType typo * fix code according to review suggestions * fix windows ci error * change location of MatmulPlanner * add some changes for compiler error fix ---------
-
由 iSerendipity 提交于
* fix Werror in roi_align_grad_kernel * adopt a better way
-
由 zyfncg 提交于
* register some custom kernel * fix bug
-
由 mayang002 提交于
-
由 FormlessUnit 提交于
shape support bf16
-
由 zhouweiwei2014 提交于
-
由 ykkk2333 提交于
* add xpu tile and concat kernel int64, test=kunlun * fix previous xpu dataoader bug, and add maxpool3dgrad special dim support, test=kunlun
-
- 19 3月, 2023 2 次提交
- 17 3月, 2023 6 次提交
-
-
由 denglianbin 提交于
* finish task * fix some question. * fix error * change unittest:zeroDim.
-
由 Infinity_lee 提交于
-
由 PuQing 提交于
* add multinomial output defs * fix register on gpu
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Support float & bfloat16 when using cub * fix compile error * fix * fix rocm compile error
-
由 chenxujun 提交于
-
由 gouzil 提交于
* [phi][jit] rm Softmax StrideScal * [phi][jit] rm kStrideScal * [phi][jit] fix Softmax clean omission * [phi][jit] fix Softmax clean omission * [phi][jit] fix StrideScal clean omission * [phi][jit] fix mkl SoftmaxKernel clean omission * [phi][jit] fix test error * [phi][jit] fix test error * [phi][jit] rm NCHW16CMulNC * [phi][jit] fix test error * [phi][jit] rm HSum HMax * [phi][jit] fix test error * [phi][jit] rm StrideASum * add AUTHORS.md * [phi][jit] fix test error
-
- 16 3月, 2023 5 次提交
-
-
由 Chitsing KUI 提交于
* rename flash_attn_raw to flash_attn_unpadded * fix static api * fix static return
-
由 shaojie_wang 提交于
* add fp32 grad plus fp16 param in adamw * add python UT * fix test case * in test_adamw_op py file, force the moment2 value LE 0 * add a compare option * remove bf16 fused adam kernel case
-
由 Huang Jiyi 提交于
* remove contexts in tensor_utils * update from_blob * update from_blob * update from_blob * fix bug * fix bug
-
由 PuQing 提交于
* add rnn and searchsorted output defs * add gpu kernel
-
由 Huang Jiyi 提交于
* remove fluid thread_data_registry * update * fix bug
-