- 19 4月, 2023 7 次提交
-
-
由 limingshu 提交于
* first commit * restruct c++ interface to divide linear from matmulwithcublaslt * finish building in cublaslt impl * fix code bugs * fix host cost * add some changes
-
由 huangjiyi 提交于
-
由 Wang Xin 提交于
* add autogen code support for mean_all op * bug fixed * bug fixed * bug fixed
-
由 zhangyuqin1998 提交于
* fix graph_reindex * fix * Update op_compat.yaml
-
由 huangjiyi 提交于
* update * fix bug * update * fix bug
-
由 huangjiyi 提交于
-
由 huangjiyi 提交于
* update * fix bug
-
- 18 4月, 2023 9 次提交
-
-
由 chenxujun 提交于
* Add prelu, clip_by_norm, multi_dot tests * Fix code * Fix code
-
由 cyber-pioneer 提交于
* add gn vjp * fix 0 * fix args num * fix type * debug2 * remove unused expand * support fp16 * fix typo * fix reshape bug * test3 * test4 * fix bug3 * add comment
-
由 huangjiyi 提交于
* update * fix bug * update * fix bug
-
由 huangjiyi 提交于
* update * revert lookup_table_op
-
由 张春乔 提交于
-
由 Zhenghai Zhang 提交于
* add autogen code support for rnn op * fix bug * fix bug
-
由 LoneRanger 提交于
* add autogen code support for lu * fix bug * fix bug * fix bug * fix bug
-
由 ronnywang 提交于
* [CustomDevice] add c_identity op * fix use calc stream
-
由 Xiaoxu Chen 提交于
-
- 17 4月, 2023 8 次提交
-
-
由 Sanbu 提交于
-
由 YuanRisheng 提交于
* unify kernel * fix ci bugs * fix py3 bugs * fix py3 bugs * perfect code
-
由 lzydev 提交于
* fix bug in parse args * fix bug * recover legacy_*.yaml * change 'Out' to Output
-
由 LoneRanger 提交于
-
由 wangzhen38 提交于
* [CINN] fix concat&pow * update concat * composite_backward_api * for ci * for ci * update test & fix opmaker
-
由 JingZhuangzhuang 提交于
-
由 Sonder 提交于
* add register info for eigh and eig_gard * add sync_batch_norm_op.cu register info * add lamb output register info * add unique register info * change type name * change type name * add output register info for check_finite_and_unscale * update cmake and config file * add register info for adagrad * fix build error * add sync to run_unittests.sh * add register info for unique_consecutive * fix build error * add eigh to STATIC_BUILD_TESTS * update eig_kernel.cc * update eig_kernel.cc * fix infer mate error * fix unique register error * fix lamb register info error * fix lamb register info * update lamb register info * fix lamb * remove one Output Register * update static build file * add eigh op to disable_wingpu_test * update run_unittests
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Sync_batch_norm support bfloat16 * fix * fix
-
- 14 4月, 2023 10 次提交
-
-
由 YangQun 提交于
[Zero-Dim] support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion onednn kernels (#52185) * support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion ops * fix gaussian random mkldnn op ut
-
由 Feiyu Chan 提交于
1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributs of various types; (#52408) 2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats(the differences are mainly some operators with incompatible updates in the definition); 3. program version and operator version map now are always saved when serializing `paddle::framework::ProgramDesc` to identify the version; 3. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, it decided whether to convert ProgramDesc back to a legacy format, which is compatible for paddle 2.4.2 or earlier versions to load and execute; 4. deserialization of `paddle::framework::ProgramDesc` is now automatically detecting whether the bytes it receives is in legacy format(contains any of the operators that has been incompatibly updated and have any attribute of type `Scalar`) and convert it to new format. But if you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for the purpose of testing.
-
由 gouzil 提交于
* [phi] move sequence_pool kernel to phi * [phi] mv sequence_pooling to phi funcs * [phi] mv sequence_pooling_test * [phi] RollBACK `paddle/fluid/operators/sequence_ops/sequence_pool_op.cc` * [phi][funcs] fix mutable_data * [phi][funcs] fix mutable_data
-
由 Sonder 提交于
* add kernel functions * update kernel functions * update func parameters' name * create codes for gpu device * 调整文件位置 * fix include error * remove dependent files to phi/ * restore fused_attention_op.cu * fix dependence errors * fix dependence errors * fix include error * fix all depandence errors[build success] * remove useless include * recover useless include * use phi::ToNCCLDataType * fix namespace * update new register code * fix error in fused_gemm_epilogue_utils * fix error in FusedAttentionKernel parm * finish fused_attention registe code[build success] * add paddle::optional * add sig file * fix build error * fix a include error * 恢复正向代码 * update CMkaeList * trans Compute function to phi [build success] * add register code and fix include error [build success] * fix parameter sequence * add include file * update #if before include * update #if before include * fix grammly error * update codes for DropoutParam * remove const cast * trans some fluid api to phi api * remove const cast * trans some fluid api to phi api * add #if * update test code * update test codes * recover test codes * fix namespace and remove fluid include * recover random seed * remove fluid quant_helper * fix include error * include utils in funcs * change include file * move grad codes back to fluid floder * move grad codes back to fluid floder * fix sig file error * update include * recover codes to develop * update register codes * fix build error * recover fluid include * remove some fluid include * remove some fluid include * Update fused_attention_op.cu * remove fluid include * add some fluid include * Update fused_attention_op.cu * Update fused_attention_op.cu * Update fused_attention_op.cu * Update fused_attention_op.cu * remote useless include
-
由 Jiabin Yang 提交于
* add more infer var type * fix split error * fix ut * fix top_k infer vartype * fix top_k infer vartype
-
由 Kim Yann 提交于
-
由 ronnywang 提交于
-
由 huangjiyi 提交于
-
由 huangjiyi 提交于
-
由 huangjiyi 提交于
-
- 13 4月, 2023 6 次提交
-
-
由 Wangzheee 提交于
* Paddle-Trt: Replace fc mul matmul matmul_v2 with matrix_multiply
-
由 jjyaoao 提交于
* delete WITH_ASCEND_CL * delete NPU/ and WITH_MLU
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Remove climits. * Fix bug of c_softmax_with_cross_entropy_op. Support ignore_index is negative number.
-
由 RedContritio 提交于
-
由 Sanbu 提交于
-
由 zhangyuqin1998 提交于
* rename PD_REGISTER_GENERAL_KERNEL * Update feed_op.cc * fix * Update strings_empty_kernel.cc
-