- 29 3月, 2023 10 次提交
-
-
由 chenxujun 提交于
-
由 ShenLiang 提交于
* fix bg * add utest * add utest
-
由 xiaoguoguo626807 提交于
* tanh_double_grad_rules * delete log got api_base * modify composite yaml * optimize rules
-
由 zhupengyang 提交于
-
由 yuehuayingxueluo 提交于
* add fuse adamw pass * fix some bugs * fix CIbug * change chunk_size * fix CI bug * rm test_fused_adam_op.py * fix CI bugs * fix fuse_adamw_op_pass.cc * change code style * fix CI bug * fix ut bug and use_adamw_op_pass.cc * fix test_fuse_adamw_pass.py * fix CI bug * remove fluid * fix ci bug * fix CI bug
-
由 zhangyikun02 提交于
-
由 ronnywang 提交于
-
由 傅剑寒 提交于
-
由 YuhangLi 提交于
-
由 sneaxiy 提交于
* fix generate_kernels.py in CUDA 12.0 * fix attrs bug
-
- 28 3月, 2023 13 次提交
-
-
由 Feiyu Chan 提交于
Add basic functionalities to support Scalar & Scalars in operator attribute. 1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types; 2. Scalar enhancement, add formatting, equality; 3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc; 4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`; 5. add functionality to canonicalize attribute map according to OpProto(if the op the attribute map used for has an OpProto); 6. add code to manipulate Scalar proto message via protobuffer python API; Add unittests. 1. add test cases for formatting, equality for Scalars, and WrapAsScalars; 2. add test cases for 'casting' between different morphs of attributes; 3. add test cases for extracting scalar & scalars from attribute; 4. add test cases for CanonicalizeScalarAttrs(and fix a bug in type index offset); 5. fix gmock's library filename on windows platform. 6. clean code: use canonicalize_attrs instead of inlining the function; 7. add test cases for libpaddle.Scalar in python code. 8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.
-
由 cyberslack_lee 提交于
-
由 张春乔 提交于
* mv cumprod * add attrs * Update backward.yaml * Update backward.yaml
-
由 Leo Chen 提交于
* add flag to set chunk size * use the flag * add vlog * add ut * rename ut
-
由 sneaxiy 提交于
* add overflow check in memory efficient attention * fix ci compile error * fix ci compile error
-
由 houj04 提交于
* fix int8 support for full kernel * fix ut.
-
由 cyberslack_lee 提交于
* fix huber_loss * fix * fix ops.yaml add intermediate * fix * fix test
-
由 RedContritio 提交于
* support auto generate static for one_hot_v2 * format
-
由 Wang Xin 提交于
-
由 RedContritio 提交于
* support auto generate for log_softmax * add data_type
-
由 Haohongxiang 提交于
-
由 wangxinxin08 提交于
* add unittest for conv2d/depthwise_conv2d/conv2d_transpose * add bf16 for DWConv and ConvTranspose * fix unitest of conv2d_transpose * modify DWConv2d op and unittest * fix unittest of conv2d_transpose_bf16 * modify unittest name according to review * modify atol of DWConv2D unittest
-
由 csy0225 提交于
-
- 27 3月, 2023 13 次提交
-
-
由 ZhangDY-6483 提交于
-
由 zhouweiwei2014 提交于
-
由 HappyHeavyRain 提交于
* add fused_yaml fused_backward * fix eager_funciton bug * add some comment of fused yaml file * add 'support_dygraph_mode' configuration in fused yaml * delete some 'fused_api.h' in include file * add fused flag in api_gen
-
由 Xinyu Chen 提交于
-
由 HongyuJia 提交于
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output * delete dtype,shape func of multi_inplace op * [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output
-
由 HappyHeavyRain 提交于
* support assign op * support assign infer_var_type * change code according to review * change code according to review * only save 'get_infer_var_type_func' * rest file mode
-
由 Leo Chen 提交于
* unbind support bool dtype * replace np.array_equal
-
由 Leo Guo 提交于
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
-
由 risemeup1 提交于
* fix_gcc12_error * fix gcc12 error * fix gcc12 error
-
由 risemeup1 提交于
* fix_gcc12_error * patch on eigen3 for fixing gcc12 error * Update multiary.cc
-
由 Sławomir Siwek 提交于
* extract Op and OPMaker to .h * extend pattern for fused_op * set "with_residual" default to false * adjust fuse passes * remove fc+eltwise flag * fused_output_scale * activation attrs * remove extra attrs * fix int8/bf16 unit tests * simplify RecomputeOutputDims * remove unused method * Add description for attributes * add extra check * adjust op compats * update quantize test * fix protobuf parsing error * fix int8 performance * fused elementwises * merge develop * remove activation * restore activation for existing add/sub ops
-
由 houj04 提交于
-
由 sneaxiy 提交于
* fix mea compile error * support 2-D bias * add inline to avoid compile error * polish codes
-
- 25 3月, 2023 1 次提交
-
-
由 Ruibin Cheung 提交于
[Fix Bug] fix get_new_shape and get_new_data_from_tensor not support fallback to CPU on custom device (#52002)
-
- 24 3月, 2023 3 次提交
-
-
由 TaoTao Li 提交于
* add all_reduce, reduce kernel and api * fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove * fix code format fix comments * modify test_collective_reduce_api ut timeout * fix PR-CI-Build fix comments: format phi operator
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs * decouple memory * deal with conflict * fix xpu compile bugs * fix xpu bugs * deal with xpu bugs * fix cmake bugs * fix windows bugs * fix ci bugs * fix ci bugs * delete redundance code * add code for pybind * fix py3 bugs * fix ci bugs
-
由 PuQing 提交于
* fix momentum dtype infer * fix momentum datatype * fix on cpu * add momentum
-