- 31 3月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 30 3月, 2023 2 次提交
-
-
由 ykkk2333 提交于
-
由 huangjiyi 提交于
* update assign_pos * update attention_lstm * update barrier * update batch_fc * update beam_search * update beam_search_decode * update bilateral_slice * fix bug * Handle Structure kernel for InterpreterCore::RunOperator * fix bug * fix rocm compile * fix rocm compile * Revert "fix rocm compile" * test * revert test and update cmake --------- Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
- 29 3月, 2023 5 次提交
-
-
由 chalsliu 提交于
* Fix flashattn build error on jetson * Fix nvcc not found on jetson
-
由 jameszhang 提交于
* [kunlun] support min/max in dygraph mode * update xccl to 1.0.13
-
由 huangjiyi 提交于
* fix kp compile * test * Revert "test" This reverts commit 3a1cbfaa0f23e6e06d3dcd8d0b0c28aa63a98e70. * update copyright * update cmake * update cmake * update cmake * update cmake
-
由 chalsliu 提交于
* Fix jetson conv2d_fusion not found error * Fix jetson conv2d_fusion not found error * Add comment test=document_fix
-
由 sneaxiy 提交于
* fix generate_kernels.py in CUDA 12.0 * fix attrs bug
-
- 28 3月, 2023 2 次提交
-
-
由 Feiyu Chan 提交于
Add basic functionalities to support Scalar & Scalars in operator attribute. 1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types; 2. Scalar enhancement, add formatting, equality; 3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc; 4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`; 5. add functionality to canonicalize attribute map according to OpProto(if the op the attribute map used for has an OpProto); 6. add code to manipulate Scalar proto message via protobuffer python API; Add unittests. 1. add test cases for formatting, equality for Scalars, and WrapAsScalars; 2. add test cases for 'casting' between different morphs of attributes; 3. add test cases for extracting scalar & scalars from attribute; 4. add test cases for CanonicalizeScalarAttrs(and fix a bug in type index offset); 5. fix gmock's library filename on windows platform. 6. clean code: use canonicalize_attrs instead of inlining the function; 7. add test cases for libpaddle.Scalar in python code. 8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.
-
由 HongyuJia 提交于
-
- 27 3月, 2023 2 次提交
- 24 3月, 2023 3 次提交
-
-
由 TaoTao Li 提交于
* add all_reduce, reduce kernel and api * fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove * fix code format fix comments * modify test_collective_reduce_api ut timeout * fix PR-CI-Build fix comments: format phi operator
-
由 risemeup1 提交于
* fix ninja error * fix_lite_ninja_error
-
由 ZhangDY-6483 提交于
* first version, notest * return final rst, notest * use infinity() instead of max * ut structure * start up of ut * generate lse * update * add depense * reconstruct cmake * move file * add memory efficient attention and fix blasimpl * update * update cmake * add namespace * update cmake * use .cu * update for pad3d * bug fix * bug fix * update * bug fix * update enforce * add test case * merge the lse pad * fix kernel_fn of backward * fix PADDLE_ENFORCE_EQ and phi_api * fix PADDLE_ENFORCE * fix PADDLE_ENFORCE * rerun coverage * fix memory efficient attention test * rerun ci * add cuda version condition * add cuda version condition * delete WIP test * replace PADDLE_ENFORCE * edit the namespace of datatype in multiple.cc * rerun * rerun --------- Co-authored-by: Nliuyuang <liuyuang@baidu.com>
-
- 23 3月, 2023 2 次提交
-
-
由 Huang Jiyi 提交于
* update * update * update * update * update * fix test
-
由 zqw_1997 提交于
* to support cuda12, pybind need to upgrade to v2.10.0 * add DEPS of pybind in test_custom_plugin_creater.cc * only change the tag * please let CI pass * try pybind v2.10/3 * modify the include header in test * code check
-
- 22 3月, 2023 1 次提交
-
-
由 risemeup1 提交于
* fix ninja error * fix_ninja_error * fix ninja error * fix r-200 ci ninja error
-
- 21 3月, 2023 2 次提交
- 20 3月, 2023 2 次提交
- 15 3月, 2023 2 次提交
- 13 3月, 2023 2 次提交
-
-
由 shentanyue 提交于
[Lite] Change the source code integration of Paddle Lite to the compilation library integration (#51405)
-
由 houj04 提交于
* [XPU] add increment op. * fix ci
-
- 10 3月, 2023 1 次提交
-
-
由 zhangyikun02 提交于
-
- 09 3月, 2023 3 次提交
-
-
由 pangengzheng 提交于
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict
-
由 risemeup1 提交于
-
由 Wang Xin 提交于
-
- 08 3月, 2023 2 次提交
-
-
由 pangyoki 提交于
-
由 Chitsing KUI 提交于
-
- 07 3月, 2023 2 次提交
-
-
由 risemeup1 提交于
-
由 Chen Weihang 提交于
-
- 06 3月, 2023 1 次提交
-
-
由 mayang002 提交于
-
- 03 3月, 2023 2 次提交
-
-
由 danleifeng 提交于
-
由 risemeup1 提交于
* patch on gloo/types.h * fix patch * change patch dir * add patch
-
- 02 3月, 2023 1 次提交
-
-
由 risemeup1 提交于
* fix gcc12 error * patch on device.cc * fix gcc error while compiling gloo
-
- 01 3月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
* flash attn * seed * almost * softmax * fix workspace * add unitest; linux only * fix setup * fix datatype include * fix setup typo * fix def scope * new error api * use paddle fork * fix attr bug; complete ut * update flash hash * fix rng reset * fix offset * fix comments
-