- 13 4月, 2023 3 次提交
-
-
由 jjyaoao 提交于
* remove code with PADDLE_WITH_ASCEND * try pass codestyle
-
由 zhangyuqin1998 提交于
* rename PD_REGISTER_GENERAL_KERNEL * Update feed_op.cc * fix * Update strings_empty_kernel.cc
-
由 risemeup1 提交于
* fix ninja error * fix_ninja_error_qa
-
- 12 4月, 2023 1 次提交
-
-
由 zqw_1997 提交于
* slight modify * support cuda12+ arch, Hopper arch and discard 30 arch * add arch 90 for each paddle_known_gpu_archs12 * for comments
-
- 11 4月, 2023 3 次提交
-
-
由 Yuanle Liu 提交于
-
由 jjyaoao 提交于
* Delete the keyword WITH_ASCEND_INT64 in configure.cmake and CMakeList * try pass Static-Check
-
由 ykkk2333 提交于
-
- 10 4月, 2023 6 次提交
- 07 4月, 2023 1 次提交
-
-
由 Wang Xin 提交于
-
- 06 4月, 2023 3 次提交
-
-
由 RedContritio 提交于
-
由 Galaxy1458 提交于
* delete [-Wno-error=terminate], test=develop * remove GPUps[-Wterminate],test=develop * remove some -Wno-, test=develop
-
由 WJJ1995 提交于
* fixed glog * add * add bfloat16 test for logical op * rm useless code * add uint16 * deal with comments * fixed code style * fixed code style * fixed for ci * deal with comments * fixed for ci
-
- 04 4月, 2023 1 次提交
-
-
由 Galaxy1458 提交于
* delete [-Wno-error=terminate], test=develop * remove GPUps[-Wterminate],test=develop
-
- 03 4月, 2023 3 次提交
-
-
由 engineer1109 提交于
-
由 zhangyuqin1998 提交于
* add kernel register macro for all backend * fix msvc bug * fix --------- Co-authored-by: Nzhangyuqin1998 <2368719379@qq.com>
-
由 risemeup1 提交于
* fix error,test=document_fix * test * fix gcc12_error * fix gcc12_error * fix gcc12_error * fix_gcc12_py3_error * fix_range-loop-construct_error * fix_gcc12_error
-
- 01 4月, 2023 1 次提交
-
-
由 risemeup1 提交于
-
- 31 3月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 30 3月, 2023 2 次提交
-
-
由 ykkk2333 提交于
-
由 huangjiyi 提交于
* update assign_pos * update attention_lstm * update barrier * update batch_fc * update beam_search * update beam_search_decode * update bilateral_slice * fix bug * Handle Structure kernel for InterpreterCore::RunOperator * fix bug * fix rocm compile * fix rocm compile * Revert "fix rocm compile" * test * revert test and update cmake --------- Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
- 29 3月, 2023 5 次提交
-
-
由 chalsliu 提交于
* Fix flashattn build error on jetson * Fix nvcc not found on jetson
-
由 jameszhang 提交于
* [kunlun] support min/max in dygraph mode * update xccl to 1.0.13
-
由 huangjiyi 提交于
* fix kp compile * test * Revert "test" This reverts commit 3a1cbfaa0f23e6e06d3dcd8d0b0c28aa63a98e70. * update copyright * update cmake * update cmake * update cmake * update cmake
-
由 chalsliu 提交于
* Fix jetson conv2d_fusion not found error * Fix jetson conv2d_fusion not found error * Add comment test=document_fix
-
由 sneaxiy 提交于
* fix generate_kernels.py in CUDA 12.0 * fix attrs bug
-
- 28 3月, 2023 2 次提交
-
-
由 Feiyu Chan 提交于
Add basic functionalities to support Scalar & Scalars in operator attribute. 1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types; 2. Scalar enhancement, add formatting, equality; 3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc; 4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`; 5. add functionality to canonicalize attribute map according to OpProto(if the op the attribute map used for has an OpProto); 6. add code to manipulate Scalar proto message via protobuffer python API; Add unittests. 1. add test cases for formatting, equality for Scalars, and WrapAsScalars; 2. add test cases for 'casting' between different morphs of attributes; 3. add test cases for extracting scalar & scalars from attribute; 4. add test cases for CanonicalizeScalarAttrs(and fix a bug in type index offset); 5. fix gmock's library filename on windows platform. 6. clean code: use canonicalize_attrs instead of inlining the function; 7. add test cases for libpaddle.Scalar in python code. 8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.
-
由 HongyuJia 提交于
-
- 27 3月, 2023 2 次提交
- 24 3月, 2023 3 次提交
-
-
由 TaoTao Li 提交于
* add all_reduce, reduce kernel and api * fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove * fix code format fix comments * modify test_collective_reduce_api ut timeout * fix PR-CI-Build fix comments: format phi operator
-
由 risemeup1 提交于
* fix ninja error * fix_lite_ninja_error
-
由 ZhangDY-6483 提交于
* first version, notest * return final rst, notest * use infinity() instead of max * ut structure * start up of ut * generate lse * update * add depense * reconstruct cmake * move file * add memory efficient attention and fix blasimpl * update * update cmake * add namespace * update cmake * use .cu * update for pad3d * bug fix * bug fix * update * bug fix * update enforce * add test case * merge the lse pad * fix kernel_fn of backward * fix PADDLE_ENFORCE_EQ and phi_api * fix PADDLE_ENFORCE * fix PADDLE_ENFORCE * rerun coverage * fix memory efficient attention test * rerun ci * add cuda version condition * add cuda version condition * delete WIP test * replace PADDLE_ENFORCE * edit the namespace of datatype in multiple.cc * rerun * rerun --------- Co-authored-by: Nliuyuang <liuyuang@baidu.com>
-
- 23 3月, 2023 2 次提交
-
-
由 Huang Jiyi 提交于
* update * update * update * update * update * fix test
-
由 zqw_1997 提交于
* to support cuda12, pybind need to upgrade to v2.10.0 * add DEPS of pybind in test_custom_plugin_creater.cc * only change the tag * please let CI pass * try pybind v2.10/3 * modify the include header in test * code check
-