- 12 Apr, 2023 (4 commits)
-
Committed by Yuanle Liu
* update save model
* update
-
Committed by RedContritio
* move python/paddle/fluid/tests/unittests/xpu to test/xpu
* update CMakeLists.txt
* remove xpu in fluid/tests/unittests/
* add path to op_test_xpu
* fix incorrect path
* update test script
* fix test_adadelta_op_xpu error
-
Committed by Guoxia Wang
* [AMP OP&Test] support bf16 for batchnorm
* codestyle
* Update batch_norm_grad_kernel.cu
* Update batch_norm_kernel.cu
* fix codestyle
* fix
* fix
* fix
* fix
* fix
* Update batch_norm_kernel.cc
-
Committed by wanghuancoder
-
- 11 Apr, 2023 (22 commits)
-
Committed by lzydev
-
Committed by Galaxy1458
* delete [-Wno-error=terminate], test=develop
* remove GPUps[-Wterminate],test=develop
* remove some -Wno-, test=develop
* modify ~MatmulDescriptor
* mess
* remove -Wimplicit-fallthrough, test=develop
* remove -Wimplicit-fallthrough, test=develop
* remove -Wimplicit-fallthrough, test=develop
* remove -Wimplicit-fallthrough, test=develop
* remove , test=develop
-
Committed by Yuanle Liu
-
Committed by wz1qqx
-
Committed by Xiaoxu Chen
-
Committed by RedContritio
-
Committed by RedContritio
* support auto generate static for randperm
* remove enforce in randperm infermeta
-
Committed by zhangyuqin1998
-
Committed by wanghuancoder
-
Committed by WJJ1995
* add bfp16 test for isfinite
* fixed for ci
* deal with comments
* fixed test
* skip test in cpu
* deal with comments
* fixed for ci
* fixed testcase
* fixed for ci
* fixed for testcase
-
Committed by wuhuachaocoding
-
Committed by wangzhen38
-
Committed by RedContritio
* fix error in generator/type_mapping.py
* support auto generate for op merged_momentum optimizer
-
Committed by RedContritio
* support auto generate for flatten (flatten_contiguous_range)
* add data_type for flatten_grad
-
Committed by LinearTemporalLogic
* Add output defs for eigh kernel
* fix
* update
* update
* fix
* fix
-
Committed by Wang Xin
* add autogen code support for reverse op
* bug fixed
-
Committed by RedContritio
-
Committed by Thomas Young
-
Committed by RedContritio
* support auto generate for op momentum optimizer
* remove momentum_op.* and update signature
* fix dgc momentum op maker error
-
Committed by ykkk2333
-
Committed by jjyaoao
* remove paddle/infrt/
* delete .lit_test_times.txt
-
Committed by Chitsing KUI
-
- 10 Apr, 2023 (14 commits)
-
Committed by lzydev
* autogen segment_pool
* delete legacy_dygraph about segment_pool
-
Committed by jjyaoao
* delete paddle/fluid/operators/*_npu.*
* try pass CI
* try pass CI
-
Committed by Difer
* add_fp_bf_for_flip_gaussian_random
* forget convert uint
* fix some error
* fix some error
-
Committed by jjyaoao
* delete paddle/fluid/operators/*_npu.*
* try pass code-style
-
Committed by Zhang Ting
* support set master_grad
* move register_hook to auto_cast
* update unittest
* fix fp16 test
* update for review comments
-
Committed by xiaoxiaohehe001
* Support two inputs of multihead attention named qk_multihead
-
Committed by HongyuJia
* [Opt Performance] Optimize custom operator performance, reconstruct python API auto-gen, add cache and use const inference
* opt AutoGradMeta implementation
* remove profiler codes
* fix unit test
* change year, 2021->2023
* fix int64_t parse bug
-
Committed by gouzil
* add autogen code bilinear_tensor_product
* [phi] rm cc file
-
Committed by cyberslack_lee
-
Committed by lzydev
* autogen softmax_with_cross_entropy
* fix error in softmax_with_cross_entropy version
-
Committed by kangguangli
* add strategy force_sequential_run
* remove flag
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
-
Committed by HongyuJia
* [enforce.h Decouple gflags.h] Move gflags.h from enforce.h to enforce.cc
* Add gflags.h for other files
* Add gflags.h for other files
* Add gflags.h for blas_impl.hip.h
* Add gflags.h for miopen_helper.h
-
Committed by Vvsmile
* adjust default tolerance of output and grad
* fix a bug in the grad of OpTest
* fix the type of setting default value in optest, both forward and backward
* add default
* fix test_sum_op
* adjust tolerance
* fix the tolerance of eager
* add bf16 and fp16 to the activation tests
* remove some fixes
* fix activation
* fix fp16
* fix gelu
* fix the activation tests
* add bfloat16 specialization to singrad and cosgrad
* fix bugs
* fix bugs
* add unittest
* add skip
* add fp/bf to rrelu/rrelu_grad
* git add rrelu
* fix bugs
-
Committed by qizhaoaoe
* add fp16 and bf16 support for instance_norm
* fix /= operator which does not support bf16
* fix instance_norm_grad kernel and unittests.
* fix fp32 unittests.
* fix instance_norm_kernel and unittests.
* fix instance_norm_grad_kernel and unittest threshold.
* add fp16/bf16 for instance_norm_grad_grad op.
* add bf16 dtype check.
* fix conflicts.
* fix cpu support for fp32 op and fix type in instance_norm_grad_kernel.
* fix type in instance_norm_kernel.
* fix bf16 outputs in unittests and refine codes.
* fix dx computation.
* delete unuseful params and head including.
* add fp16/bf16 for static graph.
* fix device condition for instance_norm op.
* fix instance_norm_grad_grad and bf16 op tests.
* fix op_test to support grad of bf16 can be compared with fp32.
* remove updates.
* add self-defined grad.
-