- 03 1月, 2023 8 次提交
-
-
由 Charles-hit 提交于
* move fc from fluid to paddle2.0 * fix unit test * fix some examples * fix some examples
-
由 GGBond8488 提交于
* remove fluid.save_dygraph and fluid.load_dygraph use paddle.save and paddle.load instead * move Sequential to paddle.nn * modify convert_call_func.py Sequential reference * remove related unitests * remove fluid.dynamic.Sequntial * test remove conver_call_func * fix conflicts * fix typro * fix unitests * fix sample_code * fix unitest * fix __init__
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
由 Aurelius84 提交于
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op * add GetExpectedKernelType
-
由 zhaoyingli 提交于
* [Zero-Dim] reshape/reshape_/reverse 0D support * rm comment * change paddle.to_tensor to paddle.full * fix docs * update paddle.full
-
由 骑马小猫 提交于
-
由 Sanbu 提交于
-
由 Jianghai 提交于
* relu flops all * add annotations and tests * revision for codestyle
-
- 02 1月, 2023 1 次提交
-
-
由 Hulek 提交于
-
- 31 12月, 2022 1 次提交
-
-
由 caozhou 提交于
-
- 30 12月, 2022 10 次提交
-
-
由 xiongkun 提交于
* bugfix: fix bugs in Indexable and support LayerDict * fix bugs.
-
由 wangxinxin08 提交于
* check weight shape of conv1d_transpose * add unittest case
-
由 HongyuJia 提交于
* add custom_cpu testcase * update test_custom_device_setup * update path to custom_runtime * fix cmd wait * test Linux only * setup once * integrate to one run_cmd * add pip install * change timeout * add debug string * add debug string * add debug string * use os.system and change module name * add runtime * add more debug message * continue debug * timestamp * fix testcase import bug * remove error message * set TIMEOUT property
-
由 risemeup1 提交于
* delete batch_norm * test * test * test * test * test * recover cmake_gen * debug
-
由 Roc 提交于
-
由 HongyuJia 提交于
* clean custom_xpu testcase test_static_pe * use assert_allclose to solve precision error * adjust precision * flatten tensor * fix flatten
-
由 zhouzj 提交于
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 WangZhen 提交于
* Fix default GetExpectedKernelType for ops supported tensor attrs
-
由 姜永久 提交于
* rm legacy * clear in_legacy * fix tracer
-
- 29 12月, 2022 6 次提交
-
-
由 wangzhen38 提交于
* [fluid remove] rawconv
-
由 Lin Manhui 提交于
-
由 xu98bin 提交于
* auto parallel bf16
-
由 姜永久 提交于
* rm legacy dygraph part7 * rm non_static_mode * modify * modify * add static test * set static for lstm_cudnn test * reset tracer * reset varbase * fix
-
由 Wang Bojun 提交于
* fusedAttenGrad_noGrad * code style fix * add ut * remove unnecessary log
-
由 姜永久 提交于
* rm legacy layers part6 * rm non_static_mode * modify non_static * minor change * rm loss * rm in_legacy part8 * minor change
-
- 28 12月, 2022 9 次提交
-
-
由 RichardWooSJTU 提交于
-
由 姜永久 提交于
* rm legacy nn part2 * rm _non_static_mode * modify * modify unpool test * modify unpool test * modify loss * keep legacy for layer_norm
-
由 zqw_1997 提交于
remove fluid.contrib.fused_elemwise_activation, sequence_topk_avg_pooling, var_conv_2d, match_matrix_tensor and tree_conv (#49331)
-
由 Leo Chen 提交于
* add skip run * alloc minimum memory * skip check_size in Alloc * skip check_size in Alloc * skip check_size in Alloc * fix cases when tensor is initialized or empty * alloc empty output for place info * add test * increase timeout * format code * skip cpu * add cudnn_deterministic * fit for hostAlloc * follow comments * change check_size to fake_alloc
-
由 xiongkun 提交于
* einsum support 0d tensor. 1. support 0d tensor in multi-operands. 2. add 9 unittests for einsum 0d tensor. * override NVIDIA_TF32_OVERRIDE to avoid accuracy problem in 11.2 and 11.8
-
由 zhaoyingli 提交于
* [AutoParallel] adapt for clip * fix unittest * enable_static * fix dist_fill_constant_batch_size_like * fix process_mesh.shape * update cond of modifying shape_list
-
由 姜永久 提交于
* rm legacy fluid part4 * rm non_static_mode * minor change * modify initializer * rm legacy for initializer * fix dataloader test
-
由 Huihuang Zheng 提交于
This PR increased the delta in unit test for CUDA 11.8. The reason of this fix: (1) It seems CUDA 11.8 has higher delta in accuracy result. Our other targets for seresnext under parallel executor have already added delta such as CPU, all reduce test cases, so we did same for GPU base case with CUDA 11.8 (2) A new executor is under developing in PaddlePaddle team, so the unit test for old executor can be relaxed.
-
由 wanghuancoder 提交于
* delete old dygraph pylayer
-
- 27 12月, 2022 5 次提交
-
-
由 zhouweiwei2014 提交于
-
由 zhangyikun02 提交于
-
由 xiaoting 提交于
* fix fold for large bs * fix fold for large bs
-
由 zhaoyingli 提交于
* fix input order * add unittest * update cmakelist
-
由 zhaoyingli 提交于
* [AutoParallel] quantization pass support export * support subgraph * move_presist_var_to_global_block * update unittest * fix ci-coverage * fix codestyle * fix fake_dequantize_op * remove unused var * fix ci error and aprroval error * add unittest for fp16 in test_dequant_linear * replace mutable data * fix unittest in non-cuda-core * fix unittest Co-authored-by: Ncarryyu <569782149@qq.com> Co-authored-by: Nwufeisheng <wfs1997@163.com>
-