- 06 1月, 2023 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 9 次提交
-
-
由 Feiyu Chan 提交于
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
-
由 Siming Dai 提交于
* support 0D for paddle.sort/argsort * support 0D tensor for paddle.sort/argsort in xpu * fix bug * fix grad and add value assertion
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
由 Jianghai 提交于
* add pool flops * add annotations and tests
-
由 姜永久 提交于
* rm op_function_generator * rm op_func_generator.h * rm op_function * modify cmake * rm op_function.h * rm check for op_function_generator.cc * reset imperative * rm python part * fix imperative * lint * lint * modify legacy_c * review * modify * modify legacy * rm gen op_functions code * reset framework * rm core.ops for test * core.ops->core.eager.ops.legacy * not raiseError for xpu
-
由 Wilber 提交于
-
由 Huihuang Zheng 提交于
Add 0d Tensor Test Cases for cond, case, switch_case. Since the 3 APIs are control flow APIs, their support for 0d tensor relies on the underneath APIs. This PR just added test cases to prove that the 3 APIs have already handled 0d tensor well.
-
由 Yuang Liu 提交于
-
由 zhangkaihuo 提交于
-
- 04 1月, 2023 5 次提交
-
-
由 houj04 提交于
-
由 Wilber 提交于
-
由 JZ-LIANG 提交于
* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
由 Sławomir Siwek 提交于
This reverts commit 2c444dfa.
-
由 张春乔 提交于
* add for-else * add * for unpacking
-
- 03 1月, 2023 10 次提交
-
-
由 WangZhen 提交于
* Fix param and out grad names in dy2st for high order grad
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces
-
由 Charles-hit 提交于
* move fc from fluid to paddle2.0 * fix unit test * fix some examples * fix some examples
-
由 GGBond8488 提交于
* remove fluid.save_dygraph and fluid.load_dygraph use paddle.save and paddle.load instead * move Sequential to paddle.nn * modify convert_call_func.py Sequential reference * remove related unitests * remove fluid.dynamic.Sequntial * test remove conver_call_func * fix conflicts * fix typro * fix unitests * fix sample_code * fix unitest * fix __init__
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
由 Aurelius84 提交于
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op * add GetExpectedKernelType
-
由 zhaoyingli 提交于
* [Zero-Dim] reshape/reshape_/reverse 0D support * rm comment * change paddle.to_tensor to paddle.full * fix docs * update paddle.full
-
由 骑马小猫 提交于
-
由 Sanbu 提交于
-
由 Jianghai 提交于
* relu flops all * add annotations and tests * revision for codestyle
-
- 02 1月, 2023 1 次提交
-
-
由 Hulek 提交于
-
- 31 12月, 2022 1 次提交
-
-
由 caozhou 提交于
-
- 30 12月, 2022 9 次提交
-
-
由 xiongkun 提交于
* bugfix: fix bugs in Indexable and support LayerDict * fix bugs.
-
由 wangxinxin08 提交于
* check weight shape of conv1d_transpose * add unittest case
-
由 HongyuJia 提交于
* add custom_cpu testcase * update test_custom_device_setup * update path to custom_runtime * fix cmd wait * test Linux only * setup once * integrate to one run_cmd * add pip install * change timeout * add debug string * add debug string * add debug string * use os.system and change module name * add runtime * add more debug message * continue debug * timestamp * fix testcase import bug * remove error message * set TIMEOUT property
-
由 Roc 提交于
-
由 HongyuJia 提交于
* clean custom_xpu testcase test_static_pe * use assert_allclose to solve precision error * adjust precision * flatten tensor * fix flatten
-
由 zhouzj 提交于
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 WangZhen 提交于
* Fix default GetExpectedKernelType for ops supported tensor attrs
-
由 姜永久 提交于
* rm legacy * clear in_legacy * fix tracer
-
- 29 12月, 2022 4 次提交
-
-
由 Lin Manhui 提交于
-
由 xu98bin 提交于
* auto parallel bf16
-
由 姜永久 提交于
* rm legacy dygraph part7 * rm non_static_mode * modify * modify * add static test * set static for lstm_cudnn test * reset tracer * reset varbase * fix
-
由 Wang Bojun 提交于
* fusedAttenGrad_noGrad * code style fix * add ut * remove unnecessary log
-