- 09 1月, 2023 5 次提交
-
-
由 Jiabin Yang 提交于
* proto type of composite grad in paddle * proto type of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify igmnore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw expection when both GrapOpMaker and GradCompOpMaker not been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singlton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove addtional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com> Co-authored-by: Nwangruting <wangruting@baidu.com> Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Ncharles-hit <wanghao107@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
由 wangzhen38 提交于
-
由 zhaoyingli 提交于
* [AutoParalle] balancing the calculation of global_norm in data parallel * fix unittest * update cond pure_data_parallel
-
由 xiongkun 提交于
* [Dy2static] Add ParameterRecorder * filter by shape(tensor)==0 * fix code by review * fix random failed in CI. (especially coverage) * fix bugs * remove API changes to avoid static CI approval
-
由 LiYuRio 提交于
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 07 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Diable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel
-
- 06 1月, 2023 7 次提交
-
-
由 JYChen 提交于
* add 0-d support for paddle.kthvalue * add 0-d support for paddle.mode * fix coverage test for device * fix check-bug in windows * change axis check from LT to LE * add shape & value check for grad when input is 0d tensor
-
由 seemingwang 提交于
* zero-tensor * remove unused * zero_dim_xpu * relocate * add value test * fix syntax
-
由 Aurelius84 提交于
* [D2SCinn]Add test_cinn unittest and param_grad into skip_gc_vars * remove print
-
由 jiangcheng 提交于
* flatten op support 0D-tensor * add test in zero dim py * fix shape should be list * short code for ci-coverage * add backward test * simple code for ci coverage * add axis check * add 0D-tensor test in test_flatten_contiguous_range_op.py * add axis error test for Coverage CI * add more test for CI-Coverage * add more test for CI-Coverage
-
由 Nyakku Shigure 提交于
-
由 niuliling123 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 9 次提交
-
-
由 Feiyu Chan 提交于
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
-
由 Siming Dai 提交于
* support 0D for paddle.sort/argsort * support 0D tensor for paddle.sort/argsort in xpu * fix bug * fix grad and add value assertion
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
由 Jianghai 提交于
* add pool flops * add annotations and tests
-
由 姜永久 提交于
* rm op_function_generator * rm op_func_generator.h * rm op_function * modify cmake * rm op_function.h * rm check for op_function_generator.cc * reset imperative * rm python part * fix imperative * lint * lint * modify legacy_c * review * modify * modify legacy * rm gen op_functions code * reset framework * rm core.ops for test * core.ops->core.eager.ops.legacy * not raiseError for xpu
-
由 Wilber 提交于
-
由 Huihuang Zheng 提交于
Add 0d Tensor Test Cases for cond, case, switch_case. Since the 3 APIs are control flow APIs, their support for 0d tensor relies on the underneath APIs. This PR just added test cases to prove that the 3 APIs have already handled 0d tensor well.
-
由 Yuang Liu 提交于
-
由 zhangkaihuo 提交于
-
- 04 1月, 2023 5 次提交
-
-
由 houj04 提交于
-
由 Wilber 提交于
-
由 JZ-LIANG 提交于
* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
由 Sławomir Siwek 提交于
This reverts commit 2c444dfa.
-
由 张春乔 提交于
* add for-else * add * for unpacking
-
- 03 1月, 2023 10 次提交
-
-
由 WangZhen 提交于
* Fix param and out grad names in dy2st for high order grad
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces
-
由 Charles-hit 提交于
* move fc from fluid to paddle2.0 * fix unit test * fix some examples * fix some examples
-
由 GGBond8488 提交于
* remove fluid.save_dygraph and fluid.load_dygraph use paddle.save and paddle.load instead * move Sequential to paddle.nn * modify convert_call_func.py Sequential reference * remove related unitests * remove fluid.dynamic.Sequntial * test remove conver_call_func * fix conflicts * fix typro * fix unitests * fix sample_code * fix unitest * fix __init__
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
由 Aurelius84 提交于
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op * add GetExpectedKernelType
-
由 zhaoyingli 提交于
* [Zero-Dim] reshape/reshape_/reverse 0D support * rm comment * change paddle.to_tensor to paddle.full * fix docs * update paddle.full
-
由 骑马小猫 提交于
-
由 Sanbu 提交于
-
由 Jianghai 提交于
* relu flops all * add annotations and tests * revision for codestyle
-
- 02 1月, 2023 1 次提交
-
-
由 Hulek 提交于
-
- 31 12月, 2022 1 次提交
-
-
由 caozhou 提交于
-
- 30 12月, 2022 1 次提交
-
-
由 xiongkun 提交于
* bugfix: fix bugs in Indexable and support LayerDict * fix bugs.
-