- 09 1月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 07 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Diable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel
-
- 06 1月, 2023 12 次提交
-
-
由 Guanghua Yu 提交于
-
由 JYChen 提交于
* add 0-d support for paddle.kthvalue * add 0-d support for paddle.mode * fix coverage test for device * fix check-bug in windows * change axis check from LT to LE * add shape & value check for grad when input is 0d tensor
-
由 seemingwang 提交于
* zero-tensor * remove unused * zero_dim_xpu * relocate * add value test * fix syntax
-
由 Aurelius84 提交于
* [D2SCinn]Add test_cinn unittest and param_grad into skip_gc_vars * remove print
-
由 HongyuJia 提交于
* acquire custom_deivce stream * regulate file name and unittest
-
由 jiangcheng 提交于
* flatten op support 0D-tensor * add test in zero dim py * fix shape should be list * short code for ci-coverage * add backward test * simple code for ci coverage * add axis check * add 0D-tensor test in test_flatten_contiguous_range_op.py * add axis error test for Coverage CI * add more test for CI-Coverage * add more test for CI-Coverage
-
由 Weilong Wu 提交于
-
由 Weilong Wu 提交于
-
由 Nyakku Shigure 提交于
-
由 张春乔 提交于
-
由 niuliling123 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 17 次提交
-
-
由 Feiyu Chan 提交于
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
-
由 Siming Dai 提交于
* support 0D for paddle.sort/argsort * support 0D tensor for paddle.sort/argsort in xpu * fix bug * fix grad and add value assertion
-
由 HappyHeavyRain 提交于
* generate the static graph code of ops * modify the isclose comment * modify the clip comment in nn.py * reset nn.py
-
由 sneaxiy 提交于
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
由 Jianghai 提交于
* add pool flops * add annotations and tests
-
由 Infinity_lee 提交于
-
由 zhouzj 提交于
-
由 ucsk 提交于
-
由 姜永久 提交于
* rm op_function_generator * rm op_func_generator.h * rm op_function * modify cmake * rm op_function.h * rm check for op_function_generator.cc * reset imperative * rm python part * fix imperative * lint * lint * modify legacy_c * review * modify * modify legacy * rm gen op_functions code * reset framework * rm core.ops for test * core.ops->core.eager.ops.legacy * not raiseError for xpu
-
由 Wilber 提交于
-
由 Yuang Liu 提交于
-
由 Weilong Wu 提交于
* [Eager] optimize same python api logic * optimize full api * optimize logic * optimize logic
-
由 Huihuang Zheng 提交于
Add 0d Tensor Test Cases for cond, case, switch_case. Since the 3 APIs are control flow APIs, their support for 0d tensor relies on the underneath APIs. This PR just added test cases to prove that the 3 APIs have already handled 0d tensor well.
-
由 Yuang Liu 提交于
-
由 Weilong Wu 提交于
-
由 zhangkaihuo 提交于
-
- 04 1月, 2023 9 次提交
-
-
由 houj04 提交于
-
由 Guanghua Yu 提交于
-
由 Wilber 提交于
-
由 Roc 提交于
-
由 JZ-LIANG 提交于
* remove deps and prior comm * grad comm fuse * add deps for amp&global norm * stage2 broadcast prior deps * stage2 grad overlap * stream_analyzer bugfix * overlap enable * dep op namescope * depend support multiple inputs * check finite deps * stage2 param comm overlap * Set kD2HStream * grad comm hierarchical * grad comm hierarchical * new unitest Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
由 张春乔 提交于
-
由 Matsumoto Ruko 提交于
* update requirements.txt and setup.py.in * update requirements.txt setup.py.in setup.py
-
由 Sławomir Siwek 提交于
This reverts commit 2c444dfa.
-
由 张春乔 提交于
* add for-else * add * for unpacking
-