- 09 1月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 07 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Diable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel
-
- 06 1月, 2023 11 次提交
-
-
由 Yuanle Liu 提交于
-
由 Guanghua Yu 提交于
-
由 wangzhen38 提交于
-
由 Yuanle Liu 提交于
-
由 Aurelius84 提交于
* [D2SCinn]Add test_cinn unittest and param_grad into skip_gc_vars * remove print
-
由 jiangcheng 提交于
* flatten op support 0D-tensor * add test in zero dim py * fix shape should be list * short code for ci-coverage * add backward test * simple code for ci coverage * add axis check * add 0D-tensor test in test_flatten_contiguous_range_op.py * add axis error test for Coverage CI * add more test for CI-Coverage * add more test for CI-Coverage
-
由 Wen Sun 提交于
* fix: fix hidden virtual funcs * fix: add default impl
-
由 MarDino 提交于
-
由 张春乔 提交于
-
由 niuliling123 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 14 次提交
-
-
由 Feiyu Chan 提交于
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
-
由 xiaoxiaohehe001 提交于
-
由 HappyHeavyRain 提交于
* generate the static graph code of ops * modify the isclose comment * modify the clip comment in nn.py * reset nn.py
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
由 jiangcheng 提交于
* CINN add fetch op for skip gc vars * perfect test annotation * break if not is_only_used_internal * move skip_gc_var_names get out of for loop
-
由 Ruibiao Chen 提交于
-
由 姜永久 提交于
* rm op_function_generator * rm op_func_generator.h * rm op_function * modify cmake * rm op_function.h * rm check for op_function_generator.cc * reset imperative * rm python part * fix imperative * lint * lint * modify legacy_c * review * modify * modify legacy * rm gen op_functions code * reset framework * rm core.ops for test * core.ops->core.eager.ops.legacy * not raiseError for xpu
-
由 zyfncg 提交于
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
-
由 Wen Sun 提交于
* refactor: use base class * fix: incorrect deps * fix: add missing header * refactor: update class structures * fix: bkcl typo * fix: remove redundant def
-
由 TeFeng Chen 提交于
* delivery skip_gc_vars from the main graph to each subgraph compiled by CINN * rearrange format and annotation * fix lacking namespace * fix segmentation fault cinn subgraph doesn't own kSkipGcVarNames * deliver all skip_gc_vars of main graph * add vlog for skip_gc_vars
-
由 Yuang Liu 提交于
-
由 GaoYuYang 提交于
-
- 04 1月, 2023 7 次提交
-
-
由 Aurelius84 提交于
-
由 Yuanle Liu 提交于
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
-
由 Sławomir Siwek 提交于
This reverts commit 2c444dfa.
-
由 HongyuJia 提交于
* execute use kernel_key first * change OpKernelType->KernelKey * fix py3 compile error, remove redundant header files * fix build_strategy_test * fix DataType::RAW * fix custom_type test: operator_test.cc * fix transform place * fix backends_are_same_class * try fix place TransDataDevice * support all KernelKey * fix TransformData * fix place_are_same_class * fix merge * fix test_params_no_grad * fix specific place of GetExpectedKernelType * fix specific place of GetExpectedKernelType * fix GetKernelTypeForVar * fix dtype error * fix fetch_v2 * change GetKernelTypeForVar * fix interpreter * fix typo error * polish codes * polish codes * polish codes * fix conflict
-
由 lzy 提交于
-
- 03 1月, 2023 6 次提交
-
-
由 wangzhen38 提交于
-
由 WangZhen 提交于
* Fix param and out grad names in dy2st for high order grad
-
由 Yuanle Liu 提交于
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces
-
由 kangguangli 提交于
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-