- 06 1月, 2023 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 14 次提交
-
-
由 Feiyu Chan 提交于
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
-
由 xiaoxiaohehe001 提交于
-
由 HappyHeavyRain 提交于
* generate the static graph code of ops * modify the isclose comment * modify the clip comment in nn.py * reset nn.py
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
由 jiangcheng 提交于
* CINN add fetch op for skip gc vars * perfect test annotation * break if not is_only_used_internal * move skip_gc_var_names get out of for loop
-
由 Ruibiao Chen 提交于
-
由 姜永久 提交于
* rm op_function_generator * rm op_func_generator.h * rm op_function * modify cmake * rm op_function.h * rm check for op_function_generator.cc * reset imperative * rm python part * fix imperative * lint * lint * modify legacy_c * review * modify * modify legacy * rm gen op_functions code * reset framework * rm core.ops for test * core.ops->core.eager.ops.legacy * not raiseError for xpu
-
由 zyfncg 提交于
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
-
由 Wen Sun 提交于
* refactor: use base class * fix: incorrect deps * fix: add missing header * refactor: update class structures * fix: bkcl typo * fix: remove redundant def
-
由 TeFeng Chen 提交于
* delivery skip_gc_vars from the main graph to each subgraph compiled by CINN * rearrange format and annotation * fix lacking namespace * fix segmentation fault cinn subgraph doesn't own kSkipGcVarNames * deliver all skip_gc_vars of main graph * add vlog for skip_gc_vars
-
由 Yuang Liu 提交于
-
由 GaoYuYang 提交于
-
- 04 1月, 2023 7 次提交
-
-
由 Aurelius84 提交于
-
由 Yuanle Liu 提交于
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
-
由 Sławomir Siwek 提交于
This reverts commit 2c444dfa.
-
由 HongyuJia 提交于
* execute use kernel_key first * change OpKernelType->KernelKey * fix py3 compile error, remove redundant header files * fix build_strategy_test * fix DataType::RAW * fix custom_type test: operator_test.cc * fix transform place * fix backends_are_same_class * try fix place TransDataDevice * support all KernelKey * fix TransformData * fix place_are_same_class * fix merge * fix test_params_no_grad * fix specific place of GetExpectedKernelType * fix specific place of GetExpectedKernelType * fix GetKernelTypeForVar * fix dtype error * fix fetch_v2 * change GetKernelTypeForVar * fix interpreter * fix typo error * polish codes * polish codes * polish codes * fix conflict
-
由 lzy 提交于
-
- 03 1月, 2023 10 次提交
-
-
由 wangzhen38 提交于
-
由 WangZhen 提交于
* Fix param and out grad names in dy2st for high order grad
-
由 Yuanle Liu 提交于
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces
-
由 kangguangli 提交于
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
由 Aurelius84 提交于
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op * add GetExpectedKernelType
-
由 zhaoyingli 提交于
* [Zero-Dim] reshape/reshape_/reverse 0D support * rm comment * change paddle.to_tensor to paddle.full * fix docs * update paddle.full
-
由 zhoutianzi666 提交于
-
由 Sanbu 提交于
-
- 02 1月, 2023 1 次提交
-
-
由 Hulek 提交于
-
- 01 1月, 2023 1 次提交
-
-
由 gem5 提交于
-
- 30 12月, 2022 6 次提交
-
-
由 zyfncg 提交于
* fix test_conv_bn_fuse_pass_cc * remove comment
-
由 Zhang Jun 提交于
* update conv to convNd * trigger ci
-
由 zyfncg 提交于
* support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * add extra output in op_compat * remove debug log
-
由 HongyuJia 提交于
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 WangZhen 提交于
* Fix default GetExpectedKernelType for ops supported tensor attrs
-