- 06 1月, 2023 4 次提交
-
-
由 Thomas Young 提交于
-
由 张春乔 提交于
-
由 niuliling123 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 18 次提交
-
-
由 Feiyu Chan 提交于
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
-
由 Siming Dai 提交于
* support 0D for paddle.sort/argsort * support 0D tensor for paddle.sort/argsort in xpu * fix bug * fix grad and add value assertion
-
由 xiaoxiaohehe001 提交于
-
由 HappyHeavyRain 提交于
* generate the static graph code of ops * modify the isclose comment * modify the clip comment in nn.py * reset nn.py
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
由 jiangcheng 提交于
* CINN add fetch op for skip gc vars * perfect test annotation * break if not is_only_used_internal * move skip_gc_var_names get out of for loop
-
由 Zhong Hui 提交于
-
由 Ruibiao Chen 提交于
-
由 姜永久 提交于
* rm op_function_generator * rm op_func_generator.h * rm op_function * modify cmake * rm op_function.h * rm check for op_function_generator.cc * reset imperative * rm python part * fix imperative * lint * lint * modify legacy_c * review * modify * modify legacy * rm gen op_functions code * reset framework * rm core.ops for test * core.ops->core.eager.ops.legacy * not raiseError for xpu
-
由 zyfncg 提交于
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
-
由 Wen Sun 提交于
* refactor: use base class * fix: incorrect deps * fix: add missing header * refactor: update class structures * fix: bkcl typo * fix: remove redundant def
-
由 TeFeng Chen 提交于
* delivery skip_gc_vars from the main graph to each subgraph compiled by CINN * rearrange format and annotation * fix lacking namespace * fix segmentation fault cinn subgraph doesn't own kSkipGcVarNames * deliver all skip_gc_vars of main graph * add vlog for skip_gc_vars
-
由 XiangGao 提交于
-
由 Yuang Liu 提交于
-
由 GaoYuYang 提交于
-
由 risemeup1 提交于
* modify setup.py and paddle_build.sh * modify setup.py and paddle_build.sh * modify setup.py and paddle_build.sh * modify setup.py * modify run_setup * modify setup.py * fix make_clean * modify setup.py * modify setup.py * delete setting python_libary * debug * debug * debug * debug
-
- 04 1月, 2023 9 次提交
-
-
由 Aurelius84 提交于
-
由 Yuanle Liu 提交于
-
由 Guanghua Yu 提交于
-
由 Wilber 提交于
-
由 zhangbo9674 提交于
-
由 Yuanle Liu 提交于
-
由 Sławomir Siwek 提交于
This reverts commit 2c444dfa.
-
由 HongyuJia 提交于
* execute use kernel_key first * change OpKernelType->KernelKey * fix py3 compile error, remove redundant header files * fix build_strategy_test * fix DataType::RAW * fix custom_type test: operator_test.cc * fix transform place * fix backends_are_same_class * try fix place TransDataDevice * support all KernelKey * fix TransformData * fix place_are_same_class * fix merge * fix test_params_no_grad * fix specific place of GetExpectedKernelType * fix specific place of GetExpectedKernelType * fix GetKernelTypeForVar * fix dtype error * fix fetch_v2 * change GetKernelTypeForVar * fix interpreter * fix typo error * polish codes * polish codes * polish codes * fix conflict
-
由 lzy 提交于
-
- 03 1月, 2023 9 次提交
-
-
由 wangzhen38 提交于
-
由 limingshu 提交于
-
由 WangZhen 提交于
* Fix param and out grad names in dy2st for high order grad
-
由 Yuanle Liu 提交于
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces
-
由 kangguangli 提交于
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
由 Aurelius84 提交于
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op * add GetExpectedKernelType
-
由 Yiqun Liu 提交于
* Use BroadcastKernel and ReduceKernel to optimize expand and expand_grad. * Correct the axis when there is only 1 input in BroadcastKernel. * Add the calculate of output's shape.
-