- 04 Jan, 2023 6 commits
-
-
Committed by Yuanle Liu
-
Committed by Wilber
-
Committed by Yuanle Liu
-
Committed by Sławomir Siwek
This reverts commit 2c444dfa.
-
Committed by HongyuJia
* execute use kernel_key first
* change OpKernelType->KernelKey
* fix py3 compile error, remove redundant header files
* fix build_strategy_test
* fix DataType::RAW
* fix custom_type test: operator_test.cc
* fix transform place
* fix backends_are_same_class
* try fix place TransDataDevice
* support all KernelKey
* fix TransformData
* fix place_are_same_class
* fix merge
* fix test_params_no_grad
* fix specific place of GetExpectedKernelType
* fix specific place of GetExpectedKernelType
* fix GetKernelTypeForVar
* fix dtype error
* fix fetch_v2
* change GetKernelTypeForVar
* fix interpreter
* fix typo error
* polish codes
* polish codes
* polish codes
* fix conflict
-
Committed by lzy
-
- 03 Jan, 2023 10 commits
-
-
Committed by wangzhen38
-
Committed by WangZhen
* Fix param and out grad names in dy2st for high order grad
-
Committed by Yuanle Liu
-
Committed by Sławomir Siwek
* replace matmul with matmul_v2 in fuse passes
* Remove fusion logic from matmul
* removing fusion methods
* add proper name
* adjust namespaces
-
Committed by kangguangli
-
Committed by zhoutianzi666
* Implement conv2d_fusion NHWC format using CUTLASS
* Add unit testing for CUTLASS Conv in inference
* Add experimental API for CUTLASS.
-
Committed by Aurelius84
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op
* add GetExpectedKernelType
-
Committed by zhaoyingli
* [Zero-Dim] reshape/reshape_/reverse 0D support
* rm comment
* change paddle.to_tensor to paddle.full
* fix docs
* update paddle.full
-
Committed by zhoutianzi666
-
Committed by Sanbu
-
- 02 Jan, 2023 1 commit
-
-
Committed by Hulek
-
- 01 Jan, 2023 1 commit
-
-
Committed by gem5
-
- 30 Dec, 2022 6 commits
-
-
Committed by zyfncg
* fix test_conv_bn_fuse_pass_cc
* remove comment
-
Committed by Zhang Jun
* update conv to convNd
* trigger ci
-
Committed by zyfncg
* support static graph code-gen for squeeze op
* generate static graph code of unsqueeze
* refine op name
* add extra output in op_compat
* remove debug log
-
Committed by HongyuJia
-
Committed by Sanbu
* 1219
* temporarily change the num_diff_files limit, test=document_fix
* Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20.
* for codestyle
* remove duplicate license
* `static mode` -> `static graph mode`
* Update hybrid_parallel_inference.py
* Update layer_function_generator.py
* Update manipulation.py
* reset
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
Committed by WangZhen
* Fix default GetExpectedKernelType for ops supported tensor attrs
-
- 29 Dec, 2022 4 commits
-
-
Committed by xu98bin
* auto parallel bf16
-
Committed by zmxdream
* fix load into memory
* fix load into memory
* fix code style
-
Committed by MarDino
-
Committed by Wang Bojun
* fusedAttenGrad_noGrad
* code style fix
* add ut
* remove unnecessary log
-
- 28 Dec, 2022 5 commits
-
-
Committed by Leo Chen
* add skip run
* alloc minimum memory
* skip check_size in Alloc
* skip check_size in Alloc
* skip check_size in Alloc
* fix cases when tensor is initialized or empty
* alloc empty output for place info
* add test
* increase timeout
* format code
* skip cpu
* add cudnn_deterministic
* fit for hostAlloc
* follow comments
* change check_size to fake_alloc
-
Committed by HappyHeavyRain
* generate the static op of some ops
* add the VERSION of pixel_shuffle
* change the API doc of isclose
* change the API doc of isclose
* fix the isclose op comment
-
Committed by Yuanle Liu
-
Committed by WangZhen
-
Committed by wanghuancoder
* delete old dygraph pylayer
-
- 27 Dec, 2022 3 commits
-
-
Committed by zhaoyingli
* [AutoParallel] quantization pass support export
* support subgraph
* move_presist_var_to_global_block
* update unittest
* fix ci-coverage
* fix codestyle
* fix fake_dequantize_op
* remove unused var
* fix ci error and approval error
* add unittest for fp16 in test_dequant_linear
* replace mutable data
* fix unittest in non-cuda-core
* fix unittest
Co-authored-by: carryyu <569782149@qq.com>
Co-authored-by: wufeisheng <wfs1997@163.com>
-
Committed by zhangbo9674
* cinn use interpretercore
* fix bug
* fix compile bug
* fix scope bug
* refine code
* refine code by comment
* refine code by comment
-
Committed by Ruibiao Chen
* Support priority scheduling for standalone executor
* Add CPU test
-
- 26 Dec, 2022 4 commits
-
-
Committed by Roc
* revert concat and change concat to stack
* let stack kernel support int8, uint8 and bool type
-
Committed by Yulong Ao
* [Auto Parallel] Rename methods of ProcessMesh
* [Auto Parallel] Impl the python process_mesh by the c++ one
* [Auto Parallel] Add some minor modifications
* [Auto Parallel] Rename some methods
* [Auto Parallel] Remove unnecessary codes
* [Auto Parallel] Add back some removed files
* [Auto Parallel] Fix bugs
* [Auto Parallel] Fix a bug
* Update process_mesh.cc
* [Auto Parallel] Fix a bug
-
Committed by Ruibiao Chen
-
Committed by Ruibiao Chen
* Memory search for stream analyzer
* Shrink redundant waiters
-