- 20 Jan, 2023 1 commit
-
-
Committed by GGBond8488
* replace paddle.fluid.layers.data and remove io.data
* partial commit
* partial commit
* partial commit
* partial commit
* partial commit
* partial commit
* remove data in fluid.layers.io.__all__
* fix errors
* fix unittests
* fix unittest
* fix unittests
* fix unittest
* fix unittest
* fix unittests
* fix unittest
* fix test_layers unittests
* fix typo
* fix unittest
* fix unittest
* fix unittest
* fix typo
* fix unittest test_model_cast_to_bf16
* fix test_reducescatter
* fix collective unittest
* fix collective unittests
* fix collective unittests
* add coverage
* fix add layers.data
* re run ci
* fix some typos
* fix samplecode error
* fix samplecode error
-
- 19 Jan, 2023 3 commits
-
-
Committed by Feiyu Chan
-
Committed by heliqi
* fix squeeze_ bug
* fix solve use squeeze_kernel
* fix solve use squeeze_kernel
* fix solve use squeeze_kernel
* add test case
-
Committed by jameszhang
* [KUNLUN] add op: maxpool_with_index
* use DeviceContext::Alloc() instead of DenseTensor::mutable_data()
* fix file format
* solve clip unittest failure
* minor fix
* Revert "solve clip unittest failure" since the issue is fixed in #49535. This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b.
* align with xdnn on the definition of mask in max_pool_with_index
* minor
-
- 18 Jan, 2023 10 commits
-
-
Committed by wenbin
* fix cast issue
* add ut
-
Committed by zhouweiwei2014
* [Zero-Dim] support input 0D for paddle.moveaxis/quantile
* fix CI
-
Committed by RuohengMa
* add reduce_sum_int64 and reduce_sum_int8 xpu kernels
* [PHI] add clip grad kernel with support type float32 and int32
* [PHI unittest] add clip_grad unit test
* adapt code to clang-format
* update xpu api output with clip_grad api
* remove int8 support of reduce_sum xpu kernel since it can not pass unit tests
* adapt license date, add code for XPUDataType conversion
* add int8 support of reduce_sum
* add reduce_sum unit tests for dtype int64, int8, and add more test cases
* update license date
* remove buggy bitwise and, or and xor xpu kernels, refine bitwise not xpu kernel
* change license date
-
Committed by heliqi
* add unsqueeze test case
* add unsqueeze xpu test case
* fix unsqueeze test case
* fix unsqueeze test case
* fix unsqueeze test case
* fix unsqueeze test case
* add retain_grads
-
Committed by houj04
-
Committed by heliqi
* support paddle.t 0d tensor
* fix paddle.t test case
* merge from develop
-
Committed by jameszhang
-
Committed by Qi Li
-
Committed by wawltor
* Add the cumsum 0d tensor
* xpu and cpu judge the 0d tensor
* change 2022 to 2023 in new commit
* fix the reverse logic
-
Committed by Huihuang Zheng
Follow-up to PR https://github.com/PaddlePaddle/Paddle/pull/49842 : adds dygraph and shape unit tests for `case` and `switch_case`. This PR contains only test changes, because `case` and `switch_case` call `cond`, and PR https://github.com/PaddlePaddle/Paddle/pull/49842 has already added 0d tensor support there.
-
- 17 Jan, 2023 13 commits
-
-
Committed by Zhang Zheng
* [Zero-Dim] Support input 0D Tensor for masked_select
-
Committed by xiongkun
* rebase merge
* code fix
* fix bugs
-
Committed by Paulina Gacek
* reshape_transpose_matmul_pass_tester rewritten
* matmul_transpose_reshape_pass_tester rewritten
* mkldnn to onednn
-
Committed by yeliang2258
* add zero dims test
* update code
* fix zero dims
* update code
-
Committed by pangyoki
* new exe supports CUDA Graph
* fix
* fix
* fix
* fix FLAGS_use_stream_safe_cuda_allocator in unittest
* insert output of coalesce_tensor op to skip_gc_var
* fix
-
Committed by cyber-pioneer
* support @to_static+to_prime+cinn
* fix code logic
* debug4
* debug5
* debug6
* debug7
* debug 8
* debug 9
* debug10
* debug11
* debug11
* debug 12

Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
-
Committed by danleifeng
-
Committed by Jiabin Yang
-
Committed by Huihuang Zheng
Support 0d Tensor in ConditionalBlockOp
1. Add dygraph 0d tensor support for ConditionalBlockOp
2. Set scalar loss shape when `append_backward`
-
Committed by 姜永久
* rm retain grad
* fix zero_dim
* fix zero_dim for xpu
* reset zero dim for xpu
* reset xpu
* reset custom_relu
* Reset flip
* fix zero dim
-
Committed by zhouweiwei2014
-
Committed by WangZhen
* Support call backward() without params in dy2st
-
Committed by Xiaoxu Chen
* support elementwise base func
* fix compiling error and add test
* support vjp for div using comp
* remove additional change
* fix dy2st error with magic num
* fix dy magic num
* another magic
* another magic
* another magic
* add skip rename strategy
* support add vjp
* support add with new axis cal
* support sub vjp
* [prim] add multiply vjp rules
* [prim] add multiply vjp rules
* [prim] fix no infershape with composite in _append_backward_ops
* [prim] add expand vjp rule
* [prim] add exp vjp rule
* uncomment infer shape for reshape/sum static prim api
* [prim] fix tanh nullptr error
* remove some print message
* fix magic number in run_program relative tests @JiaBinYang
* [prim] add expand,multiply,exp vjp rules
* fix only support single direction reduce error
* infer reduce dims using out dims

Co-authored-by: JiabinYang <360788950@qq.com>
-
- 16 Jan, 2023 6 commits
-
-
Committed by Weilong Wu
-
Committed by wawltor
-
Committed by Yuanle Liu
* add trt_support_nhwc_pass
-
Committed by QingshuChen
-
Committed by zqw_1997
-
Committed by xiaoguoguo626807
-
- 15 Jan, 2023 1 commit
-
-
Committed by Jiabin Yang
* support elementwise base func
* fix compiling error and add test
* remove additional param
* support vjp for div using comp
* remove additional change
* fix dy2st error with magic num
* fix dy magic num
* another magic
* another magic
* add more test
* fix windows problem
* another magic
* fix windows compile
* invoke ci
* add skip rename strategy
* support add vjp
* fix test_tanh
* support add with new axis cal
* fix resnet and some test
* add composite log
* support sub vjp
* enhance_tests
* support more dtype for full
-
- 13 Jan, 2023 6 commits
-
-
Committed by Weilong Wu
-
Committed by cyber-pioneer
-
Committed by Wang Bojun
* add fmha_flashattention oss plugin
-
Committed by zhouweiwei2014
-
Committed by ronnywang
* add where, atan2, median 0d ut
* add where, atan2, median 0d ut
* update
* update
* update
-
Committed by Zhang Jun
* update trt engine to set in/out data type
* update
* Update engine.cc
* Update engine.cc
* update
* set engine output type before freeze the network
* update
* update trt autoscan ut
* update
* update ut
* fix equal bug, update ut
* fix cast and equal ut
* update cast ut using TRT < 8.4
* set datatype from scope
* check output var is nullptr
* Update op_converter.h
* update tensorrt_engine_op_test ut
* update
-