- 31 10月, 2022 2 次提交
-
-
由 xiongkun 提交于
* add unittest for einsum-v2-trace and diagonal * repeat labels. * einsum support repeated labels. * forward is ok for diagonal and undiagonalized. TODO: check backward is ok by our theorem. * backward is ok! * fix by PR suggestions. * fix ci error * fix ci error * fix ci warning
-
由 Wang Xin 提交于
-
- 28 10月, 2022 1 次提交
-
-
由 zyfncg 提交于
-
- 26 10月, 2022 2 次提交
-
-
由 Lin Manhui 提交于
* Fix paddle.pow() bugs * Add unittest cases * Fix ut cases * Add ut cases on multiple devices
-
由 Wang Xin 提交于
fix uninitialized, tautological-constant-out-of-range-compare and literal-conversion warning on macos (#47341)
-
- 25 10月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 24 10月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support selected_rows kernel for multiply in dygraph * delete useless code of slice in fluid * fix complie bug * move slice_array from fluid to phi * fix strided_slice_op_npu
-
- 21 10月, 2022 1 次提交
-
-
由 zhangyikun02 提交于
-
- 19 10月, 2022 1 次提交
-
-
由 will-jl944 提交于
-
- 18 10月, 2022 1 次提交
-
-
由 liu zhengxi 提交于
-
- 17 10月, 2022 2 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
- 13 10月, 2022 2 次提交
-
-
由 zhouweiwei2014 提交于
-
由 Zhang Ting 提交于
* Revert "【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111)"
-
- 11 10月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
-
- 10 10月, 2022 1 次提交
-
-
由 Rayman 提交于
support fp16 for deformable conv
-
- 23 9月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 20 9月, 2022 4 次提交
-
-
由 YuanRisheng 提交于
-
由 Jiabin Yang 提交于
* fix linspace error in amp * fix log * fix amp error * Revert "Simplify size op impl (#45808)" This reverts commit c252b1de. * fix_seg * fix detection Co-authored-by: NChen Weihang <sunny_cwh@163.com>
-
由 Jiabin Yang 提交于
* fix linspace error in amp * fix log * fix amp error * fix ocr error which caused by amp * add more check * rename dtype ns
-
由 HongyuJia 提交于
* polish code comments * polish data_device_transform.cc
-
- 19 9月, 2022 3 次提交
-
-
由 YuanRisheng 提交于
* move sum * fix ci bugs * fix ci bugs * fix set_lod bugs * fix infershape bugs * fix ci bugs * fix ci unittest bug * fix ci bugs * perfect code * update code according comment * add unittest * fix ci bugs
-
由 Chen Weihang 提交于
This reverts commit c252b1de.
-
由 RichardWooSJTU 提交于
* fix return order error and duplicate results with specific inputs
-
- 18 9月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* perfect softmax functor * fix compile bugs * fix ci bugs
-
- 14 9月, 2022 1 次提交
-
-
由 Li Min 提交于
-
- 13 9月, 2022 2 次提交
-
-
由 JingZhuangzhuang 提交于
* add softmax infer kernel
-
由 ykkk2333 提交于
-
- 09 9月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* simplify size op * trans to cuda manuly * fix copy error
-
- 07 9月, 2022 1 次提交
-
-
由 WangZhen 提交于
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
-
- 06 9月, 2022 2 次提交
- 01 9月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* copy kernel file to phi * delete some code * migrate uniform_random, test=kunlun * fix input error, test=kunlun * fix gpu register error, test=kunlun * add include file, test=kunlun * try fix error from CI, test=kunlun * polish other PR * fix CI-coverage error, test=kunlun
-
- 31 8月, 2022 6 次提交
-
-
由 duanyanhui 提交于
* enhance grid_sampler cpu kernel to 5D input * fix bug when 5D input tensor running on the cudnn kernel
-
由 YuanRisheng 提交于
* move elementwise test=kunlun * move add/sub/mul/div kernel to elementwise_kernel, test=kunlun * fix ci bugs,test=kunlun * fix ci bugs * test=kunlun
-
由 HongyuJia 提交于
* migrate truncated_gaussian_random kernel to phi, test=kunlun * reuse CPU kernel, test=kunlun * debug kernel, test=kunlun * migrate truncated_gaussian_random kernel to phi, test=kunlun * split truncated_normal, test=kunlun * try fix error from CI, test=kunlun
-
由 Aurelius84 提交于
* [OpAttr]output_size of unpool support Tensor type * fix coverage * fix contain_var * fix coverage
-
由 Charles-hit 提交于
* fix split bug * solve function redefine * fix fluid.layers.split and add unit test * delete splitInferMeta register in unary.cc * modify test_split_op GPU unit test * modify test_split_op GPU unit test place param * refactor split op and fix infershape bugs * add () in && and || * fix split C++ unit test * fix split infershape
-
由 Li Min 提交于
-
- 30 8月, 2022 1 次提交
-
-
由 WangZhen 提交于
* Adapt tensor axis for argmin/max * Add UT * Polish UT
-