- 24 10月, 2022 1 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com> Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
- 20 10月, 2022 1 次提交
-
-
由 liu zhengxi 提交于
Add value check & error message for gather_tree cherry-pick #47051
-
- 11 10月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
-
- 20 9月, 2022 2 次提交
-
-
由 HongyuJia 提交于
* polish code comments * polish data_device_transform.cc
-
由 Jiabin Yang 提交于
* [Eager] Fix ocr (#46124) * fix linspace error in amp * fix log * fix amp error * fix ocr error which caused by amp * add more check * rename dtype ns * [Eager Bug fix]Fix Detection (#46147) * fix linspace error in amp * fix log * fix amp error * Revert "Simplify size op impl (#45808)" This reverts commit c252b1de. * fix_seg * fix detection Co-authored-by: NChen Weihang <sunny_cwh@163.com> Co-authored-by: NChen Weihang <sunny_cwh@163.com>
-
- 19 9月, 2022 2 次提交
-
-
由 RichardWooSJTU 提交于
[vision.ops.nms] Fix return order error and duplicate results with specific inputs (#46148) (#46193) * fix return order error and duplicate results with specific inputs
-
由 Chen Weihang 提交于
This reverts commit c252b1de.
-
- 13 9月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
-
- 09 9月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* simplify size op * trans to cuda manuly * fix copy error
-
- 07 9月, 2022 1 次提交
-
-
由 WangZhen 提交于
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
-
- 06 9月, 2022 2 次提交
- 01 9月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* copy kernel file to phi * delete some code * migrate uniform_random, test=kunlun * fix input error, test=kunlun * fix gpu register error, test=kunlun * add include file, test=kunlun * try fix error from CI, test=kunlun * polish other PR * fix CI-coverage error, test=kunlun
-
- 31 8月, 2022 6 次提交
-
-
由 duanyanhui 提交于
* enhance grid_sampler cpu kernel to 5D input * fix bug when 5D input tensor running on the cudnn kernel
-
由 YuanRisheng 提交于
* move elementwise test=kunlun * move add/sub/mul/div kernel to elementwise_kernel, test=kunlun * fix ci bugs,test=kunlun * fix ci bugs * test=kunlun
-
由 HongyuJia 提交于
* migrate truncated_gaussian_random kernel to phi, test=kunlun * reuse CPU kernel, test=kunlun * debug kernel, test=kunlun * migrate truncated_gaussian_random kernel to phi, test=kunlun * split truncated_normal, test=kunlun * try fix error from CI, test=kunlun
-
由 Aurelius84 提交于
* [OpAttr]output_size of unpool support Tensor type * fix coverage * fix contain_var * fix coverage
-
由 Charles-hit 提交于
* fix split bug * solve function redefine * fix fluid.layers.split and add unit test * delete splitInferMeta register in unary.cc * modify test_split_op GPU unit test * modify test_split_op GPU unit test place param * refactor split op and fix infershape bugs * add () in && and || * fix split C++ unit test * fix split infershape
-
由 Li Min 提交于
-
- 30 8月, 2022 4 次提交
-
-
由 WangZhen 提交于
* Adapt tensor axis for argmin/max * Add UT * Polish UT
-
由 WangZhen 提交于
* [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod
-
由 WangZhen 提交于
-
由 Chen Weihang 提交于
-
- 29 8月, 2022 1 次提交
-
-
由 Siming Dai 提交于
* move incubate to geometric * add paddle.geometric * fix unittest bug * add float16 support for segment op * change reindex and sample neighbors flag name * add heter graph reindex * move sample_neighbors.py to neighbors.py * delete khop_sampler in geometric * delete unused code * change sample_neighbors api input order * fix en doc * fix unittest * fix unittest * change reindex * fix division by 0 * delete unnecessary input argument * delete final_state
-
- 25 8月, 2022 4 次提交
-
-
由 piotrekobi 提交于
* Add omp parallel for directives * Revert "Add omp parallel for directives" This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6. * Add #pragma omp parallel for to correct file * Add check for _OPENMP definition * Disable omp on gpu * Trigger CI * Readd check for _OPENMP definition * Change macro disabling changes on GPU * Improve macro readability
-
由 Aurelius84 提交于
* [OpAttr]min/max of Uniform_rand support Tensor type * fix typo
-
由 Sing_chan 提交于
* make full_like support double_max in dygraph * fix bug
-
由 Rayman 提交于
-
- 24 8月, 2022 3 次提交
-
-
由 Leo Chen 提交于
* make tensor_util contains no cuda code * refine isfinite * revert ut * move isfinite function to its op * fix test * fix compile * std::isnan is not defined for int type on windows * fix windows compile * fix fp16 * fix rocm compile * revert gradient node
-
由 WangZhen 提交于
-
由 WangZhen 提交于
* Adapt minlength attr for bincount
-
- 23 8月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move distribute_fpn_proposals * fix some code * fix yaml bugs * add set dtype * move proposal_impl to funcs * fix compile bugs
-
- 22 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
some python c api use final state
-
- 18 8月, 2022 2 次提交
-
-
由 HongyuJia 提交于
* transfer trilinear op to phi, change name from trilinear_interp_v2 to trilinear_interp * reserve linear_interp param * change testcase scale if-branch * testcase test_imperative_case * fix trilinear testcase * import paddle in test_trilinear_interp_v2
-
由 HongyuJia 提交于
* transfer bilinear op to phi, change bname from bilinear_interp_v2 to bilinear_interp * reserve linear_interp param * fix cross device import
-
- 17 8月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* transfer bicubic_interp op to phi, change name from bicubic_interp_v2 to bicubic_interp * test final_state_bicubic_interp api * testcase match imperative case
-
- 16 8月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* move check finite and unscale kernel into phi * move infershape into phi * move update_loss_scaling kernel into phi * remove original kernels * move update loss scaling infershape into phi * add header for xpu and npu * solve coverage failed * fix npu test failed * remove mutable data in cu file * fix new executor failed * add valid check for meta tensor output
-
由 Siming Dai 提交于
* initial commit * fix op maker bug * fix mul grad bug * add unittest * fix add grad bug, add cpu kernel * add paddle.geometric.message_passing * add paddle.geometric.send_uv api, add unittest * add fp16 judgement * fix file typo, move compute_type to message_op * add impl file * fix unittest timeout time * add review revise
-
由 HongyuJia 提交于
-
- 15 8月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* change name linear_interp_v2 to linear_interp * fix deprecated_op_names * deprecated_op_names add linear_interp_grad
-