- 29 8月, 2022 2 次提交
-
-
由 zyfncg 提交于
* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun
-
由 Weilong Wu 提交于
-
- 26 8月, 2022 4 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 kangguangli 提交于
* remove fluid kernel and activate phi kernel * fix parameter error * transfer mkldnn part * modify header file path * fix compile error * transfer special case * fix lod setting and special case for layout setting * add testcase and refine code
-
- 25 8月, 2022 10 次提交
-
-
由 piotrekobi 提交于
* Add omp parallel for directives * Revert "Add omp parallel for directives" This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6. * Add #pragma omp parallel for to correct file * Add check for _OPENMP definition * Disable omp on gpu * Trigger CI * Readd check for _OPENMP definition * Change macro disabling changes on GPU * Improve macro readability
-
由 Aurelius84 提交于
* [OpAttr]axis of Reverse Support Tensor type * fix coverage * fix unittest
-
由 Aurelius84 提交于
* [OpAttr]min/max of Uniform_rand support Tensor type * fix typo
-
由 kangguangli 提交于
* transfer memcpy_d2h from fluid to phi * refine arg check and add comment * fix cannot fallback to phi kernel * fix gpu_context host alloc when tensor size = 0 * add kernel for std::vector<DenseTensor> args * fix bugs in MemcpyD2HMultiIOKernel * remove useless header file * polish format * fix typo * add testcase for cudapinned place * refine check condition in test * polish error message * polish error message * remove header in fluid directory * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation * fix code style check
-
由 Sing_chan 提交于
* make full_like support double_max in dygraph * fix bug
-
由 wanghuancoder 提交于
* sync_batch_norm_grad delete mean and variance
-
由 hong 提交于
* optimizer conv alog speed * code polish * remove useless code * fix compile error * fix cpu compile error * not use cudnn alog t * add search cache max number * polish code * fix cache test bug * add groups data format to conv args * fix cache test bug * fix cudnn_deterministic bug * fix test switch auto tune bug * fix test swith autotune bug; * fix conv cache bug * fix cache test error * fix cache test bug * fix windows mac compile error * fix workspace search error * update cudnn cache * fix cache test bug; test=develop * fix autotune swith test error * polish code * oplish code
-
由 Rayman 提交于
-
由 sprouteer 提交于
* fix unique_kernel bugs * fix unique kernel cu bugs
-
由 haosicheng 提交于
-
- 24 8月, 2022 4 次提交
-
-
由 Leo Chen 提交于
* make tensor_util contains no cuda code * refine isfinite * revert ut * move isfinite function to its op * fix test * fix compile * std::isnan is not defined for int type on windows * fix windows compile * fix fp16 * fix rocm compile * revert gradient node
-
由 WangZhen 提交于
-
由 Rayman 提交于
* 【Hackathon No.34】优化 poisson op * [poisson] code style fix * modify code style * prevent from big number * modify code style * modify code style * modify import * modify import * modify code style
-
由 WangZhen 提交于
* Adapt minlength attr for bincount
-
- 23 8月, 2022 6 次提交
-
-
由 niuliling123 提交于
-
由 zhangkaihuo 提交于
* rename the member function of SparseTensor * use shorter function names
-
由 limingshu 提交于
-
由 Siming Dai 提交于
-
由 thunder95 提交于
* erfinv * fix some tiny issues
-
由 YuanRisheng 提交于
* move distribute_fpn_proposals * fix some code * fix yaml bugs * add set dtype * move proposal_impl to funcs * fix compile bugs
-
- 22 8月, 2022 3 次提交
-
-
由 wanghuancoder 提交于
some python c api use final state
-
由 zhangkaihuo 提交于
-
由 shangliang Xu 提交于
-
- 19 8月, 2022 1 次提交
-
-
由 Wang Bojun 提交于
* add group_norm dyanmic plugin
-
- 18 8月, 2022 3 次提交
-
-
由 HongyuJia 提交于
* transfer trilinear op to phi, change name from trilinear_interp_v2 to trilinear_interp * reserve linear_interp param * change testcase scale if-branch * testcase test_imperative_case * fix trilinear testcase * import paddle in test_trilinear_interp_v2
-
由 Aurelius84 提交于
* [OpAttr]Squeeze axes support Tensor * add support_tensor * fix unittest * fix coverage
-
由 HongyuJia 提交于
* transfer bilinear op to phi, change bname from bilinear_interp_v2 to bilinear_interp * reserve linear_interp param * fix cross device import
-
- 17 8月, 2022 4 次提交
-
-
由 Leo Chen 提交于
* use addKernel * fix compile * remove elementwiseAddto * add return * fix custom place
-
由 ykkk2333 提交于
* xpu unittest grad compute supports more types, *test=kunlun * add instance norm xpu, *test=kunlun
-
由 HongyuJia 提交于
* transfer bicubic_interp op to phi, change name from bicubic_interp_v2 to bicubic_interp * test final_state_bicubic_interp api * testcase match imperative case
-
由 sneaxiy 提交于
* fix squared_l2_norm bug * update buffer.h
-
- 16 8月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* move check finite and unscale kernel into phi * move infershape into phi * move update_loss_scaling kernel into phi * remove original kernels * move update loss scaling infershape into phi * add header for xpu and npu * solve coverage failed * fix npu test failed * remove mutable data in cu file * fix new executor failed * add valid check for meta tensor output
-
由 Siming Dai 提交于
* initial commit * fix op maker bug * fix mul grad bug * add unittest * fix add grad bug, add cpu kernel * add paddle.geometric.message_passing * add paddle.geometric.send_uv api, add unittest * add fp16 judgement * fix file typo, move compute_type to message_op * add impl file * fix unittest timeout time * add review revise
-
由 HongyuJia 提交于
-