- 31 8月, 2022 2 次提交
- 30 8月, 2022 12 次提交
-
-
由 HongyuJia 提交于
* add coalesce_tensor kernel * polist coalesce_tensor kernel * add sig and InferMeta * add testcase * add legacy_api.yaml * fix infermeta * fix yaml * fix kernel implementation * add compile dependency of phi/kernels * fix MetaConfig * add python api * add and fix testcase * rnn.py add import * change _C_ops.coalesce_tensor * remove useless comments * add SetBackend * restore XPU kernel temporarily * fix code according to PR comments
-
由 pangyoki 提交于
* move huber_loss xpu kernel to phi, test=kunlun * fix, test=kunlun * fix paddle_enforce, test=kunlun
-
由 zhangyikun02 提交于
-
由 WangZhen 提交于
* Adapt tensor axis for argmin/max * Add UT * Polish UT
-
由 pangyoki 提交于
* move layer_norm xpu kernel to phi, test=kunlun * fix, test=kunlun
-
由 WangZhen 提交于
* [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod
-
由 Aurelius84 提交于
* [OpAttr]padding_value of Pad support Tensor type * fix unittest * fix unittest * fix coverage
-
由 kangguangli 提交于
* fix memcpy_h2d bug related to cuda stream setting when allocate memory * add header file * fix compile error for cpu only
-
由 Leo Chen 提交于
* move xpu kernel to phi * delete fluid file * fix compile * add guard, test=kunlun * xpu set constant * fix xpu error, test=kunlun
-
由 WangZhen 提交于
-
由 ming1753 提交于
-
由 Chen Weihang 提交于
-
- 29 8月, 2022 13 次提交
-
-
由 YuanRisheng 提交于
* mv elementwise add to xpu , test=kunlun * fix ci bugs, test=kunlun * fix ci bugs , test=kunlun
-
由 Sławomir Siwek 提交于
* abs relu6 fwd * abs bwd * gaussian_random_kernel and mkldnn-onednn renaming * scale kernel * whitespace * whitespace * revert scale migration * whitespaces * revert changes to gaussian kernel * whitespaces
-
由 Weilong Wu 提交于
* [XPU] migrate mul to phi;test=kunlun * rm fluid mul xpu op;test=kunlun
-
由 Chen Weihang 提交于
* migrate assign xpu kernel, test=kunlun * remove assign_value xpu, test=kunlun
-
由 wanghuancoder 提交于
* gather gather_grad gather_nd gaussian_random xpu to phi
-
由 zhangbo9674 提交于
-
由 Zhang Ting 提交于
* fix hard_swish inf * skip_check_grad for mkldnn op * 'fix code style' * fix unittest
-
由 Aurelius84 提交于
* [OpAttr]num_rows/num_colums of eye support Tensor type * fix attr cast with long type
-
由 Siming Dai 提交于
* move incubate to geometric * add paddle.geometric * fix unittest bug * add float16 support for segment op * change reindex and sample neighbors flag name * add heter graph reindex * move sample_neighbors.py to neighbors.py * delete khop_sampler in geometric * delete unused code * change sample_neighbors api input order * fix en doc * fix unittest * fix unittest * change reindex * fix division by 0 * delete unnecessary input argument * delete final_state
-
由 zhangbo9674 提交于
-
由 zhangbo9674 提交于
-
由 zyfncg 提交于
* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun
-
由 Weilong Wu 提交于
-
- 26 8月, 2022 4 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 kangguangli 提交于
* remove fluid kernel and activate phi kernel * fix parameter error * transfer mkldnn part * modify header file path * fix compile error * transfer special case * fix lod setting and special case for layout setting * add testcase and refine code
-
- 25 8月, 2022 9 次提交
-
-
由 piotrekobi 提交于
* Add omp parallel for directives * Revert "Add omp parallel for directives" This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6. * Add #pragma omp parallel for to correct file * Add check for _OPENMP definition * Disable omp on gpu * Trigger CI * Readd check for _OPENMP definition * Change macro disabling changes on GPU * Improve macro readability
-
由 Aurelius84 提交于
* [OpAttr]axis of Reverse Support Tensor type * fix coverage * fix unittest
-
由 Aurelius84 提交于
* [OpAttr]min/max of Uniform_rand support Tensor type * fix typo
-
由 kangguangli 提交于
* transfer memcpy_d2h from fluid to phi * refine arg check and add comment * fix cannot fallback to phi kernel * fix gpu_context host alloc when tensor size = 0 * add kernel for std::vector<DenseTensor> args * fix bugs in MemcpyD2HMultiIOKernel * remove useless header file * polish format * fix typo * add testcase for cudapinned place * refine check condition in test * polish error message * polish error message * remove header in fluid directory * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation * fix code style check
-
由 Sing_chan 提交于
* make full_like support double_max in dygraph * fix bug
-
由 wanghuancoder 提交于
* sync_batch_norm_grad delete mean and variance
-
由 hong 提交于
* optimizer conv alog speed * code polish * remove useless code * fix compile error * fix cpu compile error * not use cudnn alog t * add search cache max number * polish code * fix cache test bug * add groups data format to conv args * fix cache test bug * fix cudnn_deterministic bug * fix test switch auto tune bug * fix test swith autotune bug; * fix conv cache bug * fix cache test error * fix cache test bug * fix windows mac compile error * fix workspace search error * update cudnn cache * fix cache test bug; test=develop * fix autotune swith test error * polish code * oplish code
-
由 Rayman 提交于
-
由 sprouteer 提交于
* fix unique_kernel bugs * fix unique kernel cu bugs
-