- 05 1月, 2022 2 次提交
- 04 1月, 2022 18 次提交
-
-
由 niuliling123 提交于
Add OpFunctor and replace cast, scale, clip, bce_loss and abs_grad with elementwise_no_broadcast (#38500)
-
由 Leo Chen 提交于
-
由 Qi Li 提交于
-
由 Aurelius84 提交于
* Fix memcpyD2H sync behavior with other stream * add wait * add wait * add wait
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * move cpu_impl of elementwise kernel to new directory
-
由 furnace 提交于
[NPU] add pad and pad_grad
-
由 LiYuRio 提交于
-
由 jakpiase 提交于
-
由 0x45f 提交于
* fix error when set buffer in forward * add unittest * refine class name * refine not framework.in_dygraph_mode() in if * fix UT error * add comment * refine code * remove useless import
-
由 zhangkaihuo 提交于
-
由 王明冬 提交于
-
由 Zhanlue Yang 提交于
[Unify Tensors PR #3]Port framework::Tensor members & interfaces to pten::DenseTensor, test=allcases (#38473) * Added shared_ptr<Allocation> member & corresponding interfaces to Storage * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly * Fixed issues with storage offset * Used place to malloc allocation for TensorStorage * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor * Fixed issues with place * Added comments * Moved mutable_data with stream argument to DenseTensor * Added set_offset interface * Fixed CI issues,test=allcases * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor * Reverted changes too pten_layout() interface * Removed friend classes
-
由 Weilong Wu 提交于
-
由 houj04 提交于
-
由 Chen Weihang 提交于
* move inner cast api to cast_kernel.h * resolve conflit
-
由 yaoxuefeng 提交于
heter context support dynamic mf dim
-
由 wanghuancoder 提交于
-
由 zlsh80826 提交于
-
- 31 12月, 2021 20 次提交
-
-
由 Zhangjingyu06 提交于
* [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun,*test=kunlun Co-authored-by: NQingshuChen <chenqingshu@baidu.com>
-
由 JYChen 提交于
* add new api/op kthvalue * kthvalue cuda kernel to cub sorting * fix example code error * throw errors instead of LOG in cuda sort * throw errors by Paddle_ENFORCE
-
由 baoachun 提交于
* add mul_gru_fuse_pass ut * update ut * update ut * update ut timeout setting * update ut
-
由 jakpiase 提交于
* glog fix * changed approach
-
由 jakpiase 提交于
* 6 dims fix * removed limitations of max dims
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * fix compile bugs
-
由 Xiaoxu Chen 提交于
* add beta distribution * add kl_divergence and register_kl api
-
由 zmxdream 提交于
-
由 tianshuo78520a 提交于
-
由 fwenguang 提交于
* [MLU]support calling mlu op from python interface * [MLU]fix * fix * [mlu]fix mlu_places * [mlu]fix required mlu * fix * [MLU]fix tensor copy * [mlu] fix MLUPlace call path
-
由 Wilber 提交于
-
由 JYChen 提交于
* add new api paddle.quantile and paddle.Tensor.quantile * add take_todo and fix UT
-
由 zhiboniu 提交于
-
由 xiayanming 提交于
* [Auto Parallel] add gradient merge pass * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix pr review * fix pr review * fix pr review * fix pr review * fix pr review * fix pr review
-
由 xiaoting 提交于
* add fold opereators, test=develop * add fold opereators, test=develop * add fold opereators, test=develop * update fold op error test, test=develop * fix unitext, test=develop * fix unitext, test=develop
-
由 Zhanlue Yang 提交于
-
由 Double_V 提交于
-
由 Huihuang Zheng 提交于
Paddle new APIs: put_along_axis. Xu Huang is on holiday so we created this PR to work on it. It is based on his PR: https://github.com/PaddlePaddle/Paddle/pull/37921
-
由 Chen Weihang 提交于
-
由 zhiboniu 提交于
-