- 28 2月, 2022 13 次提交
-
-
由 0x45f 提交于
* move size, erfinv, pixel_shuffle infershape to phi * fix erfinv infermeta
-
由 Lijunhui 提交于
* init grid_sampler with mode=bilinear * solve error * rm fill constant * rm head * change block size * change block size * optimize * apply existing config
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Add host_trace_level env variable * Revert "Optimize perf of softmax_with_cross_entropy (#39553)" This reverts commit bbe5228c. Co-authored-by: Nliutiexing <liutiexing@google.com> Co-authored-by: NZzSean <18818272991@163.com>
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add log for Executor * Profile Allocators * Profile Allocators * adjust interface * remove lock for set * fix Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 furnace 提交于
* [Phi] move truncated_gaussian_random, copy kernels * [Phi] move truncated_gaussian_random, kernel register * [Phi] move truncated_gaussian_random, delete useless codes
-
由 zhangchunle 提交于
* update;test=cpu-py3
-
由 Chen Weihang 提交于
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostTracer * fix * update * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 zhangbo9674 提交于
* refine bf16 amp-o1 logic * refine amp GLOG * refine unittest * refine unittest
-
由 Wilber 提交于
-
由 zmxdream 提交于
-
由 chenjian 提交于
* add new profiler components * fix bug
-
由 Liu-xiandong 提交于
* [KP] Unify .cu and .xpu files with .kps files * fix CI bug in GPU and modify the list * fix conflict * modify the date
-
- 26 2月, 2022 5 次提交
-
-
由 YuanRisheng 提交于
-
由 From00 提交于
* Move GumbelSoftmax OP to phi * platform::errors -> phi::errors; GumbelSoftmaxGradInferMeta -> backend.h/cc * Use axis util in kernel impl * Remove namespace platform::errors * Use GetCPUEngine in Device Context
-
由 From00 提交于
* Move BilinearTensorProduct OP to phi * Set dtype for Infermeta
-
由 Weilong Wu 提交于
* Support Eager Hook, expose interface to python * Fix CI issue
-
由 Chen Weihang 提交于
-
- 25 2月, 2022 22 次提交
-
-
由 Chen Weihang 提交于
-
由 Feiyu Chan 提交于
-
由 jakpiase 提交于
-
由 sneaxiy 提交于
* add multi tensor apply l2 norm * add multi_tensor_apply code * make sizeof(TensorMeta) smalller * move code to distributed_fused_lamb_op.cu * remove useless FLAGS
-
由 0x45f 提交于
* move eye OP to pten * move size OP to pten * merge develop * fix merge * move files * move erfinv OP to phi * remove comment * move pixel_shuffle OP to phi * remove comment * fix PT_REGISTER * fix NPU * fix CR * remove size_sig.cc for PR-CI-Coverage
-
由 YUNSHEN XIE 提交于
* disable some distribute test case when in CPU test env * disable some test case when in CPU test env * fix
-
由 Zhang Zheng 提交于
-
由 Aganlengzi 提交于
* [phi]migrate increment addmm multinomial cholesky InferShapes to phi * set_dtype and mod MultinomialFunctor
-
由 Qi Li 提交于
* [ROCm] fix Managed Memory Alloc on HIP, test=develop * update, test=develop
-
由 Linjie Chen 提交于
-
由 Zhang Ting 提交于
-
由 Zhang Zheng 提交于
* Optimize perf of softmax_with_cross_entropy * fix * fix * fix accuracy error
-
由 zhangbo9674 提交于
* add ele_add * add ele_mul * add ele_sub * sovle conflict * fix npu * refine ele_add * add ele_mul unittest * refine ele_sub * refine ci * refine unittest
-
由 furnace 提交于
[Phi] mv kernel
-
由 joeqiao12 提交于
-
由 Chen Weihang 提交于
* support cudnn kernel moving * polish cmake rules * add unittest for coverage * remove orig kernel * remove softmax cudnn kernel * fix softmax test failed * fix npu func error * resolve conflict * rename gpu dnn kernels * fix name rule error * fix compile error * update fp16 namespace
-
由 YuanRisheng 提交于
* fix bugs * fix bugs
-
由 Li Min 提交于
* Fix compile error on cuda_arch less than 700.
-
由 fwenguang 提交于
-
由 niuliling123 提交于
-
由 Zhanlue Yang 提交于
-
由 WangXi 提交于
-