- 28 2月, 2022 12 次提交
-
-
由 Shang Zhizhou 提交于
* add some trt layers * trtOpConverter pass ok * add comments * add constraints to some attrs in the pd_lower_to_trt patterns * update constraint * fix code style * update pass name * update code style * change .hpp.inc to .cc.inc in mlir_add_rewriter
-
由 zhangchunle 提交于
* update;test=cpu-py3
-
由 Chen Weihang 提交于
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostTracer * fix * update * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 zhangbo9674 提交于
* refine bf16 amp-o1 logic * refine amp GLOG * refine unittest * refine unittest
-
由 zyfncg 提交于
* remove empty kernel in fluid and adjust the param of empty dev_api * polish code * revert fluid empty kernel
-
由 Wilber 提交于
-
由 zyfncg 提交于
* fix selected_rows bug in C++ API * add optional for C++ APIO * data transform support optional * remove data transform for optional vector<Tensor> * adjust some format of funtcion * fix empyt bug
-
由 zmxdream 提交于
-
由 chenjian 提交于
* add new profiler components * fix bug
-
由 Liu-xiandong 提交于
* [KP] Unify .cu and .xpu files with .kps files * fix CI bug in GPU and modify the list * fix conflict * modify the date
-
由 Aurelius84 提交于
* [Phi] Add ClearHolder when re-alloc on new place in DeviceContext * fix hostAlloc * foix inferRT unittest * remove dev_ctx ptr
-
- 26 2月, 2022 7 次提交
-
-
由 YuanRisheng 提交于
-
由 zyfncg 提交于
* remove SetAllocationForOutputTenosr * add place param for copy kernel * recover SetAllocationForOutputTenosr * polish code * fix empty_dev api bug * test=allcases * test=allcases * fix bug * recover empty * recover modify
-
由 From00 提交于
* Move GumbelSoftmax OP to phi * platform::errors -> phi::errors; GumbelSoftmaxGradInferMeta -> backend.h/cc * Use axis util in kernel impl * Remove namespace platform::errors * Use GetCPUEngine in Device Context
-
由 zyfncg 提交于
* Support custom implement for C++ API * rename api_invoke_impl to api_custom_impl * remove manual_api * delete mutable_data in copy_to api * fix problem of copy_to * add unittest for infer_meta_fn_factory * fix split cofig in yaml * fix split cofig in yaml * modify sum api yaml * add copy_to wrapped infermeta * rollback copy impl
-
由 From00 提交于
* Move BilinearTensorProduct OP to phi * Set dtype for Infermeta
-
由 Weilong Wu 提交于
* Support Eager Hook, expose interface to python * Fix CI issue
-
由 Chen Weihang 提交于
-
- 25 2月, 2022 21 次提交
-
-
由 Chen Weihang 提交于
-
由 Feiyu Chan 提交于
-
由 jakpiase 提交于
-
由 sneaxiy 提交于
* add multi tensor apply l2 norm * add multi_tensor_apply code * make sizeof(TensorMeta) smalller * move code to distributed_fused_lamb_op.cu * remove useless FLAGS
-
由 0x45f 提交于
* move eye OP to pten * move size OP to pten * merge develop * fix merge * move files * move erfinv OP to phi * remove comment * move pixel_shuffle OP to phi * remove comment * fix PT_REGISTER * fix NPU * fix CR * remove size_sig.cc for PR-CI-Coverage
-
由 YUNSHEN XIE 提交于
* disable some distribute test case when in CPU test env * disable some test case when in CPU test env * fix
-
由 Zhang Zheng 提交于
-
由 Aganlengzi 提交于
* [phi]migrate increment addmm multinomial cholesky InferShapes to phi * set_dtype and mod MultinomialFunctor
-
由 Qi Li 提交于
* [ROCm] fix Managed Memory Alloc on HIP, test=develop * update, test=develop
-
由 Linjie Chen 提交于
-
由 Zhang Ting 提交于
-
由 Zhang Zheng 提交于
* Optimize perf of softmax_with_cross_entropy * fix * fix * fix accuracy error
-
由 zhangbo9674 提交于
* add ele_add * add ele_mul * add ele_sub * sovle conflict * fix npu * refine ele_add * add ele_mul unittest * refine ele_sub * refine ci * refine unittest
-
由 furnace 提交于
[Phi] mv kernel
-
由 Leo Chen 提交于
* refine randint kernel * refine randperm kernel * refine unbind kernel * support op seed
-
由 joeqiao12 提交于
-
由 Chen Weihang 提交于
* support cudnn kernel moving * polish cmake rules * add unittest for coverage * remove orig kernel * remove softmax cudnn kernel * fix softmax test failed * fix npu func error * resolve conflict * rename gpu dnn kernels * fix name rule error * fix compile error * update fp16 namespace
-
由 YuanRisheng 提交于
* fix bugs * fix bugs
-
由 Li Min 提交于
* Fix compile error on cuda_arch less than 700.
-
由 fwenguang 提交于
-
由 niuliling123 提交于
-