- 28 2月, 2022 18 次提交
-
-
由 Lijunhui 提交于
* init grid_sampler with mode=bilinear * solve error * rm fill constant * rm head * change block size * change block size * optimize * apply existing config
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Add host_trace_level env variable * Revert "Optimize perf of softmax_with_cross_entropy (#39553)" This reverts commit bbe5228c. Co-authored-by: Nliutiexing <liutiexing@google.com> Co-authored-by: NZzSean <18818272991@163.com>
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add log for Executor * Profile Allocators * Profile Allocators * adjust interface * remove lock for set * fix Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 zhangkaihuo 提交于
* fix incorrect dims settings * sparse conv3d * fix out dims * test performance * test large shape success * opt scatter, double performance * test float16 * remove profiling code * remove pten * opt code lines * correct boundary judgment * only cpu * test ci * test ci * remove the including paddle/fluid header; extract the conmmon function * opt code lines * use DenseTensor::data() instead of mutable_data * return rulebook for backward * specify layout * rename:conv -> sparse_conv3d
-
由 tianshuo78520a 提交于
-
由 furnace 提交于
* [Phi] move truncated_gaussian_random, copy kernels * [Phi] move truncated_gaussian_random, kernel register * [Phi] move truncated_gaussian_random, delete useless codes
-
由 Shang Zhizhou 提交于
* add some trt layers * trtOpConverter pass ok * add comments * add constraints to some attrs in the pd_lower_to_trt patterns * update constraint * fix code style * update pass name * update code style * change .hpp.inc to .cc.inc in mlir_add_rewriter
-
由 zhangchunle 提交于
* update;test=cpu-py3
-
由 Chen Weihang 提交于
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostTracer * fix * update * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 zhangbo9674 提交于
* refine bf16 amp-o1 logic * refine amp GLOG * refine unittest * refine unittest
-
由 zyfncg 提交于
* remove empty kernel in fluid and adjust the param of empty dev_api * polish code * revert fluid empty kernel
-
由 Wilber 提交于
-
由 zyfncg 提交于
* fix selected_rows bug in C++ API * add optional for C++ APIO * data transform support optional * remove data transform for optional vector<Tensor> * adjust some format of funtcion * fix empyt bug
-
由 zmxdream 提交于
-
由 chenjian 提交于
* add new profiler components * fix bug
-
由 Liu-xiandong 提交于
* [KP] Unify .cu and .xpu files with .kps files * fix CI bug in GPU and modify the list * fix conflict * modify the date
-
由 Aurelius84 提交于
* [Phi] Add ClearHolder when re-alloc on new place in DeviceContext * fix hostAlloc * foix inferRT unittest * remove dev_ctx ptr
-
- 26 2月, 2022 7 次提交
-
-
由 YuanRisheng 提交于
-
由 zyfncg 提交于
* remove SetAllocationForOutputTenosr * add place param for copy kernel * recover SetAllocationForOutputTenosr * polish code * fix empty_dev api bug * test=allcases * test=allcases * fix bug * recover empty * recover modify
-
由 From00 提交于
* Move GumbelSoftmax OP to phi * platform::errors -> phi::errors; GumbelSoftmaxGradInferMeta -> backend.h/cc * Use axis util in kernel impl * Remove namespace platform::errors * Use GetCPUEngine in Device Context
-
由 zyfncg 提交于
* Support custom implement for C++ API * rename api_invoke_impl to api_custom_impl * remove manual_api * delete mutable_data in copy_to api * fix problem of copy_to * add unittest for infer_meta_fn_factory * fix split cofig in yaml * fix split cofig in yaml * modify sum api yaml * add copy_to wrapped infermeta * rollback copy impl
-
由 From00 提交于
* Move BilinearTensorProduct OP to phi * Set dtype for Infermeta
-
由 Weilong Wu 提交于
* Support Eager Hook, expose interface to python * Fix CI issue
-
由 Chen Weihang 提交于
-
- 25 2月, 2022 15 次提交
-
-
由 Chen Weihang 提交于
-
由 Feiyu Chan 提交于
-
由 jakpiase 提交于
-
由 sneaxiy 提交于
* add multi tensor apply l2 norm * add multi_tensor_apply code * make sizeof(TensorMeta) smalller * move code to distributed_fused_lamb_op.cu * remove useless FLAGS
-
由 0x45f 提交于
* move eye OP to pten * move size OP to pten * merge develop * fix merge * move files * move erfinv OP to phi * remove comment * move pixel_shuffle OP to phi * remove comment * fix PT_REGISTER * fix NPU * fix CR * remove size_sig.cc for PR-CI-Coverage
-
由 YUNSHEN XIE 提交于
* disable some distribute test case when in CPU test env * disable some test case when in CPU test env * fix
-
由 Zhang Zheng 提交于
-
由 Aganlengzi 提交于
* [phi]migrate increment addmm multinomial cholesky InferShapes to phi * set_dtype and mod MultinomialFunctor
-
由 Qi Li 提交于
* [ROCm] fix Managed Memory Alloc on HIP, test=develop * update, test=develop
-
由 Linjie Chen 提交于
-
由 Zhang Ting 提交于
-
由 Zhang Zheng 提交于
* Optimize perf of softmax_with_cross_entropy * fix * fix * fix accuracy error
-
由 zhangbo9674 提交于
* add ele_add * add ele_mul * add ele_sub * sovle conflict * fix npu * refine ele_add * add ele_mul unittest * refine ele_sub * refine ci * refine unittest
-
由 furnace 提交于
[Phi] mv kernel
-
由 Leo Chen 提交于
* refine randint kernel * refine randperm kernel * refine unbind kernel * support op seed
-