- 06 Mar, 2022 1 commit
-
-
Committed by Chen Weihang
* replace prefix pt by pd * replace added kernel * revert util change * pd kernel to phi * resolve conflict * resolve conflict
-
- 04 Mar, 2022 2 commits
-
-
Committed by Chen Weihang
* change input vec tensor to pointer * update input between * fix format error * resolve conflict * resolve conflict
-
Committed by hong
* move conv to pten * move conv to pten; test=develop * fix bug; * add conv cudnn impl; test=develop * update * update operator; test=develop * fix bug; test=develop * move operator and prepared_operator to develop; test=develop * resolve conflict; test=develop * remove useless code; test=develop * add dependency; test=develop * fix bug; * add sig.cc; test=develop * fix use_op error; test=develop * fix bug; test=develop * fix bug; test=develop * add conv3d register; test=develop * fix star gan and conv_nn_grad test failed; test=develop * add header; test=develop * manually recover to develop; * resolve conflict; test=develop * remove useless code * fix bug; * remove conv2d_cudnn; test=develop * fix bugs; test=develop * fix cpu rocm compile bugs; test=develop * fix blas error; test=develop * fix compile bug; test=develop * fix windows compile error; test=develop * fix windows error; test=develop * resolve conflict; test=develop
-
- 03 Mar, 2022 2 commits
-
-
Committed by ronnywang
-
Committed by zhangkaihuo
* sparse conv3d: gpu code
-
- 02 Mar, 2022 1 commit
-
-
Committed by hong
* move sgd to phi; test=develop * update * add sgd kernel; test=develop
-
- 01 Mar, 2022 4 commits
-
-
Committed by chentianyu03
* modify infershape utils and rm reduce infershape * merge develop * fix infermeta bug * add IsForInferShape func in ArgumentMappingContext * add reduce_mean infermeta * modify annotation * add default dims
-
Committed by zyfncg
* remove SetAllocationForOutputTenosr * add place param for copy kernel * recover SetAllocationForOutputTenosr * polish code * fix empty_dev api bug * remove reseting dtype and layout for output in executor * fix merge bug * [Phi] Add ClearHolder when re-alloc on new place in DeviceContext * fix hostAlloc * remove setting output allocation * remove full_kernel_impl.h * fix bug of xpu full_like Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
-
Committed by Chen Weihang
* support kps backend and compile * resolve conflict * fix kps backend trans * test in xpu2 device * remove dummy kernel
-
Committed by zyfncg
* add multi input for infer_shape * support multi output for infershape * fix split bug * fix bug of concat * support vector<MetaTensor*> in infrt * fix bug
-
- 28 Feb, 2022 4 commits
-
-
Committed by Aganlengzi
-
Committed by zhangkaihuo
* fix incorrect dims settings * sparse conv3d * fix out dims * test performance * test large shape success * opt scatter, double performance * test float16 * remove profiling code * remove pten * opt code lines * correct boundary judgment * only cpu * test ci * test ci * remove the including paddle/fluid header; extract the common function * opt code lines * use DenseTensor::data() instead of mutable_data * return rulebook for backward * specify layout * rename: conv -> sparse_conv3d
-
Committed by Chen Weihang
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
Committed by Aurelius84
* [Phi] Add ClearHolder when re-alloc on new place in DeviceContext * fix hostAlloc * fix inferRT unittest * remove dev_ctx ptr
-
- 26 Feb, 2022 1 commit
-
-
Committed by zyfncg
* remove SetAllocationForOutputTenosr * add place param for copy kernel * recover SetAllocationForOutputTenosr * polish code * fix empty_dev api bug * test=allcases * test=allcases * fix bug * recover empty * recover modify
-
- 25 Feb, 2022 2 commits
-
-
Committed by Linjie Chen
-
Committed by Chen Weihang
* support cudnn kernel moving * polish cmake rules * add unittest for coverage * remove orig kernel * remove softmax cudnn kernel * fix softmax test failed * fix npu func error * resolve conflict * rename gpu dnn kernels * fix name rule error * fix compile error * update fp16 namespace
-
- 24 Feb, 2022 4 commits
-
-
Committed by Chen Weihang
* rename pten to phi * fix infrt compile failed * resolve conflict
-
Committed by Aurelius84
* [Phi] Fix XPU OP segmentation Fault problem * fix cast_op_xpu in kunlun1 * fix cast_op_xpu in kunlun1
-
Committed by chentianyu03
* modify infershape by args_def * add optional type for infermeta * add optional type for infermeta * add optional type for infermeta * support scalar type * change OptionalInputAt function to non-template * support phi::DataType
-
Committed by xiongkun
-
- 23 Feb, 2022 2 commits
-
-
Committed by Leo Chen
* move randperm to phi * fix npu * fix memory::Copy
-
Committed by Liu-xiandong
* [KP] Add elementwise add xpu, test=develop * modify the File Permissions * modify the copyright time * modify code style * modify code style
-
- 22 Feb, 2022 3 commits
-
-
Committed by Aganlengzi
* [custom kernel]Delete useless * change RegType enum names * mod notes * merge * update
-
Committed by xiongkun
* change Vector to std::vector and provide MixVector class as a helper wrapper class * solve the multi-gpu hang problem * remove the duplicate template instantiation * Copy vector to cpu * add CopyToCPU * xxx * final version: fix the problem of all reduce * remove mixvector dependence * fix * merge * fix code * fix by CI
-
Committed by Chen Weihang
* unify register macro * rename declare macro * fix infrt error
-
- 21 Feb, 2022 3 commits
-
-
Committed by Aganlengzi
* [PluggableDevice]custom kernel to pten core structs * mod extension.h for custom op * compatible python for CI * support custom context * refactor to pten * fix windows and ut
-
Committed by YuanRisheng
* remove infershape and Xshape * add xshape * fix bugs when run ci * fix bugs when run ci * fix bugs when run infrt test * pass coverage
-
Committed by zyfncg
* add copy_to wrapped infermeta * test=allcases * test=allcases * test=allcases
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-