- 08 6月, 2022 1 次提交
-
-
由 niuliling123 提交于
Reduce amax/amin frobenius_norm_kerne原始实现为Eigen实现,文件编译时间较长,因此本PR将其替换为KP实现 删除DefaultElementwiseOperator中重复功能支持,减少elementwise_double_grad OP编译时间
-
- 22 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* modify out and out_grad order in reduce_grad_kernel * delete unsed boolReduceKernel * fix conflict
-
- 28 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
- 25 2月, 2022 1 次提交
-
-
由 joeqiao12 提交于
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 15 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 11 2月, 2022 2 次提交
-
-
由 Feiyu Chan 提交于
* move operators/math/math_function_* to pten/kernels/func * namespace from `paddle::operators::math` to `pten::funcs`
-
由 Chen Weihang 提交于
* move grad get expected pten kernel args * fix reduce sum error * fix element_sub_grad failed * revert kernel judge change
-
- 09 2月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 27 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove remake densetensor * fix eager test error * fix bug in eager
-
- 25 1月, 2022 4 次提交
-
-
由 YuanRisheng 提交于
-
由 niuliling123 提交于
This reverts commit 9059ef69.
-
由 niuliling123 提交于
-
由 Noel 提交于
-
- 21 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
* add kernel for c++ api * fix compile bugs * fix kunlun compile bugs * perfect cmake * fix compile bugs when run ci-inference * fix compile bugs * add non-raw kernel for fluid op * fix compile bugs * fix compile bugs * fix unit test bug
-
由 Wilber 提交于
* add cpu_context. * update * update * update * update * update * fix ci problem * fix npu ci problem * update * fix ci compile
-
- 18 1月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
- 13 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h * move scale dev api into kernel.h and remove infermete.h * rm paddle/pten/include/math.h * rm paddle/pten/include/math.h * rm include dir * rm paddle/pten/include/math.h * fix conflict with develop branch * rm devContext in conj_op.h * add the missing complex_kernel header
-
- 05 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * fix compile bugs * move reduce files by new rule * add set header * format code style * merge develop and fix conflict * merge develop and fix conflict Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
- 24 12月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* combine reduce_cuda codes * support float16 in pten redcue_mean * replace ReduceCudaKernel impl with pten reduce impl * mv reduce funcs into reduce_cuda_impl * rm unsed codes and headers * mv GetReduceDim into reduce_cuda_impl * recover GetReduceDim in reduce_op.h * add new dispatch macro * fix pool op output not inited and cause transform to pten::denseTensor error * fix output tensor not initialized error * rename new dispatch macro and format code style * rm reduce_functor_op.h file
-
- 21 12月, 2021 1 次提交
-
-
由 sneaxiy 提交于
* mean first version * fix scalar mean * add fp16 dtype for api
-
- 17 12月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* modify sum mean args * add GetExpectedPtenKernelArgs for redcue_op * modify kernel args number * modify kernel args number
-
- 16 12月, 2021 1 次提交
-
-
由 niuliling123 提交于
* Add the transformop parameter in TensorReduceFunctorImpl
-
- 13 12月, 2021 1 次提交
-
-
由 Noel 提交于
-
- 09 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
-
- 29 11月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add pten reduce kernel * add reduce_sum kernel * update attribute args and order * make out dtype undefined * fix empty input error * merge develop branch * rename sum as reduce function * rename sum as reduce function * fix reducekernelImpl args error * add reduce cuda kernel * modify dims type to const & * remove unsed log * fix reduce_all out eigen function error * remove unused codes * add the missing sum api define and testcase * merge develop branch * fix sum test axis value error * replace pten mean kernel with reduce_mean * revcover meam cuda to original implement
-
- 08 9月, 2021 1 次提交
-
-
由 Zhong Hui 提交于
-
- 05 8月, 2021 1 次提交
-
-
由 hong 提交于
* first test version * add test exec; * add data transfer; test=develop * add new exec head; * add memcpy; test=develop * add python fetch * add new test * add graph node; test=develop * remove useless new executor test; test=develop * remove gperf dependency; test=develop * fix compile bugs; test=develop * remove useless code; test=develop * remove useless code; test=develop * add uni test; test=develop * polish code; test=develop * polish code; test=develop * add interpreter cmakefile; test=develop * remove useless code; test=develop
-
- 02 8月, 2021 1 次提交
-
-
由 furnace 提交于
* [NPU] add reduce_max * [NPU] delete skipIf * [NPU] add atrrs support or check * [NPU] add attr out_dtype * [NPU] delete debug codes
-
- 02 7月, 2021 1 次提交
-
-
由 niuliling123 提交于
-
- 30 4月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 21 4月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 14 4月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 17 9月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
enhance reduce op which can reduce tensor with arbitrary rank
-
- 25 8月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 21 8月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 13 8月, 2020 1 次提交
-
-
由 wawltor 提交于
Update the code for the min,max, test=develop
-
- 09 6月, 2020 1 次提交
-
-
由 wawltor 提交于
Add the support the 5d,6d tensor support for the reduce ops; Add the same time, the compile time, it was 22 minutes, it was 21 minutes after fixed.
-
- 17 4月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
* fix error message of l2_normalize, matmul, mean, etc. * add the test case for those ops
-