- 08 6月, 2022 1 次提交
-
-
由 niuliling123 提交于
Reduce amax/amin frobenius_norm_kerne原始实现为Eigen实现,文件编译时间较长,因此本PR将其替换为KP实现 删除DefaultElementwiseOperator中重复功能支持,减少elementwise_double_grad OP编译时间
-
- 10 5月, 2022 1 次提交
-
-
由 fwenguang 提交于
* [MLU] add mlu new profiler (#41138) * [MLU] add mlu new profiler * fix format * [MLU] support add callback to stream (#41831) * [MLU] add gather mlu kernel (#41969) * [MLU] add mlu activation kernels (#41751)
-
- 28 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* Optimize the performanece of sum api (#42231) * optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log * Add move construct for KernelSignature (#42253) * add move construct for KernelSignature * add noexcept * fix cherry-pick problem
-
- 30 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
This reverts commit 054fc997.
-
- 29 3月, 2022 2 次提交
-
-
由 tianshuo78520a 提交于
This reverts commit 9c0eaada.
-
由 津 提交于
* [Phi] trans logsumexp op * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * add sig * fix sig bugs * fix sig bugs * fix xpu bugs * fix review bugs * test=develop
-
- 24 3月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 22 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* modify out and out_grad order in reduce_grad_kernel * delete unsed boolReduceKernel * fix conflict
-
- 21 3月, 2022 1 次提交
-
-
由 From00 提交于
-
- 18 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move reduce_mean_grad kernel into phi * move reduce_max/min_grad into phi * remove raw max/min grad kernel * fix bug * fix max/min grad error * move all reduce_grad kernel into one file * add prod grad kernel * add infermeta for prod kernel
-
- 15 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add reduce_min kernel * remove raw reduce_min kernel * add reduce min * add reduce any all impl * add bool reduce Kernel * remove raw any/all kernel * add any all kernel * rm comment
-
- 11 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add reduce_sum grad kernel * add reduce_grad * modify reduce grad * update reduce grad functions * fix build error * add argument mapping * move cast input after grad * add dims.size=1 cpu reduce_sum grad compute method * update reduce grad GPU * remove raw reduce_sum_grad kernel * modify header files * add namespace funcs for reduce_grad_funcstions
-
- 09 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add reduce_max kernel * add reduce max kernel * update reduce max Argumentmapping * remove reduce_max kernel * remove reduce_max kernel * add reduce max infermeta * rename reduce infermeta
-
- 08 3月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 06 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* replace prefix pt by pd * replace added kernel * revert util change * pd kernel to phi * resolve conflict * resolve conflict
-
- 04 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move reduce gpu impl funcs into pten/kernels/funcs * change reduce header name and namespace * fix spell word error * change mutable_data to dev_ctx.Alloc * modify place to devcontex * format code style * fix build error * fix build error * fix conflict
-
- 02 3月, 2022 2 次提交
- 01 3月, 2022 2 次提交
-
-
由 chentianyu03 提交于
* modify infershape utils and rm reduce infershape * merge develop * fix infermete bug * add IsForInferShape func in ArgumentMappingContext * add reduce_mean infermeta * modify annotation * add default dims
-
由 zhangbo9674 提交于
* add layer norm * add p norm * add reduce sum * refine layer norm register bf16 for cudnn811 * add bf16 cast for hip * add unittest * refine rocm * refine layer_norm unittest * refine reduce op * refine unittest * enhance atol for reduce unittest
-
- 28 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
- 25 2月, 2022 1 次提交
-
-
由 joeqiao12 提交于
-
- 21 2月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* rm reduce_sum raw kernel * remove reduce_mean kernel * remove reduce_mean kernel * reduce support int and int64_t * mean support int and int64_t type
-
- 20 2月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
由 sneaxiy 提交于
* add more op int16 support * fix xpu ci
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 18 2月, 2022 1 次提交
-
-
由 qipengh 提交于
* [MLU]add matmul and matmul_v2 op * [MLU] fix data_type and del matmul * [MLU] fix compile error * [MLU] fix ci_check error
-
- 15 2月, 2022 2 次提交
-
-
由 jakpiase 提交于
-
由 Aurelius84 提交于
* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 11 2月, 2022 2 次提交
-
-
由 Feiyu Chan 提交于
* move operators/math/math_function_* to pten/kernels/func * namespace from `paddle::operators::math` to `pten::funcs`
-
由 Chen Weihang 提交于
* move grad get expected pten kernel args * fix reduce sum error * fix element_sub_grad failed * revert kernel judge change
-
- 10 2月, 2022 1 次提交
-
-
由 furnace 提交于
[NPU] add reduce_min
-
- 09 2月, 2022 2 次提交
-
-
由 niuliling123 提交于
-
由 Yiqun Liu 提交于
-
- 08 2月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 06 2月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 27 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove remake densetensor * fix eager test error * fix bug in eager
-
- 26 1月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
-
- 25 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
-
由 Zhang Jun 提交于
* [inference] update convert reduce op&ut,test=develop * update * update * update * add int32 support * add int32 support * add comments * trt < 7.0 do not support int32 * test=develop * update * test=develop
-