- 20 1月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Migrate bfloat16/float16/complex from platform into pten::common * fix typo * fix code style
-
- 18 1月, 2022 3 次提交
-
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
由 Yiqun Liu 提交于
-
由 YuanRisheng 提交于
-
- 17 1月, 2022 2 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
由 Zhang Ting 提交于
-
- 14 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 13 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h * move scale dev api into kernel.h and remove infermete.h * rm paddle/pten/include/math.h * rm paddle/pten/include/math.h * rm include dir * rm paddle/pten/include/math.h * fix conflict with develop branch * rm devContext in conj_op.h * add the missing complex_kernel header
-
- 12 1月, 2022 4 次提交
-
-
由 Lijunhui 提交于
* init elem_max_grad op * optimize code and reply review comments * ternary functors * apply new reduce func * move functor to .h * multi-outputs init * rearrange code * modifed functors * optimizer code * pass nullptr * revert the last change as seg fault occurs * optimize code * remove inplace * remove comments
-
由 Lijunhui 提交于
* ini commit * multi-outputs init commit * optimize code * remove inplace
-
由 YuanRisheng 提交于
* refactor the impl of elementwise grad kernel * refactor impl of elementwise grad kernel(cuda) * fix compile bugs
-
由 limingshu 提交于
* first commit * fix wrong filename * fix the wrong spell name * fix gpu config warper * modify according to pr advices * fix GpuLauchConfig1D api bugs * change the config for dropout grad * fix bugs * modification according to pr advices * modification according to pr advices
-
- 11 1月, 2022 4 次提交
-
-
由 Zhang Zheng 提交于
* fix bug when inplace strategy * fix * fix * fix * fix * fix
-
由 niuliling123 提交于
-
由 limingshu 提交于
* fix the wrong filename * first commit * first commit * remove rest useless headers * for ci approval
-
由 Sing_chan 提交于
* support vs2019 compilation in windows * not modify pow_op's original compute logic
-
- 10 1月, 2022 1 次提交
-
-
由 taixiurong 提交于
-
- 06 1月, 2022 3 次提交
-
-
由 limingshu 提交于
* fix the wrong filename * first commit
-
由 YuanRisheng 提交于
* move gpu_impl of elementwise kernel * change copyright to 2022
- 05 1月, 2022 2 次提交
-
-
由 Lijunhui 提交于
* init commit: new elem_mul_grad * add template speciallization for complex in multiply * reply review comments * correct dx and dy computation when T is complex * reply review comments * update to new ReduceRunctor * mul-output broadcast * call functions * call functions with comments * remove comments
-
由 crystal 提交于
* add elementwise div * move mul and div grad functor * Combine multiple CUDA kernels * Update the reduce interface call * add multi-output * add multi-output div * add branch judge * Package branch * Combine the x and y functions into one
-
- 04 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * move cpu_impl of elementwise kernel to new directory
-
- 31 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs
-
- 29 12月, 2021 1 次提交
-
-
由 limingshu 提交于
-
- 28 12月, 2021 1 次提交
-
-
由 limingshu 提交于
* first commit * pass ctest of elementwise_div_grad
-
- 21 12月, 2021 1 次提交
-
-
由 arlesniak 提交于
-
- 20 12月, 2021 1 次提交
-
-
由 sneaxiy 提交于
* support FP16 for more ops * add amp list tests * refine reduce_mean_grad * fix OP benchmark ci * fix fp16 reduce_mean * updat ut, but still have some problems * remove mean/reduce_mean fp16 kernel
-
- 18 12月, 2021 1 次提交
-
-
由 Feiyu Chan 提交于
* add complex op and `paddle.complex`.
-
- 17 12月, 2021 1 次提交
-
-
由 limingshu 提交于
* fix_bugs_for_elementwise_branch_selection * fix merge_dims bugs * fix all influenced file
-
- 16 12月, 2021 3 次提交
-
-
由 LJQ❤️ 提交于
Add elementwise_fmax and elementwise_fmin operators
-
由 niuliling123 提交于
* Add the transformop parameter in TensorReduceFunctorImpl
-
由 YuanRisheng 提交于
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile * modify register name * fix compile bugs
-
- 15 12月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
test=document_fix
-
- 09 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
-
- 08 12月, 2021 2 次提交
-
-
由 YuanRisheng 提交于
* add alias kernel name * modify code as suggestions
-
由 crystal 提交于
* add boardcast_sub * add boardcast_sub
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 27 11月, 2021 1 次提交
-
-
由 Aganlengzi 提交于
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enfoce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 24 11月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* elementwise_mul refactor * perfect code in test * delete redundant code * fix bugs when run test_multiply * adjust the location of macro * fix bugs when run ci
-