- 29 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* open header for custom kernel * add core utils * tidy core code * tify header * tidy include * tidy namespace * resolve conflit * fix unittest and coverage * remove platform using * resolve conflict * resolve conflict * fix digamma namespace error * fix xpu full kernel error * fix xpu full kernel error * polish details * add place for lib storage
-
- 28 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* update forward argument mapping * fix compile failed * fix test failed
-
- 27 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 25 1月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again
-
- 24 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
[Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048) * refactor elementwise add grad * fix compile bugs * fix unit test bugs * fix file conflicts * fix bugs when buildPtenContext
-
- 21 1月, 2022 3 次提交
-
-
由 YuanRisheng 提交于
* add kernel for c++ api * fix compile bugs * fix kunlun compile bugs * perfect cmake * fix compile bugs when run ci-inference * fix compile bugs * add non-raw kernel for fluid op * fix compile bugs * fix compile bugs * fix unit test bug
-
由 Aurelius84 提交于
* Migrate Dim and DDim from paddle::framework into pten namespace * fix paddle::framework::Array * fix framework::Array
-
由 Wilber 提交于
* add cpu_context. * update * update * update * update * update * fix ci problem * fix npu ci problem * update * fix ci compile
-
- 20 1月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Migrate bfloat16/float16/complex from platform into pten::common * fix typo * fix code style
-
- 18 1月, 2022 3 次提交
-
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
由 Yiqun Liu 提交于
-
由 YuanRisheng 提交于
-
- 17 1月, 2022 2 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
由 Zhang Ting 提交于
-
- 14 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 13 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h * move scale dev api into kernel.h and remove infermete.h * rm paddle/pten/include/math.h * rm paddle/pten/include/math.h * rm include dir * rm paddle/pten/include/math.h * fix conflict with develop branch * rm devContext in conj_op.h * add the missing complex_kernel header
-
- 12 1月, 2022 4 次提交
-
-
由 Lijunhui 提交于
* init elem_max_grad op * optimize code and reply review comments * ternary functors * apply new reduce func * move functor to .h * multi-outputs init * rearrange code * modifed functors * optimizer code * pass nullptr * revert the last change as seg fault occurs * optimize code * remove inplace * remove comments
-
由 Lijunhui 提交于
* ini commit * multi-outputs init commit * optimize code * remove inplace
-
由 YuanRisheng 提交于
* refactor the impl of elementwise grad kernel * refactor impl of elementwise grad kernel(cuda) * fix compile bugs
-
由 limingshu 提交于
* first commit * fix wrong filename * fix the wrong spell name * fix gpu config warper * modify according to pr advices * fix GpuLauchConfig1D api bugs * change the config for dropout grad * fix bugs * modification according to pr advices * modification according to pr advices
-
- 11 1月, 2022 4 次提交
-
-
由 Zhang Zheng 提交于
* fix bug when inplace strategy * fix * fix * fix * fix * fix
-
由 niuliling123 提交于
-
由 limingshu 提交于
* fix the wrong filename * first commit * first commit * remove rest useless headers * for ci approval
-
由 Sing_chan 提交于
* support vs2019 compilation in windows * not modify pow_op's original compute logic
-
- 10 1月, 2022 1 次提交
-
-
由 taixiurong 提交于
-
- 06 1月, 2022 3 次提交
-
-
由 limingshu 提交于
* fix the wrong filename * first commit
-
由 YuanRisheng 提交于
* move gpu_impl of elementwise kernel * change copyright to 2022
- 05 1月, 2022 2 次提交
-
-
由 Lijunhui 提交于
* init commit: new elem_mul_grad * add template speciallization for complex in multiply * reply review comments * correct dx and dy computation when T is complex * reply review comments * update to new ReduceRunctor * mul-output broadcast * call functions * call functions with comments * remove comments
-
由 crystal 提交于
* add elementwise div * move mul and div grad functor * Combine multiple CUDA kernels * Update the reduce interface call * add multi-output * add multi-output div * add branch judge * Package branch * Combine the x and y functions into one
-
- 04 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * move cpu_impl of elementwise kernel to new directory
-
- 31 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs
-
- 29 12月, 2021 1 次提交
-
-
由 limingshu 提交于
-
- 28 12月, 2021 1 次提交
-
-
由 limingshu 提交于
* first commit * pass ctest of elementwise_div_grad
-
- 21 12月, 2021 1 次提交
-
-
由 arlesniak 提交于
-
- 20 12月, 2021 1 次提交
-
-
由 sneaxiy 提交于
* support FP16 for more ops * add amp list tests * refine reduce_mean_grad * fix OP benchmark ci * fix fp16 reduce_mean * updat ut, but still have some problems * remove mean/reduce_mean fp16 kernel
-
- 18 12月, 2021 1 次提交
-
-
由 Feiyu Chan 提交于
* add complex op and `paddle.complex`.
-
- 17 12月, 2021 1 次提交
-
-
由 limingshu 提交于
* fix_bugs_for_elementwise_branch_selection * fix merge_dims bugs * fix all influenced file
-
- 16 12月, 2021 2 次提交
-
-
由 LJQ❤️ 提交于
Add elementwise_fmax and elementwise_fmin operators
-
由 niuliling123 提交于
* Add the transformop parameter in TensorReduceFunctorImpl
-