- 05 12月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Replace mutable_data with DeviceContext.Alloc in phi kernels * Fix CI errors * Fix CI errors * Fix CI errors, test=kunlun * Fix CI errors, test=kunlun * Handle rnn_functor * Update approvals
-
- 18 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* move "paddle/phi/backends/gpu/gpu_device_function.h" to phi * update copyright years * rm "fluid/platform/device/gpu/gpu_device_function.h" in phi * fix rocm-complie bugs
-
- 11 11月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 10 11月, 2022 1 次提交
-
-
由 PuQing 提交于
* rm fluid gpu_launch_config * fix type
-
- 02 11月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 01 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* refine cmake of framework * add deps for dense tensor * fix deps * remove alloc(ctx) * add depends on mkldnn
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 28 4月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
* set device id of Place() to get GPUContext needed by LimitGridDim in ElemwiseGradBroadcast * fix code style
-
- 03 4月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
* limit grid dim for index select * mv LimitGridDim into gpu_launch_config.h * fix conflicts * fix conflicts * fix code style * set block to 256 * fix grid setting * set dtype of block_dim to unsigned int
-
- 25 3月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
* add maximum limit for grid of reduce, elementwise and gather * add {} after if
-
- 07 3月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move elementwise_div grad * change mutable_data to alloc * fix compile bugs
-
- 02 3月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move elementwise function to funcs directory * fix compile bugs * modify according to comment
-
- 26 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove SetAllocationForOutputTenosr * add place param for copy kernel * recover SetAllocationForOutputTenosr * polish code * fix empty_dev api bug * test=allcases * test=allcases * fix bug * recover empty * recover modify
-
- 23 2月, 2022 1 次提交
-
-
由 Liu-xiandong 提交于
* [KP] Add elementwise add xpu, test=develop * modify the File Permissions * modify the copyright time * modify code style * modify code style
-
- 22 2月, 2022 1 次提交
-
-
由 xiongkun 提交于
* change Vector to std::vector and provide MixVector class as a helper wrapper class * solve the multi-gpu hang problem * remove the duplicate template instantialize * Copy vector to cpu * add CopyToCPU * xxx * final version: fix the problem of all reduce * remove mixvector dependence * fix * merge * fix code * fix by CI
-
- 20 2月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
由 Yiqun Liu 提交于
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 11 2月, 2022 1 次提交
-
-
由 Zhang Ting 提交于
* improve backward performance * support different dtypes for elementwise ops
-
- 09 2月, 2022 2 次提交
-
-
由 niuliling123 提交于
-
由 Yiqun Liu 提交于
-
- 06 2月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 27 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
-
由 Aurelius84 提交于
* Support allocate_from in Tensor and allocate_data in Context * fix #ifdef CUDA * fix cycle depends * fix test_xxx_dev_api failed * fix windows compiling error * fix unittest * modify into PImpl * fix selected rows * add TODO comment * refine interface according reviewer
-
- 25 1月, 2022 2 次提交
-
-
由 niuliling123 提交于
This reverts commit 9059ef69.
-
由 niuliling123 提交于
-
- 24 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
[Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048) * refactor elementwise add grad * fix compile bugs * fix unit test bugs * fix file conflicts * fix bugs when buildPtenContext
-
由 石晓伟 提交于
* updates callers, test=develop * updates tensor, test=develop * fixes errors, test=develop * remove some dtypes, test=develop * fix errors in the base storage modification, test=develop * fixes a bug, test=develop * fixes the bugs in push the whole, test=develop * updates, test=develop * update * update, test=develop * fixes the mac-py3 CI, test=develop * remove the storage impl, test=develop * updates some codes, test=develop * update, test=develop * updates pten allocation, test=develop
-
- 21 1月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Migrate Dim and DDim from paddle::framework into pten namespace * fix paddle::framework::Array * fix framework::Array
-
- 18 1月, 2022 2 次提交
-
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
由 YuanRisheng 提交于
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 14 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 12 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
* refactor the impl of elementwise grad kernel * refactor impl of elementwise grad kernel(cuda) * fix compile bugs
-
由 limingshu 提交于
* first commit * fix wrong filename * fix the wrong spell name * fix gpu config warper * modify according to pr advices * fix GpuLauchConfig1D api bugs * change the config for dropout grad * fix bugs * modification according to pr advices * modification according to pr advices
-
- 11 1月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 06 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move gpu_impl of elementwise kernel * change copyright to 2022
-
- 29 12月, 2021 1 次提交
-
-
由 limingshu 提交于
-
- 28 12月, 2021 1 次提交
-
-
由 limingshu 提交于
* first commit * pass ctest of elementwise_div_grad
-
- 27 12月, 2021 1 次提交
-
-
由 limingshu 提交于
* No harm to KP * Pass the compile stage * change the WriteData function * fix template bugs and pass ctest of current elementwise * for passing partial template specialization of tempalte function in CI-ROCm * To make 'WriteData' funtion flexible. * a less harmful way to support multi-output * a less harmful way to support multi-output
-