- 16 5月, 2023 1 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 24 4月, 2023 1 次提交
-
-
由 张春乔 提交于
-
- 19 4月, 2023 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add recompute support * update
-
- 18 4月, 2023 1 次提交
-
-
由 张春乔 提交于
-
- 07 4月, 2023 1 次提交
-
-
由 Wang Xin 提交于
-
- 06 4月, 2023 1 次提交
-
-
由 张春乔 提交于
-
- 03 4月, 2023 1 次提交
-
-
由 engineer1109 提交于
-
- 24 3月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs * decouple memory * deal with conflict * fix xpu compile bugs * fix xpu bugs * deal with xpu bugs * fix cmake bugs * fix windows bugs * fix ci bugs * fix ci bugs * delete redundance code * add code for pybind * fix py3 bugs * fix ci bugs
-
- 13 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext * replace platform::TraceEventCollector
-
- 06 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* move DeviceContextPool to phi * add EmplaceExternalContextFunc * update namespace * update cmake * fix bugs and create context_pool_impl.h * replace platform::is_xxx_place * fix bugs * update generator * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix enforce usage * Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27. * fix bugs * rm XPUDeviceContext and CustomDeviceContext * fix bugs * fix fix context init bug * fix bugs after merge * fix bugs * fix name * fix mutable_data * update and fix bugs * fix bugs * update * fix bugs * fix name * fix bugs * merge * fix bugs * create context_pool in phi/backends * create context_pool in phi/backends * fix bugs * fix xpu bugs * fix rocm bugs * fix bugs * fix bugs * fix bugs * fix xpu bugs * update * update * fix bugs * fix bugs
-
- 23 2月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* move fluid generator to phi * move fluid generator to phi * update .gitignore * fix bugs * fix cannot find "glog/logging.h" in "generator.h" * fix bugs
-
- 30 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support stream priority for standalone executor * Fix compile error * Fix compile error * Fix compile error * Fix compile error * Fix compile error
-
- 23 12月, 2022 1 次提交
-
-
由 QingshuChen 提交于
-
- 05 12月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Replace mutable_data with DeviceContext.Alloc in phi kernels * Fix CI errors * Fix CI errors * Fix CI errors, test=kunlun * Fix CI errors, test=kunlun * Handle rnn_functor * Update approvals
-
- 29 11月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
-
- 18 11月, 2022 1 次提交
-
-
由 zyfncg 提交于
* fix bug of zero_allocator in host * fix test compile bug * add unittest * update test
-
- 01 11月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
- 25 9月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 09 9月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add dy2static support * update
-
- 01 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* refine cmake of framework * add deps for dense tensor * fix deps * remove alloc(ctx) * add depends on mkldnn
-
- 03 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 01 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 29 7月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
- 19 7月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 14 7月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* adapt mkldnn kernel in PHI * fix ci compile bugs * fix compile bugs * fix compile bugs * fix compile bugs * fix compile bugs * delete comment * fix compile bugs in windows-inference * delete code for converage * modify code by review * modify code by review * add todo * fix compile bugs * fix compile bugs * fix compile bugs * fix unittest bugsx
-
- 11 7月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 07 7月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* feat(): dynamic_to_static support for ipu. * fix(): format fix. * fix format * fix cpplint error * use phi::errors * fix format * fix format * fix(): add api to restore patched function. * fix(): identity_loss uses cpu place as expected kernel type. * doc(): add IPU dy2static related docs. * fix(): combine test cases. * fix format * fix comment * fix format * apply comment * fix compiling * fix(): align docs. * fix(): fix identity_loss function docs. * fix(): adjust mean and sum in identity_loss. * fix(): minor docs. * move API to paddle.incubate.identity_loss * fix UT Co-authored-by: Nzhaorui chen <zhaoruic@graphcore.ai>
-
- 02 7月, 2022 2 次提交
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 10 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Refactor DeviceContextPool * Adjust header file order
-
- 08 6月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 07 6月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 23 5月, 2022 2 次提交
-
-
由 Jacek Czaja 提交于
* - prototype of reimplemented fixes * - compilation fixes * - compilation fix * - cosmetic info * - hopefully fix * - compilation fix * - supported for nested blocking of cache clearing * - fix * - Unit test to changes * - Compilation fix to windows (hopefully) * - Moved resetting layout to ResetBlob * - fixes after review
-
由 zhouweiwei2014 提交于
* remove is_init_py of RandomGenerator, and use Global Generator if not OP seed * fix comment
-
- 09 4月, 2022 1 次提交
-
-
由 limingshu 提交于
* Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode. * Use the system cudaMalloc and cudaFree to allocate workspace during searching. * Enable switch of two kind of workspace setting methods. Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
- 01 4月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
- 31 3月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix bug that some op has no op_role attr * add mkldnn support for new executor * fit for mkldnn data_transfer * fit for mkldnn data_transfer
-