- 14 Sep, 2022 1 commit
-
-
Committed by JingZhuangzhuang
* Delay TensorRT registry
* Add unused define
* Fix TensorRT test
* fix function to reference
* Update trt_plugin.h
-
- 08 Sep, 2022 1 commit
-
-
Committed by piotrekobi
* gaussian random
* mkldnn to onednn renaming
* fix merge conflicts
* remove fluid code
* onednn renaming
* Move classes from mkldnn_reuse.h to onednn_reuse.h
* Migrate pool+grad, clip+grad and cast oneDNN kernels to PHI
* Refactor grad kernels into separate files
* Fix CI failures
* Fix Codestyle
* Implement reviewer suggestions
* Add new lines after includes for readability
Co-authored-by: Silv3S <slawomir.siwek@intel.com>
-
- 06 Sep, 2022 1 commit
-
-
Committed by Wilber
-
- 05 Sep, 2022 2 commits
-
-
Committed by piotrekobi
* gaussian random
* mkldnn to onednn renaming
* fix merge conflicts
* remove fluid code
* onednn renaming
* Move classes from mkldnn_reuse.h to onednn_reuse.h
* Move more functions from mkldnn_helper.h to onednn_helpper.h
* Change MKLDNN to OneDNN in VLOG message
Co-authored-by: Silv3S <slawomir.siwek@intel.com>
-
Committed by sneaxiy
-
- 04 Sep, 2022 1 commit
-
-
Committed by Sławomir Siwek
* gaussian random
* mkldnn to onednn renaming
* fix merge conflicts
* remove fluid code
* onednn renaming
* change header path
* change fluid import to phi
-
- 02 Sep, 2022 1 commit
-
-
Committed by kangguangli
-
- 01 Sep, 2022 1 commit
-
-
Committed by Leo Chen
* refine cmake of framework
* add deps for dense tensor
* fix deps
* remove alloc(ctx)
* add depends on mkldnn
-
- 24 Aug, 2022 1 commit
-
-
Committed by Rayman
* [Hackathon No.34] Optimize poisson op
* [poisson] code style fix
* modify code style
* prevent from big number
* modify code style
* modify code style
* modify import
* modify import
* modify code style
-
- 23 Aug, 2022 1 commit
-
-
Committed by ronnywang
* [CustomDevice] add profiler apis
* migrate CalculateEstOccupancy into cuda_tracer
* update
* add ut
-
- 22 Aug, 2022 1 commit
-
-
Committed by ronnywang
-
- 18 Aug, 2022 1 commit
-
-
Committed by zhangxiaoci
* change to async mode for xpu multi-card training in static graph mode
* minor bugfix
* irrelevant. move to another pr
* move change to other pr
* fix stream issue
* fix 'stream not meet with current context' error
* fix branch diverge, test=kunlun
-
- 10 Aug, 2022 2 commits
-
-
Committed by zhangxiaoci
* add macro control in enforce_xpu.h, test=kunlun
* minor bugfix
* minor bugfix
-
Committed by Leo Chen
* set cuda device before run
* add header file
* fix compile
-
- 05 Aug, 2022 1 commit
-
-
Committed by Qi Li
-
- 01 Aug, 2022 2 commits
-
-
Committed by zhouweiwei2014
-
Committed by Wilber
* infer context fix place error.
* update
* update
-
- 29 Jul, 2022 1 commit
-
-
Committed by Leo Chen
* init
* move CUDAStream to phi
* fix compilation
* merge develop
* add stream_owned_ member
* split cuda_stream.h
* fix cpu compile
* fix constructor
* fix bug
* fix windows compile
* fix inference test_levit
* fix windows tests
-
- 26 Jul, 2022 2 commits
- 22 Jul, 2022 1 commit
-
-
Committed by yuguo
-
- 20 Jul, 2022 1 commit
-
-
Committed by Leo Chen
-
- 19 Jul, 2022 1 commit
-
-
Committed by Leo Chen
* compile into one static library
* fix xpu compile
* fix xpu compile
* fix inference compile
* fix inference compile
* add custom test
* revert one file
-
- 18 Jul, 2022 2 commits
-
-
Committed by zhouweiwei2014
-
Committed by ronnywang
-
- 15 Jul, 2022 1 commit
-
-
Committed by zhangxiaoci
* update xccl lib
* use separate streams for compute/comm on XPU
* add broadcast op to xpu2_op_list
-
- 14 Jul, 2022 2 commits
-
-
Committed by YuanRisheng
* adapt mkldnn kernel in PHI
* fix ci compile bugs
* fix compile bugs
* fix compile bugs
* fix compile bugs
* fix compile bugs
* delete comment
* fix compile bugs in windows-inference
* delete code for coverage
* modify code by review
* modify code by review
* add todo
* fix compile bugs
* fix compile bugs
* fix compile bugs
* fix unittest bugs
-
Committed by ronnywang
* [CustomDevice] add custom ccl api
* add ut
-
- 13 Jul, 2022 1 commit
-
-
Committed by ronnywang
* [CustomKernel] add capi eager mode support
* add ut
* add capi test
-
- 12 Jul, 2022 1 commit
-
-
Committed by Chen Weihang
* clean glog header in public header
* move macro pos
-
- 06 Jul, 2022 1 commit
-
-
Committed by houj04
-
- 05 Jul, 2022 1 commit
-
-
Committed by ronnywang
* Dataloader add custom device support
* update test=document_fix
-
- 02 Jul, 2022 1 commit
-
-
Committed by Leo Chen
* unify cpu context
* fix init()
* delete test_device_context
* fix test_scalar
-
- 28 Jun, 2022 1 commit
-
-
Committed by zhouweiwei2014
* [Sparse] add SparseTensor mv kernel (csr*dense_vec->dense_vec, coo*dense_vec->dense_vec)
* fix CI
-
- 24 Jun, 2022 2 commits
-
-
Committed by zhouweiwei2014
-
Committed by xiongkun
-
- 18 Jun, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 16 Jun, 2022 1 commit
-
-
Committed by ronnywang
* [CustomKernel] add custom kernel c api
* update
* update
* fix unable to export capi
Co-authored-by: ronny1996 <524019753@qq.com>
-
- 15 Jun, 2022 2 commits
-
-
Committed by zhouweiwei2014
* add some kernel(csr*dense->csr, dense*dense->csr) of SparseTensor matmul
* fix CI
* fix CI
* fix comment
* fix comment
-
Committed by Yiqun Liu
Use int64_t in GetGpuLaunchConfig1D and ElementwiseKernel as index type to support large tensor. (#43506)
* Change some data type from int to int64_t in GetGpuLaunchConfig1D to support large tensor.
* Use int64_t in ElementwiseKernel as index type to support large tensor.
-