- 28 9月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
由 Sławomir Siwek 提交于
* Relu6 * remove fluid handler * add individual kernel signature * coding style * replace bounded_relu with clip * whitespace * code style
-
- 26 9月, 2022 1 次提交
-
-
由 cifar10 提交于
-
- 25 9月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 22 9月, 2022 1 次提交
-
-
由 Chenxiao Niu 提交于
-
- 18 9月, 2022 1 次提交
-
-
由 RichardWooSJTU 提交于
-
- 16 9月, 2022 5 次提交
-
-
由 sneaxiy 提交于
* support int64 non-broadcast * support broadcast case for int64 index * fix bug * support more Arity * remove some codes * upgrade patchelf to v0.15.0 to pass CI build * fix bug * fix patchelf installation * add debug flags * remove useless codes * fix viterbi_decode and set_value op uts * remove always enable int64
-
由 chenjian 提交于
* avoid to synchronize all devices * synchronize custom device
-
由 JingZhuangzhuang 提交于
-
由 Leo Chen 提交于
* add interpretercore for jit engine * add ut
-
由 ronnywang 提交于
* [CustomDevice] add custom_device_resource_pool & device_event_custom_device * update * update * update * update
-
- 15 9月, 2022 2 次提交
-
-
由 Jacek Czaja 提交于
* - mul & matmul changes - fix - bs16 correction of strides * - cosmetic fixes * - lint * - fix * - fix * - format -> mem_desc * - fix * - fix * - fix * - fix * - fix
-
由 Nyakku Shigure 提交于
-
- 14 9月, 2022 2 次提交
-
-
由 JingZhuangzhuang 提交于
* merge python lib * Update third_party.cmake * Update CMakeLists.txt
-
由 JingZhuangzhuang 提交于
* Delay TensorRT registry * Add unused define * Fix TensorRT test * fix function to reference * Update trt_plugin.h
-
- 09 9月, 2022 3 次提交
-
-
由 Leo Chen 提交于
* add operator<< for BuildStrategy * add fake_coalesce * fit allreduce mode for new_exe * remove dubeg code * follow comments
-
由 ronnywang 提交于
* [CustomDevice] add dy2static support * update
-
由 Chenxiao Niu 提交于
-
- 08 9月, 2022 2 次提交
-
-
由 chenjian 提交于
-
由 taixiurong 提交于
* add gemm_epilogue * xpu-paddlepaddle-40 [任务] fused_gemm_epilogue 支持 test=kunlun
-
- 07 9月, 2022 1 次提交
-
-
由 houj04 提交于
-
- 06 9月, 2022 1 次提交
-
-
由 chenjian 提交于
* update protobuf format * fix protobuf content * fix file mode * fix compiling error when gpu not exists * fix compiling error when gpu not exists * fix compiling error when gpu not exists * fix compiling error when gpu not exists * support rocm
-
- 05 9月, 2022 3 次提交
-
-
由 piotrekobi 提交于
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * Move classes from mkldnn_reuse.h to onednn_reuse.h * Move more functions from mkldnn_helper.h to onednn_helpper.h * Change MKLDNN to OneDNN in VLOG message Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 chalsliu 提交于
-
由 sneaxiy 提交于
-
- 02 9月, 2022 1 次提交
-
-
由 kangguangli 提交于
-
- 01 9月, 2022 3 次提交
-
-
由 houj04 提交于
-
由 taixiurong 提交于
test=kunlun
-
由 Leo Chen 提交于
* refine cmake of framework * add deps for dense tensor * fix deps * remove alloc(ctx) * add depends on mkldnn
-
- 29 8月, 2022 3 次提交
-
-
由 Sławomir Siwek 提交于
* abs relu6 fwd * abs bwd * gaussian_random_kernel and mkldnn-onednn renaming * scale kernel * whitespace * whitespace * revert scale migration * whitespaces * revert changes to gaussian kernel * whitespaces
-
由 Allen Guo 提交于
* support depthwise_conv2d ops Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> * fix duplicate name Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
-
由 Allen Guo 提交于
-
- 26 8月, 2022 1 次提交
-
-
由 houj04 提交于
-
- 25 8月, 2022 2 次提交
-
-
由 hong 提交于
* optimizer conv alog speed * code polish * remove useless code * fix compile error * fix cpu compile error * not use cudnn alog t * add search cache max number * polish code * fix cache test bug * add groups data format to conv args * fix cache test bug * fix cudnn_deterministic bug * fix test switch auto tune bug * fix test swith autotune bug; * fix conv cache bug * fix cache test error * fix cache test bug * fix windows mac compile error * fix workspace search error * update cudnn cache * fix cache test bug; test=develop * fix autotune swith test error * polish code * oplish code
-
由 haosicheng 提交于
-
- 24 8月, 2022 1 次提交
-
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun
-
- 23 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add profiler apis * migrate CalculateEstOccupancy into cuda_tracer * update * add ut
-
- 22 8月, 2022 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add int8 support for matmul+elementwiae_add fuse * Corrections after review and ernie test fix
-
- 19 8月, 2022 2 次提交
-
-
由 houj04 提交于
-
由 dongfangshenzhu 提交于
* add merged_momentum *test=kunlun * add merged_momentum *test=kunlun * add fp16 to merged_momentum,*test=kunlun * change dist_model.cc * add merged_momentum unittest and change momentum,test=kunlun * add merged_momentum unittest and change momentum,test=kunlun * add merged_momentum unittest and change momentum,test=kunlun * add merged_momentum unittest and change momentum,test=kunlun
-