- 07 9月, 2021 19 次提交
-
-
由 niuliling123 提交于
-
由 Pei Yang 提交于
-
由 Qi Li 提交于
* [NPU] update batch norm op, test=develop * add NHWC support for bn, test=develop
-
由 XiangGao 提交于
Co-authored-by: Nroot <root@bjyz-sys-gpu-kongming9.bjyz.baidu.com>
-
由 Aurelius84 提交于
* Add DPADDLE_WITH_CUDA for GCC * polish code
-
由 YUNSHEN XIE 提交于
-
由 furnace 提交于
* [NPU] fix for test_norm_op_npu * [NPU] add norm_grad * [NPU] add CheckAxis for axis * [NPU] delete debug codes * norm can not use L2Normalize, norm_grad can use L2NormalizeGrad * [NPU] delete useless codes * [NPU] optimize norm_grad OpMaker * Update python import path
-
由 Qi Li 提交于
* [NPU] log_softmax_grad, test=develop * remove debug files, test=develop * update lookup_table_v2 for CANN 5.0.x, test=develop
-
由 Jacek Czaja 提交于
* - refactoring progressing - Fix - compilation fix - another compilation fix - refactoring * - fix * - compilation fix * - compilation fix * - missing set_format * - compilation fix * - reverted setting memeory format * - Brought back format * - Fix * - fixes after review * CI rerun * CI rerun
-
由 jakpiase 提交于
* fix for reshape2 * added reviewers sugestions
-
由 ceci3 提交于
-
由 dyning 提交于
* operators/flatten_op.cc add AsExtra * operators/flatten_op.cc add AsExtra * fix format
-
由 XiangGao 提交于
* add AsExtra in data_norm op * pass data_layout from python to data_norm op * fix data_layout in data_norm op Co-authored-by: Nroot <root@bjyz-sys-gpu-kongming9.bjyz.baidu.com>
-
由 Aurelius84 提交于
* fix commit * Open unittest * fix unittest on Windows * fix constructor
-
由 Sing_chan 提交于
-
由 Peihan 提交于
* notest,test=inference;support test different suite type * notest,test=inference;fix script bugs * notest,test=inference;fix count time issue * test=document_fix; fix readme grammar
-
由 Aurelius84 提交于
* open test_resnet_amp on Windows * disable on Windows CPU CI for timeout * disable on Windows CPU CI for timeout * fix code style
-
由 wawltor 提交于
* transfer the static.accurcay to v2 api * remove the unused code
-
由 xiayanming 提交于
* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid * [HIP] fix op not support AMD GPU bug
-
- 06 9月, 2021 14 次提交
-
-
由 wangguanzhong 提交于
* support double in deformable conv * add double for dcn v2
-
由 wawltor 提交于
* Add the extra flag for the some ops * fix the compile problem in matmul extra
-
由 joanna.wozna.intel 提交于
* Add fusion_lstm INT8 PTQ * Correct mkldnn_cache_capacity and enable fc_lstm_fuse_pass only for this test * Change mkldnn_cache_capacity
-
由 Wei Shengyu 提交于
* add pool2d grad grad * dbg * add unittest * update format * add more unittests * dbg
-
由 feng_shuai 提交于
* transpose/slice/stride_slice/squeeze/unsqueeze op_def_enhance-1.0 * delete infer_flags and decrease-axis * delet infer_flags and decrea_axis
-
由 Double_V 提交于
* add pool2d pool3d extra() * delete ceil_mode extra() * delete ceil_mode extra() * delete ceil_mode extra() * add extra to use_mkldnn
-
由 Double_V 提交于
* add kernel, stride check * add unitest for param out of range * delete max limit check
-
由 Aurelius84 提交于
* Support Reset for DeviceEvent * fix code * add more unittest
-
由 wangxinxin08 提交于
* add AsExtra tag for conv transpose op * check the existence of use_cudnn before get this attribute
-
由 heliqi 提交于
* add depthwise_conv_npu_grad op * add depthwise_conv_npu_grad op * add depthwise_conv_npu_grad op * add NHWC test case
-
由 WeiXin 提交于
* support numpy dtype and polish code of list index. * polish code.
-
由 Feng Xing 提交于
This PR adds error exception in fused transformer python interface. The function body are not implemented (will be implemented later). Following zhiqiu's comment in previous PR-35206 (merged already), it is better to raise an exception instead of using "pass".
-
由 Yuang Liu 提交于
-
由 Wilber 提交于
-
- 05 9月, 2021 1 次提交
-
-
由 furnace 提交于
* where op add broadcast, when x_shape == y_shape, and x_shape != cond_shape * add static api tests, and delete debug codes
-
- 04 9月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 03 9月, 2021 5 次提交
-
-
由 Aurelius84 提交于
-
由 wanghuancoder 提交于
* modify gc logic, use new device_event, test=develop * use GenerateDeviceEventFlag, test=develop * refine, test=develop * fix test_standalone_executor.py, test=develop * refine, test=develop
-
由 lilong12 提交于
* support lodtensorarray
-
由 zhulei 提交于
* [NPU] Add huber_loss op * [NPU] Add huber_loss op * [NPU] Add huber_loss p[ * [NPU] Add huber_loss
-
由 Qi Li 提交于
* [NPU] add int64 kernels, test=develop * update ci scripts to be able to trun WITH_ASCEND_INT64 on, test=develop
-