- 07 Sep, 2021 9 commits
-
-
Committed by wangxinxin08
* add conv op check for illegal input or attributes
-
Committed by Qi Li
* [NPU] update batch norm op, test=develop
* add NHWC support for bn, test=develop
-
Committed by furnace
* [NPU] fix for test_norm_op_npu
* [NPU] add norm_grad
* [NPU] add CheckAxis for axis
* [NPU] delete debug code
* norm cannot use L2Normalize, norm_grad can use L2NormalizeGrad
* [NPU] delete useless code
* [NPU] optimize norm_grad OpMaker
* Update python import path
-
Committed by Qi Li
* [NPU] log_softmax_grad, test=develop
* remove debug files, test=develop
* update lookup_table_v2 for CANN 5.0.x, test=develop
-
Committed by jakpiase
* fix for reshape2
* added reviewers' suggestions
-
Committed by Aurelius84
* fix commit
* Open unittest
* fix unittest on Windows
* fix constructor
-
Committed by Aurelius84
* open test_resnet_amp on Windows
* disable on Windows CPU CI for timeout
* disable on Windows CPU CI for timeout
* fix code style
-
Committed by wawltor
* transfer the static.accuracy to v2 api
* remove the unused code
-
Committed by xiayanming
* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
* [HIP] fix op not support AMD GPU bug
-
- 06 Sep, 2021 6 commits
-
-
Committed by wangguanzhong
* support double in deformable conv
* add double for dcn v2
-
Committed by Wei Shengyu
* add pool2d grad grad
* dbg
* add unittest
* update format
* add more unittests
* dbg
-
Committed by Double_V
* add kernel, stride check
* add unittest for param out of range
* delete max limit check
-
Committed by heliqi
* add depthwise_conv_npu_grad op
* add depthwise_conv_npu_grad op
* add depthwise_conv_npu_grad op
* add NHWC test case
-
Committed by WeiXin
* support numpy dtype and polish code of list index.
* polish code.
-
Committed by Wilber
-
- 05 Sep, 2021 1 commit
-
-
Committed by furnace
* add broadcast to where op when x_shape == y_shape and x_shape != cond_shape
* add static api tests, and delete debug code
-
- 04 Sep, 2021 1 commit
-
-
Committed by Wilber
-
- 03 Sep, 2021 11 commits
-
-
Committed by Aurelius84
-
Committed by lilong12
* support lodtensorarray
-
Committed by zhulei
* [NPU] Add huber_loss op
* [NPU] Add huber_loss op
* [NPU] Add huber_loss op
* [NPU] Add huber_loss
-
Committed by Qi Li
* [NPU] add int64 kernels, test=develop
* update ci scripts to be able to turn WITH_ASCEND_INT64 on, test=develop
-
Committed by Jack Zhou
* Add AsExtra for transpose
* add AsExtra for lstm op
* add AsExtra for gru
-
Committed by Fan Zhang
* [iscan] bugfix
* test_standalone_executor modify
-
Committed by heliqi
* add conv2d_transpose npu op
* Copyright 2020 to 2021
* add fp32
* delete repeat test case
* delete repeat test case
* fix paddle.NPUPlace
-
Committed by danleifeng
-
Committed by 0x45f
-
Committed by WJJ1995
* add elementwise_pow_grad_npu
* fixed bug for CI
* deal with comments
* fixed bug for CI
* deal with comments
-
Committed by 沉潜的鱼儿
* add log_softmax_op_npu
* log_softmax_op_v1
* import test_log_softmax_grad
-
- 02 Sep, 2021 5 commits
-
-
Committed by JingZhuangzhuang
* [NPU] Support npu kernel for gather_nd op
* [NPU] Support npu kernel for gather_nd op
* [NPU] Support npu kernel for gather_nd and gather_nd_grad op
* update py format error.
* modify gather_nd_op_npu
* modify gather_nd 910 test
* modify gather_nd 910 test

Co-authored-by: Nxiaoxiaohehe001 <hiteezsf@163.com>
-
Committed by xiongkun
* Add SVD Op and its GPU and CPU kernel
* Remove CUDAPlace in test_svd_op, make the test available in CPU package
* modify the file
* fix windows bug / fix ROCM / fix test timeout
* for passing the CIs
* improve error report
* for code review
* some modification to test_svd_op
* change python code style
* expose the svd interface for document
-
Committed by zhulei
* [NPU] Add label_smooth_op
* [NPU] Add label_smooth_op
-
Committed by JZ-LIANG
* support shard reader
* support shard reader
* add parallel mode
* update process mesh
* add method to compute comm_group
* implement dist_embedding forward func
* implement dist matmul forward func
* implement dist reshape forward func
* add transpiler framework
* add transpiler forward
* implement transpiler forward
* implement transpiler backward & update
* add process
* add unittest
* chmod
* chmod
* chmod
* update unittest
* add unittest for gpt
* remove unused print
* rename transpiler --> partitioner
* rename transpiler --> partitioner
* chmod
* chmod
* bug fixed
* remove amp function
* update case for dp mode
* update case for dp mode
-
Committed by Baibaifan
-
- 01 Sep, 2021 7 commits
-
-
Committed by jakpiase
* added slice FWD FP32
* added tests for slice FWD FP32
* added slice bwd
* added bf16 tests
* CI fix
* CI fix
* added reason to skip_if
* minor change
* temporary fix for failing test
* temporary fix
* changes after review
* CI rerun
-
Committed by ShenLiang
* add cache for send_recv
* add eval_batch for pipeline
* add eval batch for pipelineparallel
* add style code
-
Committed by baoachun
* add strided_slice_grad op for npu
-
Committed by Leo Chen
* support setting linewidth when printing tensor
* fix ut
* refine code
* update comments
* use small precision since windows/linux have different random values
* fix typo
* adjust parameter order for consistency
-
Committed by 0x45f
* modify dy2stat error message in compile time
* fix variable name
-
Committed by WeiXin
* fix bug: when axes in paddle.slice is a tuple, an error occurs.
* polish code.
-
Committed by QingshuChen
* support KL label smooth
* update UT for KL label_smooth
-