- 14 4月, 2023 11 次提交
-
-
由 Kim Yann 提交于
-
由 Yiqun Liu 提交于
* Unify the static amp codes of fp16 and bf16. * Polish apis and add unittest. * Add operator stats collecting tools for program. * Add the check of number of bloat16 operators in unittest. * Add warning for operator not supported for amp. * Add testing of BF16 O1 and O2.
-
由 ronnywang 提交于
-
由 zhangbo9674 提交于
* move paddle_ir_test to test_ir * fix bug * fix bug
-
由 huangjiyi 提交于
-
由 huangjiyi 提交于
-
由 zhangyuqin1998 提交于
-
由 huangjiyi 提交于
-
由 huangjiyi 提交于
-
由 risemeup1 提交于
* test * fix test error * fix test error * fix test error
-
由 石晓伟 提交于
-
- 13 4月, 2023 29 次提交
-
-
由 Yuanle Liu 提交于
-
由 Wangzheee 提交于
* Paddle-Trt: Replace fc mul matmul matmul_v2 with matrix_multiply
-
由 jjyaoao 提交于
* remove code with PADDLE_WITH_ASCEND * try pass codestyle
-
由 jjyaoao 提交于
* delete WITH_ASCEND_CL * delete NPU/ and WITH_MLU
-
由 superwinner1 提交于
* No55 add channel_shuffle FP16/BF16 support and tests
-
由 Difer 提交于
* add_fp_bf_for_dot & bf_for_cross * fix error * fix some error * fix some error * change something * fix magic number
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Support fp16&bf16 in reduce_max
-
由 zhoutianzi666 提交于
* fix bilinear_interp_v2 && some other bugs in trt 7011 * add version check in test_trt_convert_bilinear_interp_v2.py
-
由 Leo Guo 提交于
-
由 niuliling123 提交于
-
由 chenxujun 提交于
-
由 zyfncg 提交于
-
由 Zman 提交于
* Add GaussianNLLLoss API. * Change `rotl` `atol`.Check `var` in dynamic graph * remove assertTrue * update unittest * update unittest for ci-covarage.add broadcast with same dim. * Supply static err print. * Repair note and example. * Split unitest. * empty commit. * for standard commit. * for standard commit. * Add int dynamic graph test. * Repair parameters name. * Repair unitest parameters name. * Repair unitest parameters name * Repair unitest parameters name * Repair unitest parameters name * add square in code-block * fit few notes. * fit few notes. * fit few notes. * fit few notes. * add few interpretations. * add few interpretations. * add few interpretations. * fix import. * fix space. * empty commit for ci.
-
由 cyber-pioneer 提交于
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Remove climits. * Fix bug of c_softmax_with_cross_entropy_op. Support ignore_index is negative number.
-
由 csy0225 提交于
-
由 wanghuancoder 提交于
-
由 RedContritio 提交于
-
由 Sanbu 提交于
-
由 chenxujun 提交于
-
由 TaoTao Li 提交于
* add auto parallel tuner options in launch * add ut for launch in auto_parallel tuner fix code format * fix ci-converage
-
由 huangjiyi 提交于
* add get_kernels.py * update * update * update * update * update * update test=document_fix --------- Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
由 zhangyuqin1998 提交于
* rename PD_REGISTER_GENERAL_KERNEL * Update feed_op.cc * fix * Update strings_empty_kernel.cc
-
由 Vvsmile 提交于
* adjust test_elementwise_sub's tolerance, max_relative_error of grad and atol/rtol of output to 1e-3 * fix the dtype in setUp * fix the elementwise_sub optest * modify elementwise_sub optest * fix and add bf16/fp16 to elementwise_sub * fix elementwise_sub bugs * fix bugs * fix elementwise_sub op * fix the data type * fix elementwise_sub * fix elementwise * fix elementwise_sub * fix bugs * fix elementwise sub * fix elementwise_sub * remove scalar and vector
-
由 Wang Xin 提交于
* add autogen code support for expand_as op * bug fixed
-
由 Sanbu 提交于
* Support static graph code-gen for yolo_box * Support static graph code-gen for yolo_box * Support static graph code-gen for yolo_box * Update op_compat.yaml * fix * fix
-
由 HongyuJia 提交于
* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h * Add logging.h for profiler.cc * Add logging.h for gloo_utils.h * Add logging.h for addmm_kernel_impl.h * Add logging.h for addmm_grad_kernel_impl.h * Add logging.h for p_send_kernel.cu * Add logging.h for determinant_grad_kernel_impl.h * Add logging.h for p_recv_kernel.cu * Add logging.h for elementwise_grad_base.h * Add logging.h for transfer_layout_kernel.cc * Add logging.h for eigvals_kernel.cc and index_select_impl.h * Add logging.h for all files in kernel directory * Add logging.h for xpu_info.cc * Add logging.h for xpu
-
由 zhupengyang 提交于
-
由 WangZhen 提交于
* Fix _param_grad_names when like 'param@GRAD@GRAD'
-