- 13 Apr, 2023 27 commits
-
-
Committed by RedContritio
-
Committed by Sanbu
-
Committed by chenxujun
-
Committed by TaoTao Li
* add auto parallel tuner options in launch
* add ut for launch in auto_parallel tuner fix code format
* fix ci-coverage
-
Committed by huangjiyi
* add get_kernels.py
* update
* update
* update
* update
* update
* update test=document_fix
---------
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
-
Committed by zhangyuqin1998
* rename PD_REGISTER_GENERAL_KERNEL
* Update feed_op.cc
* fix
* Update strings_empty_kernel.cc
-
Committed by Vvsmile
* adjust test_elementwise_sub's tolerance, max_relative_error of grad and atol/rtol of output to 1e-3
* fix the dtype in setUp
* fix the elementwise_sub optest
* modify elementwise_sub optest
* fix and add bf16/fp16 to elementwise_sub
* fix elementwise_sub bugs
* fix bugs
* fix elementwise_sub op
* fix the data type
* fix elementwise_sub
* fix elementwise
* fix elementwise_sub
* fix bugs
* fix elementwise sub
* fix elementwise_sub
* remove scalar and vector
-
Committed by Wang Xin
* add autogen code support for expand_as op
* bug fixed
-
Committed by Sanbu
* Support static graph code-gen for yolo_box
* Support static graph code-gen for yolo_box
* Support static graph code-gen for yolo_box
* Update op_compat.yaml
* fix
* fix
-
Committed by HongyuJia
* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
* Add logging.h for profiler.cc
* Add logging.h for gloo_utils.h
* Add logging.h for addmm_kernel_impl.h
* Add logging.h for addmm_grad_kernel_impl.h
* Add logging.h for p_send_kernel.cu
* Add logging.h for determinant_grad_kernel_impl.h
* Add logging.h for p_recv_kernel.cu
* Add logging.h for elementwise_grad_base.h
* Add logging.h for transfer_layout_kernel.cc
* Add logging.h for eigvals_kernel.cc and index_select_impl.h
* Add logging.h for all files in kernel directory
* Add logging.h for xpu_info.cc
* Add logging.h for xpu
-
Committed by zhupengyang
-
Committed by WangZhen
* Fix _param_grad_names when like 'param@GRAD@GRAD'
-
Committed by umiswing
-
Committed by TaoTao Li
-
Committed by niuliling123
-
Committed by liuruyan
-
Committed by Guoxia Wang
-
Committed by jiangcheng
* [CINN] optest add cinn check test
* replace set self.check_cinn to pass check_cinn by function parameter
* fix ci bug
* add cinn atol/rtol
-
Committed by zhangyuqin1998
-
Committed by HongyuJia
-
Committed by lzydev
* fix bug in cross_entropy in static mode
* fix ci-coverage
-
Committed by Yuanle Liu
-
Committed by Kim Yann
* rem cncl in new test
* rem cncl in build sh
* rem cncl in old test
-
Committed by csy0225
-
Committed by RedContritio
-
Committed by risemeup1
* fix ninja error
* fix_ninja_error_qa
-
Committed by gaoziyuan
---------
Co-authored-by: Zhang Jun <ewalker@live.cn>
-
- 12 Apr, 2023 13 commits
-
-
Committed by Wang Xin
* add autogen code for clip_by_norm op
* bug fixed
-
Committed by zqw_1997
* slight modify
* support cuda12+ arch, Hopper arch and discard 30 arch
* add arch 90 for each paddle_known_gpu_archs12
* for comments
-
Committed by ShenLiang
-
Committed by Yulong Ao
* [Auto Parallel] Speedup the completion process
* [Auto Parallel] Skip the property of dist_context when deepcopying
* [Auto Parallel] Remove the unnecessary print
* [Auto Parallel] Move some changes from 2.4 branch to develop
* Update engine.py
* [Auto Parallel] Fix a bug
-
Committed by zhangbo9674
* add Value OpResult OpOperand class
* add Value OpResult OpOperand class
* fix bug
* fix bug
* add utils
* refine code
* add ptr offset and reset method
* add value impl
* fix bug
* refine comment of ValueImpl
* refine code of OpResult
* refine code of Value
* add some comment
* fix cpu compile bug
* refine code
* add op
* add method for op & test value
* refine unittest
* refine code by comment
* refine code
* refine code
* refine code
* refine code
-
Committed by Zhang Zheng
* Optimize performance of unique kernel
* fix ci
-
Committed by gouzil
* [phi] mv sequence_pooling_test
* [test] fix include
-
Committed by zhupengyang
-
Committed by WangZhen
* Eval during train for ResNet
-
Committed by xiaoguoguo626807
-
Committed by liuruyan
-
Committed by YepKong
* add autogen code support for squared_l2_norm_op
* Update ops.yaml
-
Committed by jiangcheng
* [CINN] add python.version.cinn_commit api
* update cinn version get function
* fix cinn_commit in setup.py also need len>0 check bug
-