- 14 Mar, 2023 22 commits
-
-
Committed by engineer1109
-
Committed by chenxujun
-
Committed by ccrrong
* add split_with_num composite rule * add split_with_num composite rule * add split composite rule * update * update test * update test * delete split_with_num_grad
-
Committed by engineer1109
fix abi fix tab
-
Committed by limingshu
* first commit * fix code bugs in for_loop * fix bugs in cuLoadAddStridedInputs. * optimization for LayerNormBackwardComputeGradInput * add unitest for validating the optimization * fix windows ci error
-
Committed by gouzil
-
Committed by pangyoki
* cuda graph support multi-stream for new executor * fix windows compile error * delete create_cuda_graph_stream
-
Committed by wangxiaoning
-
Committed by wangxiaoning
-
Committed by Infinity_lee
-
Committed by zhiboniu
* add fp16 and bf16 test * update
-
Committed by Ackeraa
add register of select Co-authored-by: wqgo <1552367872@qq.com>
-
Committed by HongyuJia
-
Committed by cxxly
-
Committed by cxxly
-
Committed by cxxly
-
Committed by zhangbo9674
* add builtin-type DenseTensorType Float16Type Float64Type Int16Type Int64Type * refine comment * refine comment * add classof for Type class * refine test code * add get param func for DenseTensorType * add dyn_cast and refine isa * set default WITH_NEWIR=OFF * refine cast_utils * Refine code by comment * refine code by comment * refine code by comment * refine code by comment * fix bug of dyn_cast * set WITH_NEWIR=OFF * refine code by comment
-
Committed by Huang Jiyi
* remove device_context include * fix bug * fix bug
-
Committed by Sonder
-
Committed by Zhang Ting
-
Committed by denglianbin
* finish task * add static_check and fix unittest. * add int32/64 * Update test_cross_op.py --------- Co-authored-by: Zhang Ting <Douyaer2020@qq.com>
-
Committed by Infinity_lee
-
- 13 Mar, 2023 18 commits
-
-
Committed by lubiu
-
Committed by TaoTao Li
* add all_gather and fix conflicts * fix code format * fix ut * fix broadcast ut
-
Committed by duanyanhui
-
Committed by heyanru
* refresh * compat * register * testop * fix * fix * fox * cast * cast * fix * type * fix * out * cast * fix * fix * fix * broad * broad * broad * fix * fix * fix * fix * fix * broad * broad * numel * fix * fix * fix * fix * cinn * fix * fix * fix * fix
-
Committed by ronnywang
* add UpdateWaitChain for process_group_custom * add UpdateWaitChain for process_group_custom
-
Committed by shentanyue
[Lite] Change the source code integration of Paddle Lite to the compilation library integration (#51405)
-
Committed by YuanRisheng
* remove transpose infershape * fix ci bugs * fix ci bugs * delete transpose infershape * fix ci bugs * fix ci bugs
-
Committed by iSerendipity
* Replace paddle::experimental::DataType as phi::DataType * restore custom_device.cc
-
Committed by Zhenghai Zhang
* Add output defs for mode kernel * fix bug
-
Committed by wangxiaoning
* add fp16/bf16 * add grad bf16 * test name
-
Committed by Huang Jiyi
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext * replace platform::TraceEventCollector
-
Committed by iSerendipity
* remove fused_matmul from list * add infermeta for fused matmul
-
Committed by Huang Jiyi
* add from_blob * fix test * fix test * fix codestyle * add gpu test * fix test * update * add comment * fix comment * update comment * fix CI bug * add thread_local * update * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix cmake * fix CI-Py3 make * update * use api_reg * fix include * update * update * update * fix bug * fix bug * fix bug * fix bug
-
Committed by Sławomir Siwek
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
Committed by engineer1109
-
Committed by Sanbu
* Add output defs for conv3d_coo distribute_fpn_proposals kernel * fix
-
Committed by wenbin
* squeeze2_op * add ut * fix ut * fix static * modity ut
-
Committed by risemeup1
-