- 18 3月, 2022 7 次提交
-
-
由 ShenLiang 提交于
* fix bug * fix bug
-
由 Zhanlue Yang 提交于
* Refactored Final State Python-C Code Generation Scripts. * Bug fix
-
由 lilong12 提交于
-
由 chentianyu03 提交于
* move reduce_mean_grad kernel into phi * move reduce_max/min_grad into phi * remove raw max/min grad kernel * fix bug * fix max/min grad error * move all reduce_grad kernel into one file * add prod grad kernel * add infermeta for prod kernel
-
由 furnace 提交于
[NPU] fix fp16 (PART II)
-
由 王明冬 提交于
* [infrt] rename pd dialect from mlir to infrt. test=develop * [infrt] fix the kernel signature generator bug.
-
由 Zhang Zheng 提交于
* Optimize perf of softmax_with_cross_entropy_bwd * fix * fix
-
- 17 3月, 2022 19 次提交
-
-
由 Chen Weihang 提交于
* move assign kernel init commit * change vec<tensor> to vec<tensor*> * support tensor array * support api declare * fix test_list failed * fix npu and xpu failed * fix infrt failed * remove assign array size in operator * move assign sr header into sr dir * add infermeta for assign * test op success * fix test_list failed * fix kunlun failed * add set host allocator in tests * support tensor array in arg ctx * open set layout in share_meta * fix meta tensor layout error * fix test failed
-
由 seemingwang 提交于
* extract sub-graph * graph-engine merging * fix * fix * fix heter-ps config
-
由 TeFeng Chen 提交于
-
由 heliqi 提交于
* add onnxruntime predictor * Add code comments * support link paddle2onnx onnxruntime * support onnxruntime with python * support onnxruntime with python * support onnxruntime with windows * paddle2onnx compile with windows * supoort windows compile * supoort windows compile with onnxruntime * supoort windows compile with paddle2onnx * supoort mac compile * compile with mac * compile with mac * add code comments * fix remind word * code optimization * add test case * add test case * add inference demo_ci test case * fix compile paddle2onnx with no python * add inference demo_ci test case * add inference demo_ci test case * add inference infer_ut test case * support c go api and test cases * add converage test case * add converage test case * add capi test case * add capi test case * fix onnxruntime copyfromcpu and copytocpu * fix goapi * modify code
-
由 Qi Li 提交于
-
由 Weilong Wu 提交于
* [Eager] Support eager grad interface, draft version * Support eager grad interface with allow_unused and multi startup_op * Fix code format * Fix allow_unused case, return PyNone if tensor not initialize * Support output's stop_gradient related to create_graph * Support grad exception case in eager mode, fix coverage CI * Update ToPyObject, return PyNone if not initialize * AccumulationNode add FLAGS_retain_grad_for_all_tensor * Fix ci issue * Fix CI issue * fix, use core.eager.Tensor * Add func SetBufferSlotRankZeros for GradTensorHolder * Support retain_graph by using ClearTensorWrappers * Support retain_graph by using ClearTensorWrappers * Update retain_graph and no_grad_vars related test case * Update code gen logic for ClearTensorWrappers * Fix by override statement * fix override func args * Support retain_graph, update unit tests * Updated ClearTensorWrappers logic * fix grad python interface * Use deep copy and update unit tests * Polish code * Polish code * Fix CI issue, Deep copy only use when user set grad_tensors * Fix CI, use Backward instead RunBackward * Fix CI, Declare kernel explicitly in test file * Polish, remove vector of TensorWrapper * Refactor the logic of grad/backward, polish codes * Update code after merge upstream develop * Polish after merge upstream develop * Update to adapt new GradNodeBase superclass * Fix error introduced during conflict resolution * Update purify potential_startup_nodes logic * Fix errors * Polish code * Remove useless args for ToPyObject * Remove useless TensorWrappersSet * Fix code-format, re-install pre-commit * Fix pre-process logic for potential_startup_ops * Update unit tests, use eager mode * Fix conflicts
-
由 YuanRisheng 提交于
-
由 zyfncg 提交于
* move roi_pool forward kernel to phi * move roi_pool_grad to phi * fix compile bug * fix compile bug * fix register data_type
-
由 hong 提交于
* update * fix bugs; test=develop * update; test=develop * fix test compile error; test=develop * fix cpu compile error; test=develop * fix test error; test=develo * fix layer_norm_op plugin error; test=develop * fix error; test=develop * fix test bug; test=develop * update; test=develop * polish code; test=develop * fix bugs; test=develop * remove unused depency; test=develop * polish code; test=develop
-
由 zyfncg 提交于
-
由 YuanRisheng 提交于
-
由 Zhang Zheng 提交于
* [Phi]Move infershape of top_k/expand_as/kron/searchsorted to phi * add set_dtype * fix order
-
由 Yuang Liu 提交于
-
由 baoachun 提交于
-
由 Weilong Wu 提交于
* [Eager] Support eager grad interface, draft version * Support eager grad interface with allow_unused and multi startup_op * Fix code format * Fix allow_unused case, return PyNone if tensor not initialize * Support output's stop_gradient related to create_graph * Support grad exception case in eager mode, fix coverage CI * Update ToPyObject, return PyNone if not initialize * AccumulationNode add FLAGS_retain_grad_for_all_tensor * Fix ci issue * Fix CI issue * fix, use core.eager.Tensor * Add func SetBufferSlotRankZeros for GradTensorHolder * Support retain_graph by using ClearTensorWrappers * Support retain_graph by using ClearTensorWrappers * Update retain_graph and no_grad_vars related test case * Update code gen logic for ClearTensorWrappers * Fix by override statement * fix override func args * Support retain_graph, update unit tests * Updated ClearTensorWrappers logic * fix grad python interface * Use deep copy and update unit tests * Polish code * Polish code * Fix CI issue, Deep copy only use when user set grad_tensors * Fix CI, use Backward instead RunBackward * Fix CI, Declare kernel explicitly in test file * Polish, remove vector of TensorWrapper * Refactor the logic of grad/backward, polish codes * Update code after merge upstream develop * Polish after merge upstream develop * Update to adapt new GradNodeBase superclass * Fix error introduced during conflict resolution * Update purify potential_startup_nodes logic * Fix errors * Polish code * Remove useless args for ToPyObject * Remove useless TensorWrappersSet * Fix code-format, re-install pre-commit * Fix pre-process logic for potential_startup_ops * Update unit tests, use eager mode
-
由 Jiabin Yang 提交于
* fix copy_ problem by doing it with phi copy * improve test coverage * refactor copy with sr kernel
-
由 Chen Weihang 提交于
-
由 Leo Chen 提交于
* Move the computation of moving average scale to device * Use register to save local maximum in a thread
-
- 16 3月, 2022 14 次提交
-
-
由 Chen Weihang 提交于
-
由 Lijunhui 提交于
* init commit * correct namespace
-
由 From00 提交于
* Add yaml config for pool2d * Fix CI error * Fix code format error
-
由 Aurelius84 提交于
* [Phi] Migrate mode_op and mode_grad_op into Phi * fix omp * add ifdef * migrate infershape * modify according reviewer
-
由 piotrekobi 提交于
* Refactor elementwise op grad classes * Add more refactor changes * Revert set layout and format deletion * Fix failing elementwise test
-
由 Zuza 提交于
* Quantize elementwise mul op * Parametrize elementwise functions * Fix code formatting
-
由 ronnywang 提交于
-
由 zlsh80826 提交于
-
由 zyfncg 提交于
* move roi_align_grad kernel * move roi_align grad kernel and infershape to phi * remove roi_align infershape
-
由 chenenquan 提交于
* [PHI] Migrate roll op * 【phi】migrate eigh op to phi (#40213) * migrate eigh to phi * optimize code * modify code according to comment * conflict resolution * [PHI] Migrate roll op * [PHI] Fix converage of roll_sig * [PHI] Fix infermate of roll_sig * [Phi] Fix unittest coverage of roll op * [PHI] Fix infermeta in unary * [PHI] Fix parameter type of roll op * [PHI] Fix parameter type of roll op * [PHI] Fix parameter of roll op Co-authored-by: Ncrystal <62974595+Zjq9409@users.noreply.github.com>
-
由 chenenquan 提交于
* [PHI] Migrate index_select op * [PHI] Fix bug in test_variable * [PHI] migrate index_select op
-
由 YuanRisheng 提交于
-
由 Liu-xiandong 提交于
* [kp]fix bug that cannot fallback to CPU normally in XPU KP * fix bug in static graph
-
由 caozhou 提交于
* migrate multiplex op kernel * migrate qr cpu kernel * migrate tril_triu op kernel * fix multiplex kernel * add kernel sig * fix dependence and bug * fix multiplex error * fix npu include error * fix conflict * fix conflict and delete tril_triu * fix date and multiplex input * adapt header file order * fix header file include * fix conflict * delete cholesky_solve_op.h * delete triangular_solve_op.h
-