- 09 4月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
* Register exp/expm1/logit bf16 activation op kernels (#48702) * register more bf16 ops * update to register coresponding backward ops * Addition of bf16 type support for Compare OP (#46413) * first commit * clarify the quotes * change code style format * support bfloat16 * add bfloat16 support for more ops (#48272) * [Bfloat16]register bfloat16 datatype for squared l2 norm (#50908) * Sync the pull request #51903. * Add some header files back. * modify cmake file for cuda11.8 compile (#49020) * modify cmake file for cuda11.8 compile * add op_library(fused_embedding_eltwise_layernorm_op DEPS bert_encoder_functor) * Fix compling error. * Cherry-pick pull request #51396. --------- Co-authored-by: Nsneaxiy <32832641+sneaxiy@users.noreply.github.com> Co-authored-by: Nlimingshu <61349199+JamesLim-sy@users.noreply.github.com> Co-authored-by: Shaojie WANG <wsjmessi@163.com> Co-authored-by: Nzqw_1997 <118182234+zhengqiwen1997@users.noreply.github.com>
-
- 02 9月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* [XPU]Migrate Adam XPU kernel into Phi * test=kunlun
-
- 01 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 19 8月, 2022 1 次提交
-
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun
-
- 15 8月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 09 8月, 2022 1 次提交
-
-
由 yeliang2258 提交于
-
- 08 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* clean tensor.h * fix gather_nd
-
- 03 8月, 2022 1 次提交
-
-
由 z8hanghuan 提交于
* add sequence_unpad for xpu,*test=kunlun * add sequence_unpad, *test=kunlun * fix bug in testcase,should not be sequence_pad,*test=kunlun
-
- 02 8月, 2022 1 次提交
-
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun
-
- 01 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 29 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile
-
- 28 7月, 2022 1 次提交
-
-
由 Charles-hit 提交于
* Move frame OP to phi、add frame OP yaml config and supplement single test * add Header file of in_dygraph_mode * Modify variable name and FrameGradInferMeta multiplex UnchangedInferMeta * move seq2col to phi
-
- 25 7月, 2022 1 次提交
-
-
由 lyq 提交于
-
- 13 7月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* [Phi] Migrate matrix_solve to phi * replace mutable_data with Alloc
-
- 05 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 04 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 02 7月, 2022 2 次提交
- 28 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* boost::variant -> paddle::variant * boost::variant.apply_visit -> paddle::visit * Update pybind_boost_hraders.h * Fix CINN compilation errors * Revert FetchResultType
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 18 6月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 14 6月, 2022 1 次提交
-
-
由 Zhang Jun 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 25 5月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix maybe-uninitialized warning * fix compile * fix xpu compile * fix npu compile * fix infer compile * fix compile * fix compile
-
- 12 5月, 2022 1 次提交
-
-
由 Shuangchi He 提交于
-
- 16 4月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 01 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add cross_entropy_with_softmax phi kernel * remove softmax_with_cross_entropy kernel * add softmax_with_cross_entropy grad kernel * remove original op kernel * refine cross entropy impl * fix pointer error * revert kernel cu change * fix xpu failed * fix cinn failed * fix npu failed * add forward sig * add check_nan_inf for pt kernel * remove repeat cmake item * fix unittest error
-
- 30 3月, 2022 1 次提交
-
- 29 3月, 2022 1 次提交
-
-
由 tianshuo78520a 提交于
This reverts commit e77a947e.
-
- 28 3月, 2022 3 次提交
-
-
由 hong 提交于
* move momentum, rmsprop to phi; test=develop * update * update * update * update * udpate; test=develop * fix xpu npu bugs; test=develop * fix npu bug; test=develop * fix windows compile error; test=develop * fix windows compile error; test=develop * polish code; test=develop * fix conflict; test=develop * add meshgrid; * update * polish code * polish code; * fix bug * format; remove useless code * fix npu bug * fix bug
-
由 hong 提交于
* update * add forward case * update * update; test=develop * add some grad kernel; test=develop * move gpu kernel; test=develop * update * update; * update test; * fix selected rows bug; * add mix vector include ; * add mixed vector depen; test=develop * add logit grad signature; * polish code * fix bug; * add namespace for abs * revert code * not move softsign * revmove duplate register; * fix softsign bug * polish code * format * format * fix bug * remove cmake dep * add square sqrt selected rows support * update * remove clip norm * add standalone executor sqrt dep * standalone exec denp sqrt * remove sqrt op in cmkaelist * open some case
-
由 0x45f 提交于
* moving OP * move forward * move grad and infershape * code format * format code * fix code * fix code * fix CMakerLists.txt * fix comments * Refine CMakeLists for rocm ci
-
- 25 3月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* [Phi] Migrate Adam and Adamw into Phi * fix compile error and unittest ok * fix compile error and unittest ok * fix undefined reference to fLI::FLAGS * test depend on operator * fix cmake * fix xpu compile * fix infrt * fix amp_type_traits * fix amp_type_traits * modify according reviewer * modify according reviewer * fix dtype float16 * fix typo * fix Cmake * fix code style
-
- 21 3月, 2022 1 次提交
-
-
由 zn 提交于
-
- 15 3月, 2022 1 次提交
-
-
由 xiongkun 提交于
[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770) * tranfer and pass the lgamma unittest * merge and pass the test * transfer kldiv_loss and kldiv_loss_grad; pass the unitest * trafer the isclose and cumprod kernel * change PT_REGISTER -> PD_REGISTER * fix by code review * fix by code review * fix * remove enforce include dependence from scalar * fix * fix by code review * fix by code review
-
- 14 3月, 2022 2 次提交
-
-
由 zhouweiwei2014 提交于
-
由 From00 提交于
* Move Pool OPs to phi * Fix CI error * Fix conflicts
-
- 10 3月, 2022 1 次提交
-
-
由 Zhong Hui 提交于
* move segment_pool to phi. * mark summed ids as optional tensor. * fix as reviews.
-
- 08 3月, 2022 1 次提交
-
-
由 Yang 提交于
* [Phi] move put_along_axis/take_along_axis/maxout * use phi::Copy
-