- 19 12月, 2022 7 次提交
-
-
由 huangjiyi 提交于
* move gather_scatter_kernel from fluid to phi * mv gather_scatter_kernel to gather_scatter_functor
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 Wen Sun 提交于
-
由 Wangzheee 提交于
* restart looup_table_v2
-
由 Wangzheee 提交于
* General optimization for no_varlen embedding layernorm
-
由 huangjiyi 提交于
* move maxouting from fluid to phi * move matrix_bit_code from fluid to phi * replace mutable_data and fix include * fix include * move gather_scatter_kernel from fluid to phi * Revert "move gather_scatter_kernel from fluid to phi" This reverts commit 3d0b1eaf179656072e8c483dfca688cccccdda01.
-
- 17 12月, 2022 2 次提交
-
-
由 Wen Sun 提交于
-
由 xiaoxiaohehe001 提交于
-
- 16 12月, 2022 4 次提交
-
-
由 Wen Sun 提交于
-
由 hong 提交于
* change staticRNN to while * update code * fix rnn bug * update * fix _find_op_path_ bugs in append_backward. * polish code * revert op proto * update * udpate while * format * revert test while loop op * fix create array * fix windows error * fix bug * update * fix array write bug Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
由 Yuanle Liu 提交于
-
由 Ruibiao Chen 提交于
-
- 15 12月, 2022 9 次提交
-
-
由 zyfncg 提交于
* add validity check for config in yaml * delete debug log
-
由 Zhang Jun 提交于
* move IsFloatVar() from tensorrt/ to api/
-
由 huangjiyi 提交于
-
由 zlsh80826 提交于
-
由 Sławomir Siwek 提交于
* fix wrong handler name * mkldnn_engine -> onednn_engine * remove fluid/errors.h imports * remove fluid/enforce.h imports * remove note and unnecessary import * remove fluid/pretty_log.h imports * remove fluid/place.h imports * remove fluid/data_layout_transform.h imports * remove fluid/device_context.h imports * remove mkldnn_helper code * remove fluid/mkldnn_reuse.h imports * pretty_log import
-
由 Ruibiao Chen 提交于
-
由 Wangzheee 提交于
-
由 Wilber 提交于
* memory_optimize and mkdlnn problem * update * update * update
-
由 Wen Sun 提交于
-
- 14 12月, 2022 8 次提交
-
-
由 Ming-Xu Huang 提交于
-
由 Yuanle Liu 提交于
-
由 limingshu 提交于
* First Commit. * add some codes * add elementwise loader * fix code styles * merge with develop * add some changes both in elementwise and transpose * add init operation in broadcast kernel. * change codes according to pr suggestions about transpose file * fix error for op-benchmark ci * fix according to ci
-
由 james 提交于
* nullptr bugfix for XPU pg mode Also a few kernels is added to xpu whitelist * increase error msg length
-
由 zqw_1997 提交于
* modify cmake file for cuda11.8 compile * add op_library(fused_embedding_eltwise_layernorm_op DEPS bert_encoder_functor)
-
由 Hulek 提交于
* Deleted mkldnn_inplace_pass code * Fixed error with cmake * Resolve conflicts
-
由 Zhang Jun 提交于
* add more unary op and square
-
由 Yiqun Liu 提交于
-
- 13 12月, 2022 6 次提交
-
-
由 joanna.wozna.intel 提交于
-
由 sneaxiy 提交于
* save fused_attention memory when dropout_rate = 0.0 * add ut * fix ut bug * fix fused_layernorm_residual_dropout_bias_test.cu
-
由 HappyHeavyRain 提交于
* generate static graph code of some ops by yaml * fix the code-style of yaml * fix the framework_ci for triangular_solve * change the 'data_type' of scatter * add the 'out: Out' of scatter_nd_add
-
由 engineer1109 提交于
-
由 weishengying 提交于
-
由 Wangzheee 提交于
* fix some transformer unitest
-
- 12 12月, 2022 4 次提交
-
-
由 wanghuancoder 提交于
This reverts commit 592ed40b.
-
由 RichardWooSJTU 提交于
* fit paddle.matmul in fleetx.gpt
-
由 YuanRisheng 提交于
* add new tensor * fix windows compile bugs * fix ci bugs * fix ci bugs * fix ci bugs * perfect according comment * fix ci compile bugs * add raw tensor * fix ci bugs * modify code by comment * delete String
-
由 傅剑寒 提交于
* fix codestyle * add double complex<float> complex<double> dtype support for syevj_batched * fix use_syevj flag for precision loss when input dtype of syevj_batch is complex128 in some case * optimize eigh in different case * fix missing ; bug * fix use_syevj bug * fix use_cusolver_syevj_batched flag
-