- 09 Mar, 2023 10 commits
-
-
Committed by iSerendipity
* add output defs for fused_adam kernel
* complete the other defs for cpu and gpu
* remove register for param_out
-
Committed by iSerendipity
* Add output defs for sgd kernel
* add datatype infer for sgd
* add infer logic
-
Committed by iSerendipity
* add output defs for edit_distance kernel
* change seqnum as output0
-
Committed by Wang Xin
-
Committed by 张春乔
* add REGISTER of float32 in accuracy
* fix something
-
Committed by Wang Xin
-
Committed by TaoTao Li
* add comm context for device context
* add broadcast phi operator kernel and api
* add broadcast support dtype, update ut
* fix broadcast bfloat16 type
* fix ut
* update test_collective_broadcast_api timeout to 300
-
Committed by 张春乔
-
Committed by 张春乔
-
Committed by RedContritio
* add visit_place to phi/core/utils
* remove reference to operator.h in phi/kernels/funcs/math_function.h
* update data type from framework.proto to phi
* fix enforce error in fluid
-
- 08 Mar, 2023 8 commits
-
-
Committed by Wang Xin
* add output defs for qr kernel
* Revert "add output defs for qr kernel". This reverts commit 2704d4053219418245180564aa5fb2f3f19d1af5.
* remove op from blacklist
-
Committed by Ryan
* add isfinite
* add isnan & isinf
* add xpu isNaN
-
Committed by Ainavo
-
Committed by Huang Jiyi
-
Committed by 张春乔
-
Committed by Ryan
-
Committed by Huang Jiyi
* add output defs
* add output defs for kps
-
Committed by Ruibiao Chen
-
- 07 Mar, 2023 3 commits
-
-
Committed by zhupengyang
-
Committed by Chen Weihang
-
Committed by Ruibiao Chen
-
- 06 Mar, 2023 3 commits
-
-
Committed by Paulina Gacek
* first approach
* test finished
* cpp test deleted
* CMakeLists corrected
* multi_gru_seq_fuse_pass rewritten
* dummy cout deleted
* review changes
* timeout extended
-
Committed by Ruibiao Chen
* Remove InterpretercoreInferShapeContext
* Fix lod errors
-
Committed by Sławomir Siwek
-
- 02 Mar, 2023 3 commits
-
-
Committed by Ruibiao Chen
* Check structured kernel for new executor static build
* Update code
* Ready for resnet50
* Move transfer_dtype to phi
* Ready for transformer
* Fix CI errors
* Fix layer_norm InferMeta
* Remove layer_norm infermeta fix
-
Committed by zyfncg
* fix performance drop in BF16 models
* fix test_cpu_quantize_squash_pass
-
Committed by Yuanle Liu
-
- 01 Mar, 2023 2 commits
-
-
Committed by csy0225
-
Committed by zhupengyang
-
- 28 Feb, 2023 4 commits
-
-
Committed by Hulek
* Added file
* Tests separated and rewritten, fixed fc_lstm_fuse_pass
* Resolve conflicts
-
Committed by zhupengyang
-
Committed by zhoutianzi666
* forbid tensorrt_engine op's output from being a persistable var
-
Committed by niuliling123
-
- 27 Feb, 2023 4 commits
-
-
Committed by jiangcheng
-
Committed by Wang Bojun
* add sm version check
* use GetGPUComputeCapability
-
Committed by HongyuJia
* [Error Msg] Polish error message when GPU kernel not found
* Only test in GPU environment
-
Committed by csy0225
-
- 24 Feb, 2023 3 commits
-
-
Committed by Sławomir Siwek
* ConvertToFusedOp
* change static to inline
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
-
Committed by niuliling123
-
Committed by Aurelius84
* [CINN]Enhance CacheKey hash logic by considering input dtypes
* add unittest
* fix typo
* fix typo
* fix map.at
* fix find
* fix test
* fix cinn cache key structure realize
* using ordered map for attributes
* add test by review advice
Co-authored-by: jiangcheng <thisjiang@qq.com>
-