- 22 2月, 2023 2 次提交
-
-
由 Shuangchi He 提交于
* Fix some typos. Signed-off-by: Yulv-git <yulvchi@qq.com> * pre-commit Signed-off-by: Yulv-git <yulvchi@qq.com> --------- Signed-off-by: Yulv-git <yulvchi@qq.com>
-
由 zhupengyang 提交于
-
- 21 2月, 2023 3 次提交
-
-
由 HappyHeavyRain 提交于
* support bw invoke fw * fix scale in static_backward.yaml * fix the bug in tensorrt/convert * move 'scale','sign' into ops.yaml * add scale_grad of scale in op_compat.yaml * change generated_static_op in CMakeLists.txt
-
由 duanyanhui 提交于
* add static custom back_list * rm comments * fix log * fix comment
-
由 csy0225 提交于
* Optimize the ernie inference performance on xpu * fix enable runtime cache logic * when op's input shape has changed, should create a new runtime context * fix * set flag when input shape has changed
-
- 20 2月, 2023 4 次提交
-
-
由 shentanyue 提交于
-
由 HongyuJia 提交于
* polish tensor operants implementation * change year, 2021->2023
-
由 Huang Jiyi 提交于
* move save_op to fluid * fix namespace * move_load_kernel * fix kernel_register * move serialization to fluid * fix test * fix bugs
-
由 pangyoki 提交于
* change error * fix
-
- 17 2月, 2023 6 次提交
-
-
由 Sławomir Siwek 提交于
* change SHA * update to oneDNN 2.7 * update to 2.7.1 * update to 2.7.2 * add supported hardsigmoid * update to 2.7.3 * limit cpu threads for int8 test * group activations
-
由 Huang Jiyi 提交于
* move platform::transform to phi * fix bugs * move transform_test to phi * fix cmake * update namespace * fix cmake
-
由 zhupengyang 提交于
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
-
由 Ruibiao Chen 提交于
* Consider kernel argument def for data device transform in standalone executor * Fix ALL_BACKEND errors * Fix CI errors
-
由 jiangcheng 提交于
-
由 risemeup1 提交于
-
- 16 2月, 2023 6 次提交
-
-
由 joanna.wozna.intel 提交于
* Add matmul_v2 to the quantization process and adjust Ernie model test * Correct cpu_quantize_pass test * Move op to fuse transformation to placement pass * Correct test
-
由 Hulek 提交于
* New onednn test * checkopoint * added new test, fixed issue with onednn bias * fix bias check * remove prints, refactor code * delete old test * update python tests cmake * Delete depracated conv bias * Delete outdated bias from convolution test
-
由 shentanyue 提交于
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
由 zhupengyang 提交于
-
由 Ruibiao Chen 提交于
* Use StandaloneExecutor in FleetExecutor * Update FLAGS * Fix CI errors * Update code * Add force_root_scope_vars config * Update code * Fix CI errors * Fix test_layer_new errors
-
由 Huang Jiyi 提交于
* move variable_utils from phi_api_utils to fluid * fix coment * update include * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * update * update * fix CI-Windows-OpenBLAS * fix bugs * fix bugs * fix bugs * update include * move variable_utils to phi_utils * fix namespace
-
- 15 2月, 2023 5 次提交
-
-
由 Leo Chen 提交于
* make cinn_launch_op run interpretercore in tracing mode to reduce number of threads * skip getWorkqueue in tracing mode
-
由 Hulek 提交于
* Done * Deleted old python test, fixed new python test, changed names in parallel_UT * Revert parallel UT changes * Revert parallel UT changes v2 * Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass * delete sqrt from possible activations from conv_concat_relu test * review refactor * merge main * delete sqrt from list of compatible activations * Test with no outdated inputs
-
由 YuanRisheng 提交于
* move profiler * add file * fix mac compile bugs * fix ci bugs * fix mac bugs * fix ci bugs * fix compile bugs * perfect code according comment
-
由 risemeup1 提交于
-
由 YuhangLi 提交于
* [CUSTOM]custom device add black_list * change log level * fix some issues
-
- 14 2月, 2023 2 次提交
-
-
由 duanyanhui 提交于
* expand mix_precision to custom_device * fix bug * fix bug * fix comment * fix DEFINE bug
-
由 HongyuJia 提交于
* polish namespace * change static_tensor_operants * polish namespace
-
- 13 2月, 2023 2 次提交
-
-
由 HongyuJia 提交于
* fix copysign compile error on Windows * fix more files' macro
-
由 risemeup1 提交于
* upgrade protobuf to 3.19.0 in cmake * recover protobuf python version * fix distribute compile * fix * fix framework.data_feed_pb2 * fix macos ifdef * fix lite * test * update protoc from 3.19.0 t0 3.20.0 * test * debug * test * test * debug * debug * debug * debug * test * debug * update protocol from 3.20.0 to 4.21.12 * modify graph_brpc_client.h * modify graph_brpc_client.h * test * test * test * fix third_party cache problem on build ci * updata proto * test * test * test * test * test * test * fix coverage failed test * try to fix test_exe_fleet_model_run * fix cinn bug * fix windows compile problem * fix python/requirements --------- Co-authored-by: Npangyoki <pangyoki@126.com>
-
- 11 2月, 2023 2 次提交
-
-
由 HongyuJia 提交于
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload defination * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 2月, 2023 2 次提交
-
-
由 HongyuJia 提交于
* fix NLP-Bert model performance loss * fix windows compile error
-
由 zhupengyang 提交于
-
- 09 2月, 2023 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Adjust mkldnn_placement_pass to check library type and data type * Check if var has inputs * Remove unrelated test * Refactor
-
由 Wang Bojun 提交于
* trans_layernorm
-
- 08 2月, 2023 4 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
由 Sławomir Siwek 提交于
* add support for bf16 fused_ops * fused_matmul only
-
由 wangxiaoning 提交于
* fix codestyle * fix std
-
由 Yuang Liu 提交于
-