- 16 Feb, 2023 11 commits
-
-
Committed by joanna.wozna.intel
* Add matmul_v2 to the quantization process and adjust Ernie model test * Correct cpu_quantize_pass test * Move op to fuse transformation to placement pass * Correct test
-
Committed by Hulek
* New onednn test * checkpoint * added new test, fixed issue with onednn bias * fix bias check * remove prints, refactor code * delete old test * update python tests cmake * Delete deprecated conv bias * Delete outdated bias from convolution test
-
Committed by Tomasz Socha
-
Committed by shentanyue
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
Committed by HongyuJia
* polish namespace * change static_tensor_operants * polish namespace * support add, subtract, divide * add unit test * polish unittest * fix cmake error * polish unittest
-
Committed by Leo Chen
-
Committed by houj04
* [XPU] update xccl to 1.0.8 * update xdnn. add uint8 for concat and split. * update xdnn to 20230215.
-
Committed by Huang Jiyi
* move layer_norm_kernel.cu.h to phi * fix bugs * fix namespace * fix bugs * fix CI-Windows * replace mutable_data * fix bugs * fix bugs
-
Committed by zhupengyang
-
Committed by Ruibiao Chen
* Use StandaloneExecutor in FleetExecutor * Update FLAGS * Fix CI errors * Update code * Add force_root_scope_vars config * Update code * Fix CI errors * Fix test_layer_new errors
-
Committed by Huang Jiyi
* move variable_utils from phi_api_utils to fluid * fix comment * update include * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * update * update * fix CI-Windows-OpenBLAS * fix bugs * fix bugs * fix bugs * update include * move variable_utils to phi_utils * fix namespace
-
- 15 Feb, 2023 10 commits
-
-
Committed by duanyanhui
-
Committed by niuliling123
-
Committed by Leo Chen
* make cinn_launch_op run interpretercore in tracing mode to reduce number of threads * skip getWorkqueue in tracing mode
-
Committed by Hulek
* Done * Deleted old python test, fixed new python test, changed names in parallel_UT * Revert parallel UT changes * Revert parallel UT changes v2 * Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass * delete sqrt from possible activations from conv_concat_relu test * review refactor * merge main * delete sqrt from list of compatible activations * Test with no outdated inputs
-
Committed by zyfncg
-
Committed by YuanRisheng
* move profiler * add file * fix mac compile bugs * fix ci bugs * fix mac bugs * fix ci bugs * fix compile bugs * polish code according to comments
-
Committed by risemeup1
-
Committed by lzy
* make FusedMultiTransformer support variable lengths. * modify ffn2 when cuda_version >= 11.6 because of #49392. * code style * delete remove_padding
-
Committed by risemeup1
* Improved protobuf upgrades * Improved protobuf upgrades * Improved protobuf upgrades * limit protobuf version>=3.20.0
-
Committed by YuhangLi
* [CUSTOM]custom device add black_list * change log level * fix some issues
-
- 14 Feb, 2023 5 commits
-
-
Committed by duanyanhui
* expand mix_precision to custom_device * fix bug * fix bug * fix comment * fix DEFINE bug
-
Committed by HongyuJia
* polish namespace * change static_tensor_operants * polish namespace
-
Committed by HongyuJia
-
Committed by limingshu
* first commit. * a few changes * add some changes to get vec_size efficiently * fix bugs --------- Co-authored-by: zhangbopd <1299246947@qq.com>
-
Committed by xjmxyt
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
- 13 Feb, 2023 5 commits
-
-
Committed by RedContritio
-
Committed by HongyuJia
* fix copysign compile error on Windows * fix more files' macro
-
Committed by ykkk2333
* add xpu adagrad and where_grad kernels, test=kunlun * add xpu pool3d kernels, test=kunlun
-
Committed by HongyuJia
* fix py::array_t calling bug * polish code
-
Committed by risemeup1
* upgrade protobuf to 3.19.0 in cmake * recover protobuf python version * fix distribute compile * fix * fix framework.data_feed_pb2 * fix macos ifdef * fix lite * test * update protoc from 3.19.0 to 3.20.0 * test * debug * test * test * debug * debug * debug * debug * test * debug * update protocol from 3.20.0 to 4.21.12 * modify graph_brpc_client.h * modify graph_brpc_client.h * test * test * test * fix third_party cache problem on build ci * update proto * test * test * test * test * test * test * fix coverage failed test * try to fix test_exe_fleet_model_run * fix cinn bug * fix windows compile problem * fix python/requirements --------- Co-authored-by: pangyoki <pangyoki@126.com>
-
- 12 Feb, 2023 1 commit
-
-
Committed by Xiaoxu Chen
-
- 11 Feb, 2023 2 commits
-
-
Committed by HongyuJia
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload definition * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
Committed by Wang Bojun
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 Feb, 2023 6 commits
-
-
Committed by Aurelius84
* Fix InferMeta in transpose2_grad * fix infershape * fix unittest
-
Committed by HongyuJia
* fix NLP-Bert model performance loss * fix windows compile error
-
Committed by risemeup1
* fix test_fleet_exe_dist_model_run * test
-
Committed by Weilong Wu
-
Committed by zhupengyang
-
Committed by HongyuJia
-