- 16 2月, 2023 2 次提交
-
-
由 Ruibiao Chen 提交于
* Use StandaloneExecutor in FleetExecutor * Update FLAGS * Fix CI errors * Update code * Add force_root_scope_vars config * Update code * Fix CI errors * Fix test_layer_new errors
-
由 Huang Jiyi 提交于
* move variable_utils from phi_api_utils to fluid * fix coment * update include * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * update * update * fix CI-Windows-OpenBLAS * fix bugs * fix bugs * fix bugs * update include * move variable_utils to phi_utils * fix namespace
-
- 15 2月, 2023 13 次提交
-
-
由 duanyanhui 提交于
-
由 niuliling123 提交于
-
由 Leo Chen 提交于
* make cinn_launch_op run interpretercore in tracing mode to reduce number of threads * skip getWorkqueue in tracing mode
-
由 Hulek 提交于
* Done * Deleted old python test, fixed new python test, changed names in parallel_UT * Revert parallel UT changes * Revert parallel UT changes v2 * Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass * delete sqrt from possible activations from conv_concat_relu test * review refactor * merge main * delete sqrt from list of compatible activations * Test with no outdated inputs
-
由 cyber-pioneer 提交于
* map output from composite rule to origin op add mean layer_norm dropout op map add input map check composite softmax support input shape [] * composite softmax support shape [] * polish log * solve conflict * polish code * polish op map output * add check dtype
-
由 zyfncg 提交于
-
由 YuanRisheng 提交于
* move profiler * add file * fix mac compile bugs * fix ci bugs * fix mac bugs * fix ci bugs * fix compile bugs * perfect code according comment
-
由 risemeup1 提交于
-
由 zhangyikun02 提交于
-
由 QingshuChen 提交于
-
由 lzy 提交于
* make FusedMultiTransformer supports variable-lengths. * modify ffn2 when cuda_version >= 11.6 because of #49392. * code style * delete remove_padding
-
由 risemeup1 提交于
* Improved prootbuf upgrades * Improved prootbuf upgrades * Improved prootbuf upgrades * limit protobuf version>=3.20.0
-
由 YuhangLi 提交于
* [CUSTOM]custom device add black_list * change log level * fix some issues
-
- 14 2月, 2023 9 次提交
-
-
由 engineer1109 提交于
fix X remove TensorCopy codestyle add fluid memory header fix symbol fix cmake fix cmake fix context fix header fix place fix context fix context fix context fix code fix custom context fix custom context fix copy fix data_transform fix style remove changes of custom fix scalar
-
由 duanyanhui 提交于
* expand mix_precision to custom_device * fix bug * fix bug * fix comment * fix DEFINE bug
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
* polish namespace * change static_tensor_operants * polish namespace
-
由 seemingwang 提交于
-
由 HongyuJia 提交于
-
由 risemeup1 提交于
-
由 limingshu 提交于
* first commit. * a little changes * add some changes for get vec_size efficiently * fix bugs --------- Co-authored-by: Nzhangbopd <1299246947@qq.com>
-
由 xjmxyt 提交于
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
- 13 2月, 2023 9 次提交
-
-
由 zyfncg 提交于
* delete axis of fmin * fix bug
-
由 HongyuJia 提交于
-
由 RedContritio 提交于
-
由 HongyuJia 提交于
* fix copysign compile error on Windows * fix more files' macro
-
由 HongyuJia 提交于
* Tensor support void* data() function * add unittest * add selectedRows unittest * polish unittest * polish unittest * polish unittest * polish unittest
-
由 ykkk2333 提交于
* add xpu adagrad and where_grad kernels, test=kunlun * add xpu pool3d kernels, test=kunlun
-
由 RedContritio 提交于
* support size 0 dot input * prevent div 0 in grad * add unittest * remove unnecessary vlog * add unittests
-
由 HongyuJia 提交于
* fix py::array_t calling bug * polish code
-
由 risemeup1 提交于
* upgrade protobuf to 3.19.0 in cmake * recover protobuf python version * fix distribute compile * fix * fix framework.data_feed_pb2 * fix macos ifdef * fix lite * test * update protoc from 3.19.0 t0 3.20.0 * test * debug * test * test * debug * debug * debug * debug * test * debug * update protocol from 3.20.0 to 4.21.12 * modify graph_brpc_client.h * modify graph_brpc_client.h * test * test * test * fix third_party cache problem on build ci * updata proto * test * test * test * test * test * test * fix coverage failed test * try to fix test_exe_fleet_model_run * fix cinn bug * fix windows compile problem * fix python/requirements --------- Co-authored-by: Npangyoki <pangyoki@126.com>
-
- 12 2月, 2023 1 次提交
-
-
由 Xiaoxu Chen 提交于
-
- 11 2月, 2023 2 次提交
-
-
由 HongyuJia 提交于
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload defination * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 2月, 2023 4 次提交
-
-
由 umiswing 提交于
-
由 Leo Guo 提交于
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
-
由 Aurelius84 提交于
* Fix inferMefer in transpose2_grad * fix infershape * fix unittest
-
由 ykkk2333 提交于
-