- 15 2月, 2023 2 次提交
-
-
由 YuhangLi 提交于
* [CUSTOM]custom device add black_list * change log level * fix some issues
-
由 zachary sun 提交于
* add new approvals * modify github id
-
- 14 2月, 2023 17 次提交
-
-
由 engineer1109 提交于
fix X remove TensorCopy codestyle add fluid memory header fix symbol fix cmake fix cmake fix context fix header fix place fix context fix context fix context fix code fix custom context fix custom context fix copy fix data_transform fix style remove changes of custom fix scalar
-
由 duanyanhui 提交于
* expand mix_precision to custom_device * fix bug * fix bug * fix comment * fix DEFINE bug
-
由 HongyuJia 提交于
-
由 wangzhen38 提交于
-
由 kangguangli 提交于
* process unit test matched test_p* * fix ci bug * fix codestyle * remove all tests about pe and restore some irrelated tests * delete test_parallel_executor_test_while_train.py
-
由 Aurelius84 提交于
* [Dy2St]Enhance @not_to_static API * del breakpoint()
-
由 Nyakku Shigure 提交于
* update flake8 config * remove _pb2 from linter ignore list * refine config * empty commit, test=document_fix
-
由 mhy-666 提交于
-
由 HongyuJia 提交于
* polish namespace * change static_tensor_operants * polish namespace
-
由 seemingwang 提交于
-
由 HongyuJia 提交于
-
由 GGBond8488 提交于
* add gelu composite rule * use full replace fill_constant * change the form of calculation * remove float16 test for composite gelu * reformate code * remove float16 test case * add forwad with prim and backward without prim test * add float16 test for composite gelu and add high dims test * add float16 test case and high dims test * shield float16 and cpu test case * increase train step to 10 in test cinn prim gelu * replace pow to multiply
-
由 risemeup1 提交于
-
由 limingshu 提交于
* first commit. * a little changes * add some changes for get vec_size efficiently * fix bugs --------- Co-authored-by: Nzhangbopd <1299246947@qq.com>
-
由 xjmxyt 提交于
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
由 Zhang Jun 提交于
* update trt workspace size for inference predictor ut
-
由 littleforest 提交于
-
- 13 2月, 2023 14 次提交
-
-
由 zyfncg 提交于
* delete axis of fmin * fix bug
-
由 HongyuJia 提交于
-
由 RedContritio 提交于
-
由 RedContritio 提交于
-
由 HongyuJia 提交于
* fix copysign compile error on Windows * fix more files' macro
-
由 Ryan 提交于
test=docoument_fix
-
由 HongyuJia 提交于
* Tensor support void* data() function * add unittest * add selectedRows unittest * polish unittest * polish unittest * polish unittest * polish unittest
-
由 ykkk2333 提交于
* add xpu adagrad and where_grad kernels, test=kunlun * add xpu pool3d kernels, test=kunlun
-
由 YangZhou 提交于
-
由 RedContritio 提交于
* support size 0 dot input * prevent div 0 in grad * add unittest * remove unnecessary vlog * add unittests
-
由 HongyuJia 提交于
* fix py::array_t calling bug * polish code
-
由 risemeup1 提交于
* upgrade protobuf to 3.19.0 in cmake * recover protobuf python version * fix distribute compile * fix * fix framework.data_feed_pb2 * fix macos ifdef * fix lite * test * update protoc from 3.19.0 t0 3.20.0 * test * debug * test * test * debug * debug * debug * debug * test * debug * update protocol from 3.20.0 to 4.21.12 * modify graph_brpc_client.h * modify graph_brpc_client.h * test * test * test * fix third_party cache problem on build ci * updata proto * test * test * test * test * test * test * fix coverage failed test * try to fix test_exe_fleet_model_run * fix cinn bug * fix windows compile problem * fix python/requirements --------- Co-authored-by: Npangyoki <pangyoki@126.com>
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style * [Auto Parallel] Clear some fluid APIs * [Auto Parallel] Fix a bug of dist_scale
-
由 risemeup1 提交于
* optimize setup.py for conda envir * check python dependency * optimize code after reviewed
-
- 12 2月, 2023 1 次提交
-
-
由 Xiaoxu Chen 提交于
-
- 11 2月, 2023 2 次提交
-
-
由 HongyuJia 提交于
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload defination * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 2月, 2023 4 次提交
-
-
由 umiswing 提交于
-
由 Ruibiao Chen 提交于
-
由 Leo Guo 提交于
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
-
由 risemeup1 提交于
-