- 14 2月, 2023 9 次提交
-
-
由 HongyuJia 提交于
* polish namespace * change static_tensor_operants * polish namespace
-
由 seemingwang 提交于
-
由 HongyuJia 提交于
-
由 GGBond8488 提交于
* add gelu composite rule * use full replace fill_constant * change the form of calculation * remove float16 test for composite gelu * reformate code * remove float16 test case * add forwad with prim and backward without prim test * add float16 test for composite gelu and add high dims test * add float16 test case and high dims test * shield float16 and cpu test case * increase train step to 10 in test cinn prim gelu * replace pow to multiply
-
由 risemeup1 提交于
-
由 limingshu 提交于
* first commit. * a little changes * add some changes for get vec_size efficiently * fix bugs --------- Co-authored-by: Nzhangbopd <1299246947@qq.com>
-
由 xjmxyt 提交于
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
由 Zhang Jun 提交于
* update trt workspace size for inference predictor ut
-
由 littleforest 提交于
-
- 13 2月, 2023 14 次提交
-
-
由 zyfncg 提交于
* delete axis of fmin * fix bug
-
由 HongyuJia 提交于
-
由 RedContritio 提交于
-
由 RedContritio 提交于
-
由 HongyuJia 提交于
* fix copysign compile error on Windows * fix more files' macro
-
由 Ryan 提交于
test=docoument_fix
-
由 HongyuJia 提交于
* Tensor support void* data() function * add unittest * add selectedRows unittest * polish unittest * polish unittest * polish unittest * polish unittest
-
由 ykkk2333 提交于
* add xpu adagrad and where_grad kernels, test=kunlun * add xpu pool3d kernels, test=kunlun
-
由 YangZhou 提交于
-
由 RedContritio 提交于
* support size 0 dot input * prevent div 0 in grad * add unittest * remove unnecessary vlog * add unittests
-
由 HongyuJia 提交于
* fix py::array_t calling bug * polish code
-
由 risemeup1 提交于
* upgrade protobuf to 3.19.0 in cmake * recover protobuf python version * fix distribute compile * fix * fix framework.data_feed_pb2 * fix macos ifdef * fix lite * test * update protoc from 3.19.0 t0 3.20.0 * test * debug * test * test * debug * debug * debug * debug * test * debug * update protocol from 3.20.0 to 4.21.12 * modify graph_brpc_client.h * modify graph_brpc_client.h * test * test * test * fix third_party cache problem on build ci * updata proto * test * test * test * test * test * test * fix coverage failed test * try to fix test_exe_fleet_model_run * fix cinn bug * fix windows compile problem * fix python/requirements --------- Co-authored-by: Npangyoki <pangyoki@126.com>
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style * [Auto Parallel] Clear some fluid APIs * [Auto Parallel] Fix a bug of dist_scale
-
由 risemeup1 提交于
* optimize setup.py for conda envir * check python dependency * optimize code after reviewed
-
- 12 2月, 2023 1 次提交
-
-
由 Xiaoxu Chen 提交于
-
- 11 2月, 2023 2 次提交
-
-
由 HongyuJia 提交于
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload defination * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 2月, 2023 14 次提交
-
-
由 umiswing 提交于
-
由 Ruibiao Chen 提交于
-
由 Leo Guo 提交于
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
-
由 risemeup1 提交于
-
由 Aurelius84 提交于
* Fix inferMefer in transpose2_grad * fix infershape * fix unittest
-
由 ykkk2333 提交于
-
由 yuehuayingxueluo 提交于
-
由 wangxiaoning 提交于
* fluid clean * fix optimizer * fix distributed_transpiler * fix fluid.__init__ * remove from fluid.init
-
由 Infinity_lee 提交于
-
由 RedContritio 提交于
* add dim check in scatter * add check in scatter.cu * add unittest * remove unnecessary log and comment --------- Co-authored-by: RedContritio <>
-
由 HongyuJia 提交于
* fix NLP-Bert model performance loss * fix windows compile error
-
由 risemeup1 提交于
* fix test_fleet_exe_dist_model_run * test
-
由 Weilong Wu 提交于
-
由 zhupengyang 提交于
-