- 16 Feb, 2023 10 commits
-
-
Committed by whs
Add PTQ and exporting function
1. Add Post-Training Quantization
1.1 Abstract some functions from QAT to Quantization class
1.2 Add Post-Training Quantization by extending Quantization class
1.3 Add observers for PTQ
1.4 Add unittest for PTQ
2. Add exporting function for QAT and PTQ
-
Committed by shentanyue
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
Committed by HongyuJia
* polish namespace * change static_tensor_operants * polish namespace * support add, subtract, divide * add unit test * polish unittest * fix cmake error * polish unittest
-
Committed by zyfncg
-
Committed by Aurelius84
-
Committed by ronnywang
* [XPU] add group_norm kernel * update * add xpu sin, cos, randint, linspace kernels * update * update
-
Committed by zhupengyang
-
Committed by Ruibiao Chen
* Use StandaloneExecutor in FleetExecutor * Update FLAGS * Fix CI errors * Update code * Add force_root_scope_vars config * Update code * Fix CI errors * Fix test_layer_new errors
-
Committed by 姜永久
* disable unwanted dygraph tests * mine_hard_exa
-
Committed by zqw_1997
* beta * small commit * add batch_norm composite rule; move composite test case; remove useless var; add composite op blacklist * small change v2 * finish the test_composite_mean and test_composite_mean_grad * add ops assertion to the tests * add cinn test * fix the error and inappropriate usage in func: mean_composite * remove the ref of outer lib in primtives.py * modify sample code of reduce_sum * fix composite mean op map * modify test cases to test more float types * remove cpu float16 test * cinn test fix * remove reduce_max * change the name sum to sum_x * change the use of reduce_sum to sum --------- Co-authored-by: cyber-pioneer <chenzhuo@tju.edu.cn>
-
- 15 Feb, 2023 11 commits
-
-
Committed by Hulek
* Done * Deleted old python test, fixed new python test, changed names in parallel_UT * Revert parallel UT changes * Revert parallel UT changes v2 * Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass * delete sqrt from possible activations from conv_concat_relu test * review refactor * merge main * delete sqrt from list of compatible activations * Test with no outdated inputs
-
Committed by xu98bin
* auto parallel align tool * modify function get_var's return * add save and load in align_tool * modify load function and save function * add finding different ops in align tool * full auto parallel align tool; add test file for auto parallel align tool; set timeout for test; modify get_backward_tmp_var function; add annotation for align tool; modify test file; modify code to restart CI; remove timeout * set timeout
-
Committed by wangzhen38
-
Committed by cyber-pioneer
* map output from composite rule to origin op; add mean, layer_norm, dropout op map; add input map check; composite softmax support input shape [] * composite softmax support shape [] * polish log * solve conflict * polish code * polish op map output * add check dtype
-
Committed by zhangyikun02
-
Committed by lzy
* make FusedMultiTransformer support variable lengths. * modify ffn2 when cuda_version >= 11.6 because of #49392. * code style * delete remove_padding
-
Committed by zqw_1997
* remove incubate.data_generator * modify the setup.py * modify the setup.py.in
-
Committed by wangxiaoning
* move ascend_transpiler * move transpiler.collective * remove checkpoint * fix * fix import * fix import * add init * fix * fix * fix
-
Committed by zqw_1997
-
Committed by wangzhen38
-
Committed by WangZhen
-
- 14 Feb, 2023 10 commits
-
-
Committed by duanyanhui
* expand mix_precision to custom_device * fix bug * fix bug * fix comment * fix DEFINE bug
-
Committed by wangzhen38
-
Committed by kangguangli
* process unit test matched test_p* * fix ci bug * fix codestyle * remove all tests about pe and restore some unrelated tests * delete test_parallel_executor_test_while_train.py
-
Committed by Aurelius84
* [Dy2St]Enhance @not_to_static API * del breakpoint()
-
Committed by mhy-666
-
Committed by GGBond8488
* add gelu composite rule * use full to replace fill_constant * change the form of calculation * remove float16 test for composite gelu * reformat code * remove float16 test case * add forward with prim and backward without prim test * add float16 test for composite gelu and add high dims test * add float16 test case and high dims test * shield float16 and cpu test case * increase train step to 10 in test cinn prim gelu * replace pow with multiply
-
Committed by risemeup1
-
Committed by xjmxyt
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
Committed by Zhang Jun
* update trt workspace size for inference predictor ut
-
Committed by littleforest
-
- 13 Feb, 2023 7 commits
-
-
Committed by RedContritio
-
Committed by Ryan
test=docoument_fix
-
Committed by ykkk2333
* add xpu adagrad and where_grad kernels, test=kunlun * add xpu pool3d kernels, test=kunlun
-
Committed by YangZhou
-
Committed by RedContritio
* support size 0 dot input * prevent div 0 in grad * add unittest * remove unnecessary vlog * add unittests
-
Committed by risemeup1
* upgrade protobuf to 3.19.0 in cmake * recover protobuf python version * fix distribute compile * fix * fix framework.data_feed_pb2 * fix macos ifdef * fix lite * test * update protoc from 3.19.0 to 3.20.0 * test * debug * test * test * debug * debug * debug * debug * test * debug * update protocol from 3.20.0 to 4.21.12 * modify graph_brpc_client.h * modify graph_brpc_client.h * test * test * test * fix third_party cache problem on build ci * update proto * test * test * test * test * test * test * fix coverage failed test * try to fix test_exe_fleet_model_run * fix cinn bug * fix windows compile problem * fix python/requirements --------- Co-authored-by: pangyoki <pangyoki@126.com>
-
Committed by Yulong Ao
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style * [Auto Parallel] Clear some fluid APIs * [Auto Parallel] Fix a bug of dist_scale
-
- 11 Feb, 2023 2 commits
-
-
Committed by HongyuJia
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload definition * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
Committed by Wang Bojun
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-