- 27 Feb, 2023 1 commit
-
-
Committed by shaojie_wang
* register bfloat16 datatype for squared l2 norm * register bfloat16 datatype for softmax with upper triangular mask * register bfloat16 for tril triu cuda kernel
-
- 26 Feb, 2023 1 commit
-
-
Committed by Yiqun Liu
* Enable matmul + bias fusion in fused_gate_attention. * Add a variable to control whether to use fused matmul + bias.
-
- 24 Feb, 2023 4 commits
-
-
Committed by Jiabin Yang
* change amp with to_prim * fix prim amp * fix rules * fix linear * add amp test * add test * disable this test on cpu * disable this test on cpu --------- Co-authored-by: cyber-pioneer <chenzhuo@tju.edu.cn>
-
Committed by Charles-hit
-
Committed by HappyHeavyRain
* support 'backend' in static ops * change bitwise_xx comment in python * change bitwise_xxx comment in python * change 'backend' and 'data_type' in GetExpectedKernelType
-
Committed by xiaoguoguo626807
* support prim test in OpTest * fix cmake * fix op test * fix test_input_spec * disable cinn in reduce_sum unit test * add bfloat16 dtype for sum * add approve rules * polish code * add clear jit program function * convert grad out from tensor to numpy * remove unnecessary code * add only_prim flag * fix flag * fix op test * add attr * fix optest comp inplace error * fix op test * fix op test with guard * add initialization of check_comp flag * fix comp inplace error in op test * rename check_comp with check_prim and add bfloat16 dtype convert * rename comp_op_type to prim_op_type * rename comp to prim * remove useless code * skip ci check for only prim * add no_grad_vars and grad_outputs in prim test * fix var_dict * fix op test for only_prim * fix dy2static bugs * polish some code * temp * modify op test * except cinn test * modify bfp16 * modify pad grad * add pad_grad dtype * start cinn part --------- Co-authored-by: Charles-hit <wanghao107@baidu.com>
-
- 23 Feb, 2023 3 commits
-
-
Committed by Huang Jiyi
* move fluid generator to phi * move fluid generator to phi * update .gitignore * fix bugs * fix cannot find "glog/logging.h" in "generator.h" * fix bugs
-
Committed by HappyHeavyRain
* support 'complex promote' in yaml * change the complex_promote * change 'kron' in math.py * change 'kron' comment in python * change kron comment in python * change kron comment in python
-
Committed by jameszhang
* kunlun support c_softmax_with_cross_entropy * fix grad calc error * replace mutable_data() and ShareDataWith() * update xdnn * update xpu toolchain to 20230215 * remove fluid from test file
-
- 22 Feb, 2023 3 commits
-
-
Committed by TaoTao Li
-
Committed by Shuangchi He
* Fix some typos. Signed-off-by: Yulv-git <yulvchi@qq.com> * pre-commit Signed-off-by: Yulv-git <yulvchi@qq.com> --------- Signed-off-by: Yulv-git <yulvchi@qq.com>
-
Committed by Jiabin Yang
* tmp gather vjp * support gather * remove useless code * fix compiling error * fix ut * add eager test * add eager test * add seed * fix cpu error * fix transpose op compat * remove tensor index case * fix prim_cinn * fix ut
-
- 21 Feb, 2023 3 commits
-
-
Committed by HappyHeavyRain
* support bw invoke fw * fix scale in static_backward.yaml * fix the bug in tensorrt/convert * move 'scale','sign' into ops.yaml * add scale_grad of scale in op_compat.yaml * change generated_static_op in CMakeLists.txt
-
Committed by QingshuChen
-
Committed by Huang Jiyi
* move sequence_padding to phi * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * revert and update phi::XPUContext
-
- 20 Feb, 2023 1 commit
-
-
Committed by Huang Jiyi
* move save_op to fluid * fix namespace * move_load_kernel * fix kernel_register * move serialization to fluid * fix test * fix bugs
-
- 17 Feb, 2023 3 commits
-
-
Committed by yuehuayingxueluo
* rename multi_tensor_adam to fused_adam * fix some bugs * fix CI coverage * rename test_fused_adam.py * fix some bug * add test_fused_adam_op.py * fix some bugs * fix fused_adam_op.cc * fix CI bugs * fix CI bug * fix CI bug
-
Committed by Sławomir Siwek
* change SHA * update to oneDNN 2.7 * update to 2.7.1 * update to 2.7.2 * add supported hardsigmoid * update to 2.7.3 * limit cpu threads for int8 test * group activations
-
Committed by Huang Jiyi
* move platform::transform to phi * fix bugs * move transform_test to phi * fix cmake * update namespace * fix cmake
-
- 16 Feb, 2023 4 commits
-
-
Committed by shentanyue
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
Committed by Huang Jiyi
* move layer_norm_kernel.cu.h to phi * fix bugs * fix namespace * fix bugs * fix CI-Windows * replace mutable_data * fix bugs * fix bugs
-
Committed by Ruibiao Chen
* Use StandaloneExecutor in FleetExecutor * Update FLAGS * Fix CI errors * Update code * Add force_root_scope_vars config * Update code * Fix CI errors * Fix test_layer_new errors
-
Committed by Huang Jiyi
* move variable_utils from phi_api_utils to fluid * fix comment * update include * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * update * update * fix CI-Windows-OpenBLAS * fix bugs * fix bugs * fix bugs * update include * move variable_utils to phi_utils * fix namespace
-
- 15 Feb, 2023 5 commits
-
-
Committed by duanyanhui
-
Committed by zyfncg
-
Committed by YuanRisheng
* move profiler * add file * fix mac compile bugs * fix ci bugs * fix mac bugs * fix ci bugs * fix compile bugs * polish code according to review comments
-
Committed by lzy
* make FusedMultiTransformer support variable lengths. * modify ffn2 when cuda_version >= 11.6 because of #49392. * code style * delete remove_padding
-
Committed by risemeup1
* Improved protobuf upgrades * Improved protobuf upgrades * Improved protobuf upgrades * limit protobuf version>=3.20.0
-
- 14 Feb, 2023 1 commit
-
-
Committed by limingshu
* first commit. * a few small changes * add some changes for get vec_size efficiently * fix bugs --------- Co-authored-by: zhangbopd <1299246947@qq.com>
-
- 13 Feb, 2023 2 commits
-
-
Committed by ykkk2333
* add xpu adagrad and where_grad kernels, test=kunlun * add xpu pool3d kernels, test=kunlun
-
Committed by risemeup1
* upgrade protobuf to 3.19.0 in cmake * recover protobuf python version * fix distribute compile * fix * fix framework.data_feed_pb2 * fix macos ifdef * fix lite * test * update protoc from 3.19.0 to 3.20.0 * test * debug * test * test * debug * debug * debug * debug * test * debug * update protocol from 3.20.0 to 4.21.12 * modify graph_brpc_client.h * modify graph_brpc_client.h * test * test * test * fix third_party cache problem on build ci * update proto * test * test * test * test * test * test * fix coverage failed test * try to fix test_exe_fleet_model_run * fix cinn bug * fix windows compile problem * fix python/requirements --------- Co-authored-by: pangyoki <pangyoki@126.com>
-
- 12 Feb, 2023 1 commit
-
-
Committed by Xiaoxu Chen
-
- 10 Feb, 2023 2 commits
-
-
Committed by Aurelius84
* Fix InferMeta in transpose2_grad * fix infershape * fix unittest
-
Committed by zhupengyang
-
- 09 Feb, 2023 4 commits
-
-
Committed by Zhang Jun
* update * support int64 shape tensor as engine input * add inference_predictor ut
-
Committed by Huang Jiyi
* decouple strided_memcpy * move strided_memcpy * move strided_memcpy to phi * fix namespace * update * fix gpu compile bugs
-
Committed by yuehuayingxueluo
* add multi_tensor_adam * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py * fix adam.py optimizer.py * fix adamw.py * fix test_multi_tensor_adam.py * fix CI bug * fix CI coverage * fix ci bug * fix betapow * fix some bugs * fix test_adamw_op.py * fix CI coverage * fix multi_tensor_adam_kernel.cc * fix CI bug * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py * fix code style * update C++ parts * remove python parts modification temporarily * add C++ ut * update betapow copy code logic * fix ci ut * fix windows ci * fix coverage ci * improve coverage rate --------- Co-authored-by: sneaxiy <sneaxiy@126.com>
-
Committed by kangguangli
* fix judgement about scope validation * fix ci bug: same address is not enough for data consistency * remove useless check
-
- 08 Feb, 2023 2 commits
-
-
Committed by Paulina Gacek
* QuantTranspose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
Committed by Yuang Liu
-