- 17 1月, 2023 3 次提交
-
-
由 zhouweiwei2014 提交于
-
由 WangZhen 提交于
* Support call backward() without params in dy2st
-
由 Xiaoxu Chen 提交于
* support elementwise base func * fix compiling error and add test * support vjp for div using comp * remove additional change * fix dy2st error with magic num * fix dy magic num * another magic * another magic * another magic * add skip rename strategy * support add vjp * support add with new axis cal * support sub vjp * [prim] add multiply vjp rules * [prim] add multiply vjp rules * [prim] fix no infershape with composite in _append_backward_ops * [prim] add expand vjp rule * [prim] add exp vjp rule * uncomment infer shape for reshape/sum static prim api * [prim] fix tanh nullptr error * remove some print message * fix magic number in run_program relative tests @JiaBinYang * [prim] add expand,multiply,exp vjp rules * fix only support single direction reduce error * infer reduce dims using out dims Co-authored-by: NJiabinYang <360788950@qq.com>
-
- 16 1月, 2023 6 次提交
-
-
由 Weilong Wu 提交于
-
由 wawltor 提交于
-
由 Yuanle Liu 提交于
* add trt_support_nhwc_pass
-
由 QingshuChen 提交于
-
由 zqw_1997 提交于
-
由 xiaoguoguo626807 提交于
-
- 15 1月, 2023 1 次提交
-
-
由 Jiabin Yang 提交于
* support elementwise base func * fix compiling error and add test * remove additional param * support vjp for div using comp * remove additional change * fix dy2st error with magic num * fix dy magic num * another magic * another magic * add more test * fix windows problem * another magic * fix windows compile * invoke ci * add skip rename strategy * support add vjp * fix test_tanh * support add with new axis cal * fix resnet and some test * add composite log * support sub vjp * enhance_tests * support more dtype for full
-
- 13 1月, 2023 16 次提交
-
-
由 Weilong Wu 提交于
-
由 cyber-pioneer 提交于
-
由 Wang Bojun 提交于
* add fmha_flashattention oss plugin
-
由 zhouweiwei2014 提交于
-
由 ronnywang 提交于
* add where, atan2, median 0d ut * add where, atan2, median 0d ut * update * update * update
-
由 Zhang Jun 提交于
* update trt engine to set in/out data type * update * Update engine.cc * Update engine.cc * update * set engine output type before freeze the network * update * update trt autoscan ut * update * update ut * fix equal bug, update ut * fix cast and equal ut * update cast ut using TRT < 8.4 * set datatype from scope * check output var is nullptr * Update op_converter.h * update tensorrt_engine_op_test ut * update
-
由 Jiabin Yang 提交于
* support elementwise base func * fix compiling error and add test * remove additional param * support vjp for div using comp * remove additional change * fix dy2st error with magic num * fix dy magic num * another magic * another magic * add more test * fix windows problem * another magic * fix windows compile * invoke ci * add skip rename strategy * support add vjp * fix test_tanh * support add with new axis cal * fix resnet and some test * add composite log * support sub vjp
-
由 wuhuachaocoding 提交于
-
由 jameszhang 提交于
* kunlun add support for c_concat and c_split * replace mutable_data() and ShareDataWith()
-
由 ykkk2333 提交于
-
由 jameszhang 提交于
* fix xpu unittest issue: zero_dim_tensor * deal with leftout issue introduced by #49470
-
由 Leo Guo 提交于
-
由 zhouweiwei2014 提交于
-
由 wangshengxiang 提交于
-
由 Weilong Wu 提交于
* [PHI] rrelu add yaml * polish * polish
-
由 wuhuachaocoding 提交于
-
- 12 1月, 2023 7 次提交
-
-
由 sunli 提交于
* lerp support 0 Tensor * fix lerp grad * fix lerp zero test * fix 0D + ND/ND + 0D * fix check * update code * fix lerp infer shape * static backward test * updata static graph test
-
由 zhangkaihuo 提交于
-
由 jameszhang 提交于
* Fix reduce func bug in process_group_bkcl Also catch up with a recent process_group PR that failed to add XPU branch. Note that reduce is still accomplished by allreduce for xpu. Fix this should xccl lib be updated. * fix compile issue for non-XPU
-
由 wenbin 提交于
* compile fix * fix compile * compile fix * add more preln
-
由 FlyingQianMM 提交于
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss * [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor * [Zero-Dim] support input 0D Tensor for fmax,fmin,complex
-
由 yuehuayingxueluo 提交于
-
由 zhaoyingli 提交于
* recovery annotation * bugfix
-
- 11 1月, 2023 6 次提交
-
-
由 niuliling123 提交于
-
由 yuehuayingxueluo 提交于
* add FusedLinear pass * add fused_op_list and renname PASSES to OP_FUSION * add fused_passes_list to constants.py * add test_passes.py * fix test_fused_passes.py * fix add if float(paddle.version.cuda()) >= 11.6: * renamed test_fused_passes.py * fix CMakeList.txt
-
由 Ryan 提交于
* add enable_to_static and drop some methods of ProgramTranslator * fix code style * fix cant import enable_to_static and update unitest * change unitest and rollback code of PT * fix can't import as of utils * roll back PT * fix roll back * add some unitest * add unitest and fix codestyle bug in api.py * finish all unitest * remove ProgramTranslator * fix code style * restore test_program_translator * api.py remove get_func * TestDygraphToStaticCode * fix check_type and import err * roll back PT without getcode * roll back pt with get_code * convert_to_static * fix import __all__
-
由 Linjie Chen 提交于
-
由 Linjie Chen 提交于
* fix nll_loss * fix nll_loss * update * update * update * fix
-
由 姜永久 提交于
* rm retain_grad_flag for tests * modify transpose op * retain grads for xpu tests * lint * modify xpu test
-
- 10 1月, 2023 1 次提交
-
-
由 Wen Sun 提交于
* refactor: gloo comm context migration * fix: headers & avoid mutable_data usage * fix: cmake gloo dep * style: rename funcs * refactor: move to new files * fix: gloo deps * refactor: simplify create device
-