- 17 3月, 2023 13 次提交
-
-
由 risemeup1 提交于
* limit protobu_version > 3.20.0 * limit protobu_version > 3.20.0
-
由 niuliling123 提交于
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Support float & bfloat16 when using cub * fix compile error * fix * fix rocm compile error
-
由 Nyakku Shigure 提交于
-
由 chenxujun 提交于
-
由 gouzil 提交于
* [phi][jit] rm Softmax StrideScal * [phi][jit] rm kStrideScal * [phi][jit] fix Softmax clean omission * [phi][jit] fix Softmax clean omission * [phi][jit] fix StrideScal clean omission * [phi][jit] fix mkl SoftmaxKernel clean omission * [phi][jit] fix test error * [phi][jit] fix test error * [phi][jit] rm NCHW16CMulNC * [phi][jit] fix test error * [phi][jit] rm HSum HMax * [phi][jit] fix test error * [phi][jit] rm StrideASum * add AUTHORS.md * [phi][jit] fix test error
-
由 zyfncg 提交于
* update unittest of fp16 for slice op * set atol to default value * update max_relative_eror
-
由 Leo Chen 提交于
* support fetch empty tensor on CPUPlace * fix the shape in unittest of empty output
-
由 HongyuJia 提交于
-
由 cyber-pioneer 提交于
* add bn vjp * fix example * fix code * fix code * fix cinn case * fix code * fix example * fix code * fix example * fix example
-
由 PommesPeter 提交于
* added paddle.polar to paddle * added paddle.polar unitest * failed to use pytorch for evaluating results, and changed with numpy implementation * updated code style * updated __init__.py * updated code style * fixed unitest code * lkh test polar * polar add * fixed errors and optimized code * fixed error * optimized polor api * updated code style * updated code style --------- Co-authored-by: Ndiadestiny <1247889154@qq.com>
-
由 mhy-666 提交于
* add sqrt composite rule/test * add sqrt composite rule/test * fix ops/sqrt, add cinn test * fix sqrt_comp * fix sqrt_comp * fix sqrt_comp * fix * fix codestyle * fix codestyle * add fp16 test * add ops/sqrt * fix * fix * fix unitest * fix * fix * fix
-
由 Nyakku Shigure 提交于
-
- 16 3月, 2023 27 次提交
-
-
由 Roc 提交于
-
由 HongyuJia 提交于
* init unit test commit, contains register thinking * support inplace * get inplaced x.grad * Try support inplace and hook at the same time * Support inplace, need debug * Support inplace successfully * Inplace use Tensor&, consistent with Tensor* * fix MapPlainOutputs bug * fix double grad inplace error
-
由 Chitsing KUI 提交于
* rename flash_attn_raw to flash_attn_unpadded * fix static api * fix static return
-
由 xjmxyt 提交于
* add dynamic support * add more test * fix bug * change test * change test
-
由 shaojie_wang 提交于
* add fp32 grad plus fp16 param in adamw * add python UT * fix test case * in test_adamw_op py file, force the moment2 value LE 0 * add a compare option * remove bf16 fused adam kernel case
-
由 Huang Jiyi 提交于
* remove contexts in tensor_utils * update from_blob * update from_blob * update from_blob * fix bug * fix bug
-
由 JZ-LIANG 提交于
* update env setting * update pass logic * dist op support bf16 * backward cast update * update setting * update backward * revert amp pass * update fp16 backward logic * register c_embedding bf16 * revert engine * add unitest * add unitest * update unitest * update cmake * update math * update math.py * update unitest * update unitest * revise unitest * revise unitest * update unitest * update unitest * update unitest
-
由 PuQing 提交于
* add rnn and searchsorted output defs * add gpu kernel
-
由 Huang Jiyi 提交于
* remove fluid thread_data_registry * update * fix bug
-
由 wenbin 提交于
* split pass * fix compile * fix ut * more time * modify ut * reduce dim * fix compile * reshape weight * tensor * remove enforce * static shape ut * batchsize * reorder pass * minus test cases * windows timeout * windows time out * remove test for windows * correct * sssss * xxx
-
由 kangguangli 提交于
-
由 kangguangli 提交于
* rm Executor._run_parallel * remove compiledProgram related tests of standaloneExecutor
-
由 kangguangli 提交于
-
由 Infinity_lee 提交于
* fix atan2 * fix * fix * fix * fix error * fix error * fix
-
由 liuruyan 提交于
-
由 HongyuJia 提交于
* [Custom Op] Add Python.h include dir path automatically * add debug info * fix expand str error
-
由 Nyakku Shigure 提交于
* [CodeStyle] initial ruff config * update F401 config * [CodeStyle][F401] replace autoflake with ruff * empty commit; test=document_fix * restore unfixable; test=document_fix
-
由 Leo Chen 提交于
* skip copy beta pow when use_global_beta_pow=1 * revert infermeta * register dtype and place * refine code
-
由 wanghuancoder 提交于
* delete old dygraph op test
-
由 Nyakku Shigure 提交于
* [CodeStyle] initial ruff config * update F401 config * [CodeStyle][NPY001] replace numpy deprecated type alias
-
由 张春乔 提交于
* fix the circle import of NodeVarType * rollback sth. * rename the ast * add utils_helper.py
-
由 Vegetable dog 提交于
* update rnn.py * update common.py * update rnn.py * update common.py * fix CI
-
由 Sanbu 提交于
* Add output defs for generate_proposals,instance_norm kernel * fix
-
由 Leo Chen 提交于
-
由 xjmxyt 提交于
* add index select op * add to op teller * add trt version control * delete useless code
-
由 Shijie 提交于
* add sync * Fix nccl_op_test
-
由 Jiabin Yang 提交于
* support amp logic for layer_norm and softmax * fix layer_norm amp * fix layernorm api and dropout fp16 * fix layernorm api and dropout fp16 * fix bn, ln dtype in float16 * fix dropout fp16 * fix comment * fix cinn dropout amp error
-