- 16 Mar, 2023 27 commits
-
-
Committed by Roc
-
Committed by HongyuJia
* init unit test commit, contains register thinking * support inplace * get inplaced x.grad * Try support inplace and hook at the same time * Support inplace, need debug * Support inplace successfully * Inplace use Tensor&, consistent with Tensor* * fix MapPlainOutputs bug * fix double grad inplace error
-
Committed by Chitsing KUI
* rename flash_attn_raw to flash_attn_unpadded * fix static api * fix static return
-
Committed by xjmxyt
* add dynamic support * add more test * fix bug * change test * change test
-
Committed by shaojie_wang
* add fp32 grad plus fp16 param in adamw * add python UT * fix test case * in test_adamw_op py file, force the moment2 value LE 0 * add a compare option * remove bf16 fused adam kernel case
-
Committed by Huang Jiyi
* remove contexts in tensor_utils * update from_blob * update from_blob * update from_blob * fix bug * fix bug
-
Committed by JZ-LIANG
* update env setting * update pass logic * dist op support bf16 * backward cast update * update setting * update backward * revert amp pass * update fp16 backward logic * register c_embedding bf16 * revert engine * add unitest * add unitest * update unitest * update cmake * update math * update math.py * update unitest * update unitest * revise unitest * revise unitest * update unitest * update unitest * update unitest
-
Committed by PuQing
* add rnn and searchsorted output defs * add gpu kernel
-
Committed by Huang Jiyi
* remove fluid thread_data_registry * update * fix bug
-
Committed by wenbin
* split pass * fix compile * fix ut * more time * modify ut * reduce dim * fix compile * reshape weight * tensor * remove enforce * static shape ut * batchsize * reorder pass * minus test cases * windows timeout * windows time out * remove test for windows * correct * sssss * xxx
-
Committed by kangguangli
-
Committed by kangguangli
* rm Executor._run_parallel * remove compiledProgram related tests of standaloneExecutor
-
Committed by kangguangli
-
Committed by Infinity_lee
* fix atan2 * fix * fix * fix * fix error * fix error * fix
-
Committed by liuruyan
-
Committed by HongyuJia
* [Custom Op] Add Python.h include dir path automatically * add debug info * fix expand str error
-
Committed by Nyakku Shigure
* [CodeStyle] initial ruff config * update F401 config * [CodeStyle][F401] replace autoflake with ruff * empty commit; test=document_fix * restore unfixable; test=document_fix
-
Committed by Leo Chen
* skip copy beta pow when use_global_beta_pow=1 * revert infermeta * register dtype and place * refine code
-
Committed by wanghuancoder
* delete old dygraph op test
-
Committed by Nyakku Shigure
* [CodeStyle] initial ruff config * update F401 config * [CodeStyle][NPY001] replace numpy deprecated type alias
-
Committed by 张春乔
* fix the circle import of NodeVarType * rollback sth. * rename the ast * add utils_helper.py
-
Committed by Vegetable dog
* update rnn.py * update common.py * update rnn.py * update common.py * fix CI
-
Committed by Sanbu
* Add output defs for generate_proposals,instance_norm kernel * fix
-
Committed by Leo Chen
-
Committed by xjmxyt
* add index select op * add to op teller * add trt version control * delete useless code
-
Committed by Shijie
* add sync * Fix nccl_op_test
-
Committed by Jiabin Yang
* support amp logic for layer_norm and softmax * fix layer_norm amp * fix layernorm api and dropout fp16 * fix layernorm api and dropout fp16 * fix bn, ln dtype in float16 * fix dropout fp16 * fix comment * fix cinn dropout amp error
-
- 15 Mar, 2023 13 commits
-
-
Committed by umiswing
-
Committed by Infinity_lee
* fix eig * fix * fix * fix * fix
-
Committed by SylarTiaNII
* add assign composite backward op * fix log msg * code style * fix comp rule * replace assign with by_pass
-
Committed by Jiabin Yang
* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) * [CINN]Enhance CacheKey hash logic by considering input dtypes * add unittest * fix typo * fix typo * fix map.at * fix find * fix test * fix cinn cache key structure realize * using ordered map for attributes * add test by review advice --------- Co-authored-by: jiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * Pr 50885 (#7) * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) * [CINN]Enhance CacheKey hash logic by considering input dtypes * add unittest * fix typo * fix typo * fix map.at * fix find * fix test * fix cinn cache key structure realize * using ordered map for attributes * add test by review advice --------- Co-authored-by: jiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * fix code in a dy2static-friendly way. * [dystatic] add hooker for prim --------- Co-authored-by: Aurelius84 <zhangliujie@baidu.com> Co-authored-by: jiangcheng <thisjiang@qq.com> Co-authored-by: cxxly <chenxx_id@163.com> * [prim] enable dygraph_to_static to support custom_vjp * fix cast prim and vjp dtype mapping error bug * Cxx prim custom vjp (#8) * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) --------- Co-authored-by: jiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * Pr 50885 (#7) * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) * [CINN]Enhance CacheKey hash logic by considering input dtypes --------- Co-authored-by: jiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * fix code in a dy2static-friendly way.
* [dystatic] add hooker for prim --------- Co-authored-by: Aurelius84 <zhangliujie@baidu.com> Co-authored-by: jiangcheng <thisjiang@qq.com> Co-authored-by: cxxly <chenxx_id@163.com> * [prim] enable dygraph_to_static to support custom_vjp * fix cast prim and vjp dtype mapping error bug * [dy2static-ci] fix dy2static ci errors. --------- Co-authored-by: Aurelius84 <zhangliujie@baidu.com> Co-authored-by: jiangcheng <thisjiang@qq.com> Co-authored-by: cxxly <chenxx_id@163.com> * [Prim] enable whitelist and blacklist for custom_vjp * support softmax grad * remove additional code * add test back --------- Co-authored-by: Aurelius84 <zhangliujie@baidu.com> Co-authored-by: jiangcheng <thisjiang@qq.com> Co-authored-by: cxxly <chenxx_id@163.com> Co-authored-by: xiongkun <807377414@qq.com>
-
Committed by limingshu
-
Committed by risemeup1
* add option for setup.py * add option for setup.py * add option for setup.py * add option for setup.py * add ennv_dict.py and dist/ to .gitignore * add ennv_dict.py and dist/ to .gitignore * modify .gitignore
-
Committed by risemeup1
* optimizing setup.py develop command * add libpaddle.so * modify setup.py * add python/paddle/distributed/fleet/.gitignore * add libpaddle.so to .gitignore * add *.so to python/paddle/libs/.gitignore * add new gitignore
-
Committed by thunder95
* untracked files * prelu_perf * remove unused files * upd * fix bug
-
Committed by umiswing
-
Committed by Leo Chen
-
Committed by Leo Chen
* support set_default_dtype bf16 * support float
-
Committed by Kang Zhao
* feat: add relu composite rule * feat: add relu composite rule, maximum op * feat: add relu composite rule, maximum op * feat: add relu composite rule, polish comments * feat: add relu composite rule, polish comments * feat: add relu composite rule, add python api of relu * feat: add relu composite rule, commit hook * fix: maximum type error & ban cinn test * fix: maximum input sequence bugs * resolve conflicts * fix: code style bugs * add: relu fp16 test * feat: add rsqrt composite rule * feat: add rsqrt composite rule * resolve conflicts of composite rule * fix: delete check eager
-
Committed by ronnywang
* [XPU] add int32,fp32 support for conv2d_transpose* * update
-