- 20 3月, 2023 11 次提交
-
-
由 limingshu 提交于
* optimization for fused linear op * fix code format * optimization for linear fused forward * merge with develop * fix bugs for gemm_ephilog * package of cublaslt ephilogue type with enmu * final fix before code reviewing * fix missed fusedType typo * fix code according to review suggestions * fix windows ci error * change location of MatmulPlanner * add some changes for compiler error fix ---------
-
由 Ainavo 提交于
* add_up004_for_ruff * 修改配置文件并清除object * fix md
-
由 zhouweiwei2014 提交于
-
由 Weilong Wu 提交于
-
由 mayang002 提交于
-
由 Sonder 提交于
* update codes about pad3d * add codes about Tensor type Padding * update * 更新单测文件 * format code style * update and to &&' * rewrite codes about pad3d * add codes about converting paddle pad format to tensorrt pad format * fix some errors * 指定trt版本范围 * 修正dims初始化方式 * fix code style * update test pad values * 指定pad3d trt版本 * 更新 单测 文件范围 * 更新单测文件 * update pad3d paddings convert codes * update pad3d * add static mode support * update test file * fix bugs about dynamic mode test codes * fix bug and add limite in op_teller * use a new padding convert method[ITensor* padding with using Slice to split the pre_pad and the post pad] * fix PADDLE_THROW grammaly error * update test codes * 添加对于Tensor padding 的 size 判断
-
由 tianshuo78520a 提交于
-
由 xiongkun 提交于
* merge * fix bugs while backward multi-times. * code format by ci
-
由 Jiabin Yang 提交于
-
由 HongyuJia 提交于
* [Tensor Operants & Prim-Relevant] Tensor supports compare operants * fix dependence of test_comp_static * fix unit test
-
由 wanghuancoder 提交于
-
- 19 3月, 2023 3 次提交
-
-
由 Charles-hit 提交于
-
由 Difer 提交于
* resgister for ftt_r2c, ftt_c2_r * fix clang-format
-
由 Sanbu 提交于
* Add output defs for argsort kernel * Update argsort_kernel.cc * Update argsort_kernel.cu * Update argsort_kernel.cc
-
- 18 3月, 2023 1 次提交
-
-
由 Leo Chen 提交于
-
- 17 3月, 2023 3 次提交
-
-
由 PuQing 提交于
* add multinomial output defs * fix register on gpu
-
由 Leo Chen 提交于
* support fetch empty tensor on CPUPlace * fix the shape in unittest of empty output
-
由 cyber-pioneer 提交于
* add bn vjp * fix example * fix code * fix code * fix cinn case * fix code * fix example * fix code * fix example * fix example
-
- 16 3月, 2023 13 次提交
-
-
由 HongyuJia 提交于
* init unit test commit, contains register thinking * support inplace * get inplaced x.grad * Try support inplace and hook at the same time * Support inplace, need debug * Support inplace successfully * Inplace use Tensor&, consistent with Tensor* * fix MapPlainOutputs bug * fix double grad inplace error
-
由 xjmxyt 提交于
* add dynamic support * add more test * fix bug * change test * change test
-
由 Huang Jiyi 提交于
* remove contexts in tensor_utils * update from_blob * update from_blob * update from_blob * fix bug * fix bug
-
由 JZ-LIANG 提交于
* update env setting * update pass logic * dist op support bf16 * backward cast update * update setting * update backward * revert amp pass * update fp16 backward logic * register c_embedding bf16 * revert engine * add unitest * add unitest * update unitest * update cmake * update math * update math.py * update unitest * update unitest * revise unitest * revise unitest * update unitest * update unitest * update unitest
-
由 PuQing 提交于
* add rnn and searchsorted output defs * add gpu kernel
-
由 Huang Jiyi 提交于
* remove fluid thread_data_registry * update * fix bug
-
由 wenbin 提交于
* split pass * fix compile * fix ut * more time * modify ut * reduce dim * fix compile * reshape weight * tensor * remove enforce * static shape ut * batchsize * reorder pass * minus test cases * windows timeout * windows time out * remove test for windows * correct * sssss * xxx
-
由 Infinity_lee 提交于
* fix atan2 * fix * fix * fix * fix error * fix error * fix
-
由 wanghuancoder 提交于
* delete old dygraph op test
-
由 Sanbu 提交于
* Add output defs for generate_proposals,instance_norm kernel * fix
-
由 Leo Chen 提交于
-
由 xjmxyt 提交于
* add index select op * add to op teller * add trt version control * delete useless code
-
由 Shijie 提交于
* add sync * Fix nccl_op_test
-
- 15 3月, 2023 9 次提交
-
-
由 Infinity_lee 提交于
* fix eig * fix * fix * fix * fix
-
由 SylarTiaNII 提交于
* add assign composite backward op * fix log msg * code style * fix comp rule * replace assign with by_pass
-
由 Jiabin Yang 提交于
* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) * [CINN]Enhance CacheKey hash logic by considering input dtypes * add unittest * fix typo * fix typo * fix map.at * fix find * fix test * fix cinn cache key structure realize * using ordered map for attributes * add test by review advice --------- Co-authored-by: Njiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * Pr 50885 (#7) * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) * [CINN]Enhance CacheKey hash logic by considering input dtypes * add unittest * fix typo * fix typo * fix map.at * fix find * fix test * fix cinn cache key structure realize * using ordered map for attributes * add test by review advice --------- Co-authored-by: Njiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * fix code in a dy2static-friendly way. * [dystatic] add hooker for prim --------- Co-authored-by: NAurelius84 <zhangliujie@baidu.com> Co-authored-by: Njiangcheng <thisjiang@qq.com> Co-authored-by: Ncxxly <chenxx_id@163.com> * [prim] enable dygraph_to_static to support custom_vjp * fix cast prim and vjp dtype mapping error bug * Cxx prim custom vjp (#8) * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) --------- Co-authored-by: Njiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * Pr 50885 (#7) * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) * [CINN]Enhance CacheKey hash logic by considering input dtypes --------- Co-authored-by: Njiangcheng <thisjiang@qq.com> * [prim] enable dygraph_to_static to support custom_vjp * fix code in a dy2static-friendly way. * [dystatic] add hooker for prim --------- Co-authored-by: NAurelius84 <zhangliujie@baidu.com> Co-authored-by: Njiangcheng <thisjiang@qq.com> Co-authored-by: Ncxxly <chenxx_id@163.com> * [prim] enable dygraph_to_static to support custom_vjp * fix cast prim and vjp dtype mapping error bug * [dy2static-ci] fix dy2static ci errors. --------- Co-authored-by: NAurelius84 <zhangliujie@baidu.com> Co-authored-by: Njiangcheng <thisjiang@qq.com> Co-authored-by: Ncxxly <chenxx_id@163.com> * [Prim] enable whitelist and blacklist for custom_vjp * support softmax grad * remove additional code * add test back --------- Co-authored-by: NAurelius84 <zhangliujie@baidu.com> Co-authored-by: Njiangcheng <thisjiang@qq.com> Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Nxiongkun <807377414@qq.com>
-
由 JingZhuangzhuang 提交于
-
由 iSerendipity 提交于
* Revert "Revert "【Hackathon No.67】remove operator.h in blas.h (#50989)" (#51467)" This reverts commit b9d91531. * remove cout * add header * fix missing header * fix refer fluid error * fix missing header * 更新 repeat_interleave_grad_kernel_impl.h Change to phi style datatype. * 更新 repeat_interleave_grad_kernel_impl.h Fix missing header * datatype fluid -> phi * paddle::experimental -> phi * fix reference error * fix reference error * fix reference error * fix errors * fix missing FLAGS * fix missing headers * fix missing headers * fix missing headers * fix missing headers * fix missing header * fix missing header * fix errors
-
由 pangengzheng 提交于
-
由 RedContritio 提交于
-
由 HappyHeavyRain 提交于
* test_get_kernel * add invoke signature * change reduce_max * change frobenius_norm * reset reduce_max according to composite and change reduce_all * fix the bug when Scalar(*) * fix 'scalar when support_tensor' * change code according to review * change 'keep_signature' to 'manual_signature' and add some erro info
-
由 pangyoki 提交于
-