- 21 Mar, 2023 1 commit
-
-
Committed by ShenLiang
* set order for hybridparallel * fix bug * fix -> * fix -> * fix -> * add topology * fix utest
-
- 20 Mar, 2023 4 commits
-
-
Committed by chalsliu
-
Committed by mayang002
-
Committed by Sonder
* update codes about pad3d * add codes about Tensor type Padding * update * update unit test files * format code style * update and to &&' * rewrite codes about pad3d * add codes about converting paddle pad format to tensorrt pad format * fix some errors * specify trt version range * fix dims initialization * fix code style * update test pad values * specify pad3d trt version * update unit test file scope * update unit test files * update pad3d paddings convert codes * update pad3d * add static mode support * update test file * fix bugs about dynamic mode test codes * fix bug and add limite in op_teller * use a new padding convert method[ITensor* padding with using Slice to split the pre_pad and the post pad] * fix PADDLE_THROW grammaly error * update test codes * add size check for Tensor padding
-
Committed by tianshuo78520a
-
- 19 Mar, 2023 2 commits
- 17 Mar, 2023 1 commit
-
-
Committed by PuQing
* add multinomial output defs * fix register on gpu
-
- 16 Mar, 2023 8 commits
-
-
Committed by HongyuJia
* init unit test commit, contains register thinking * support inplace * get inplaced x.grad * Try support inplace and hook at the same time * Support inplace, need debug * Support inplace successfully * Inplace use Tensor&, consistent with Tensor* * fix MapPlainOutputs bug * fix double grad inplace error
-
Committed by Huang Jiyi
* remove contexts in tensor_utils * update from_blob * update from_blob * update from_blob * fix bug * fix bug
-
Committed by PuQing
* add rnn and searchsorted output defs * add gpu kernel
-
Committed by Huang Jiyi
* remove fluid thread_data_registry * update * fix bug
-
Committed by wenbin
* split pass * fix compile * fix ut * more time * modify ut * reduce dim * fix compile * reshape weight * tensor * remove enforce * static shape ut * batchsize * reorder pass * minus test cases * windows timeout * windows time out * remove test for windows * correct * sssss * xxx
-
Committed by Infinity_lee
* fix atan2 * fix * fix * fix * fix error * fix error * fix
-
Committed by Sanbu
* Add output defs for generate_proposals,instance_norm kernel * fix
-
Committed by Leo Chen
-
- 15 Mar, 2023 5 commits
-
-
Committed by Infinity_lee
* fix eig * fix * fix * fix * fix
-
Committed by iSerendipity
* Revert "Revert "【Hackathon No.67】remove operator.h in blas.h (#50989)" (#51467)" This reverts commit b9d91531. * remove cout * add header * fix missing header * fix refer fluid error * fix missing header * Update repeat_interleave_grad_kernel_impl.h Change to phi style datatype. * Update repeat_interleave_grad_kernel_impl.h Fix missing header * datatype fluid -> phi * paddle::experimental -> phi * fix reference error * fix reference error * fix reference error * fix errors * fix missing FLAGS * fix missing headers * fix missing headers * fix missing headers * fix missing headers * fix missing header * fix missing header * fix errors
-
Committed by pangengzheng
-
Committed by HappyHeavyRain
* test_get_kernel * add invoke signature * change reduce_max * change frobenius_norm * reset reduce_max according to composite and change reduce_all * fix the bug when Scalar(*) * fix 'scalar when support_tensor' * change code according to review * change 'keep_signature' to 'manual_signature' and add some erro info
-
Committed by pangyoki
-
- 14 Mar, 2023 6 commits
-
-
Committed by chenxujun
-
Committed by pangyoki
* cuda graph support multi-stream for new executor * fix windows compile error * delete create_cuda_graph_stream
-
Committed by Infinity_lee
-
Committed by Ackeraa
add register of select Co-authored-by: Nwqgo <1552367872@qq.com>
-
Committed by cxxly
-
Committed by Sonder
-
- 13 Mar, 2023 13 commits
-
-
Committed by lubiu
-
Committed by TaoTao Li
* add all_gather and fix conflicts * fix code format * fix ut * fix broadcast ut
-
Committed by Zhenghai Zhang
* Add output defs for mode kernel * fix bug
-
Committed by iSerendipity
* remove fused_matmul from list * add infermeta for fused matmul
-
Committed by Sławomir Siwek
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
Committed by Sanbu
* Add output defs for conv3d_coo distribute_fpn_proposals kernel * fix
-
Committed by risemeup1
-
Committed by kangguangli
* find relevant testcase * remove with_data_parallel * trigger CI * do not apply ParameterServerGraphOptimizer * remove useless optimizer * remove with_data_parallel in test_dist_base * fix test_fleet_base_3 * only reserve changes for GraphExecutionOptimizer * fix bug * fix test_minst_dgc_nccl * fix typo * fix test_dist_mnist_gradient_merge * rm TestDistMnistNCCL2DGCMultiCards * fix optimizer conflicts * fix dist_mnist * fix test_dist_hapi * delete test_fleet_graph_execution_meta_optimizer & test_fleet_graph_executor * temporally not delete unittest * fix unittests * fix ci * recover prune in python/paddle/hapi/model.py
-
Committed by kangguangli
-
Committed by zhupengyang
-
Committed by junxiu777
* add register of KthvalueKernel add register of KthvalueKernel * Update kthvalue_kernel.cc * Update kthvalue_kernel.cu
-
Committed by zhoutianzi666
* use python to generate cutlass code * refine CommonConvKernelPart1, CommonConvKernelPart2 * remove useless code in generate_cutlass_code.sh * add more config in conv2d_residual * CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2 * add group conv support in util.cu * remove .sh * refine name * make name goodgit status! * add fuse_alpha * make code easy to understand * mot fopen generate in py * use python script to generate conv2d,group=1 cutlass code * use const & * use const & && use python script to generate conv2d/group=1 code
-
Committed by Little-chick
* Update interpreter_util.cc * Update auc_kernel.cc * Update auc_kernel.cu * Update auc_kernel.cc * Update auc_kernel.cu
-