- 13 Dec, 2022 6 commits
-
-
Committed by joanna.wozna.intel
-
Committed by sneaxiy
* save fused_attention memory when dropout_rate = 0.0 * add ut * fix ut bug * fix fused_layernorm_residual_dropout_bias_test.cu
-
Committed by HappyHeavyRain
* generate static graph code of some ops by yaml * fix the code-style of yaml * fix the framework_ci for triangular_solve * change the 'data_type' of scatter * add the 'out: Out' of scatter_nd_add
-
Committed by engineer1109
-
Committed by weishengying
-
Committed by Wangzheee
* fix some transformer unitest
-
- 12 Dec, 2022 14 commits
-
-
Committed by wanghuancoder
This reverts commit 592ed40b.
-
Committed by RichardWooSJTU
* fit paddle.matmul in fleetx.gpt
-
Committed by YuanRisheng
* add new tensor * fix windows compile bugs * fix ci bugs * fix ci bugs * fix ci bugs * perfect according comment * fix ci compile bugs * add raw tensor * fix ci bugs * modify code by comment * delete String
-
Committed by 傅剑寒
* fix codestyle * add double complex<float> complex<double> dtype support for syevj_batched * fix use_syevj flag for precision loss when input dtype of syevj_batch is complex128 in some case * optimize eigh in different case * fix missing ; bug * fix use_syevj bug * fix use_cusolver_syevj_batched flag
-
Committed by feng_shuai
* fix: there are some bugs with trt 8.0 * fix:windows CI trt is too old
-
Committed by Yiqun Liu
* Enable to print device info. * Enhance the nan and inf checking for cpu. * Implement a common print function. * Unify the check of complex numbers. * Rewrite the omp method. * Count and print the number of nan and inf. * Change the print content. * Add unittest.
-
Committed by feng_shuai
-
Committed by zhoutianzi666
-
Committed by huangjiyi
* move norm_utils.cu.h from fluid to phi * remove norm_utils.h in fluid * fix bugs and replace mutable_data with Alloc * replace mutable_data with Alloc
-
Committed by zyfncg
-
Committed by zhoutianzi666
-
Committed by HappyHeavyRain
* generate static graph code of some ops by yaml, test = develop * fix 'take_along_axis' yaml style * reset scatter/scatter_nd_add * delete the comments of put_along_axis
-
Committed by Ruibiao Chen
* Add UT * Support cross-step stream synchronization for standalone executor * Fix typos * Fix typos * Update UTs
-
Committed by Wen Sun
* chore: unify `SingleTensor` * feat: dynamic check
-
- 11 Dec, 2022 1 commit
-
-
Committed by Wilber
-
- 10 Dec, 2022 1 commit
-
-
Committed by zhoutianzi666
* Add cast between int64 tensor and Paddle-TRT * Add Unit testing.
-
- 09 Dec, 2022 11 commits
-
-
Committed by Sławomir Siwek
* reshape * typo * remove header
-
Committed by Yuanle Liu
* clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass * fix unitest timeout
-
Committed by HongyuJia
* support custom_xpu * update cmake to test xpu * support custom_xpu, verify mechanism * fix test_custom_relu_op_xpu_setup.py, test=kunlun * fix FLAGS_init_allocated_mem * cancel TIMEOUT property * reset FLAGS_init_allocated_mem property
-
Committed by Zhang Jun
* add prelu
-
Committed by MarDino
-
Committed by huangjiyi
-
Committed by Weilong Wu
* mv fused_bias_dropout_residual_ln to fluid manual dir * rm useless comments
-
Committed by zhoutianzi666
* add cutlass act set in conv_elementwise_add_act_fuse_pass
-
Committed by zyfncg
* add suppport_tensor for code_gen to static graph * support code-gen for int_array * polish code * fix bug of data_type
-
Committed by Leo Chen
* move share_buffer kernel to phi * fix ut * add source file * fix window links
-
Committed by PuQing
-
- 08 Dec, 2022 7 commits
-
-
Committed by jiangcheng
-
Committed by limingshu
-
Committed by Kai Song
* fix 'BlasAXPBY unimplemented' error with custom device * fix utils CmakeLists bug
-
Committed by RichardWooSJTU
* rewrite delete_weight_deqquant_linear_op_encoder/decoder pass
-
Committed by Wangzheee
* general optimization no_varlen embedding layernorm
-
Committed by huangjiyi
* move cuda_graph from fluid to phi * move device_memory_aligment from fluid to phi * Revert "move device_memory_aligment from fluid to phi" This reverts commit b92fcd39a0a50fdac13278f49be0237a85f3a13f. * update xpu cmake
-
Committed by Wilber
-