- 01 Aug, 2022 5 commits
-
-
Committed by Wangzheee
* add varlen_token_prune plugin, pass, convert
-
Committed by Xiaoxu Chen
-
Committed by Asthestarsfalll
-
Committed by Lin Manhui
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Add kernel declarations * Copy kernel implementation code * Transfer kernel implementation code * Register new kernels * Remove old kernels * Add lu_unpack_sig * Fix bugs * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * Update include files * lu_impl.h->lu_kernel_impl.h * Transfer infermeta * Add yaml and update python api * Add check_eager Co-authored-by: Bobholamovic <linmanhui@baidu.com>
-
Committed by heliqi
-
- 30 Jul, 2022 1 commit
-
-
Committed by zhiboniu
* phi_prior_box * add float[] support * phi_prior_box_optest * update
-
- 29 Jul, 2022 22 commits
-
-
Committed by Ainavo
* add paddle.nn.functional.pairwise_distance (cattidea/Paddle#273) * remove the test case for undefined behavior Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
Committed by Leo Chen
* remove cudaDeviceContext * remove more template * fix rocm compile
-
Committed by zhangkaihuo
* add sparse SyncBatchNorm
-
Committed by Haohongxiang
* migrate lstsq op * update * fix bugs for CIs * update * fix bugs * add uts * update * update * update * fix bugs of jip * fix bugs of hip * update * update according to review * update * update * update * update
-
Committed by caozhou
-
Committed by QingshuChen
* add some fp16 op for kunlun resnet50 model *test=kunlun * tmp *test=kunlun
-
Committed by zhiboniu
-
Committed by Aganlengzi
* add FLAGS_enable_api_kernel_fallback * deal with more cases * add ut for coverage
-
Committed by Jacek Czaja
* - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint
-
Committed by thunder95
* add count_nonzero api * remove grad test
-
Committed by Wang Bojun
* add yaml and utests of phi softplus add yaml of softplus fix softplus bug in phi * update utests * bug fix * bug fix for test_layers * layer api match * match def and doc in ops.py * doc polish * fix unwanted modification of thresholded_relu * style improve
-
Committed by kuizhiqing
-
Committed by ccrrong
* skip cast trt convert when input dtype is bool
-
Committed by JZ-LIANG
* fixed bug for pass & engine * fixed bug for benchmark GPT-3 * add tuner & profiler * add algorithms & config
-
Committed by Leo Chen
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
Committed by Allen Guo
-
Committed by Jiabin Yang
-
Committed by fwenguang
-
Committed by Lin Manhui
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * lu_impl.h->lu_kernel_impl.h Co-authored-by: Bobholamovic <linmanhui@baidu.com>
-
Committed by Yulong Ao
* [Phi] Add yaml for assign_value * [Phi] Fix the bug of the assign api and modify the unittest * [Phi] Fix the bug when the tensor does not have the backend info * [Phi] Replace the functional-style cast init by the brace-init * [Phi] Cast the data explicitly
-
Committed by ming1753
* fused_fc_elementwise_layernorm support fp16 * fused_fc_elementwise_layernorm support double
-
Committed by houj04
-
- 28 Jul, 2022 12 commits
-
-
Committed by heliqi
-
Committed by Weilong Wu
-
Committed by Ruibiao Chen
-
Committed by ziyoujiyi
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud comm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk
-
Committed by xiaoxiaohehe001
* depthwise_fp16 * depthwise_fp16 * depthwise_fp16 * depthwise_fp16
-
Committed by HongyuJia
* test_activation_op unittest error, yaml & activation.py in_dygraph_mode incomplete * fix test_activation_op unittest error, add yaml and dygraph test * fix code style with pre-commit * try to fix namespace error of abs in activation_functor.h * fix namespace error of abs
-
Committed by Xiaoxu Chen
* migrate dirichlet op kernel to phi * fix dirichlet sample memory leak
-
Committed by Haohongxiang
-
Committed by zyfncg
* support auto fallback to cpu kernel for custom device * fix some problems of kernel fallback
-
Committed by zhaoyingli
-
Committed by Chenxiao Niu
-
Committed by niuliling123
-