- 29 7月, 2022 11 次提交
-
-
由 kuizhiqing 提交于
-
由 ccrrong 提交于
* skip cast trt convert when input dtype is bool
-
由 JZ-LIANG 提交于
* fixed bug for pass & engine * fixed bug for benchmark GPT-3 * add tuner & profiler * add algorithms & config
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
由 Allen Guo 提交于
-
由 Jiabin Yang 提交于
-
由 fwenguang 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * lu_impl.h->lu_kernel_impl.h Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
由 Yulong Ao 提交于
* [Phi] Add yaml for assign_value * [Phi] Fix the bug of the assign api and modify the unittest * [Phi] Fix the bug when the tensor does not have the backend info * [Phi] Replace the functional-style cast init by the brace-init * [Phi] Cast the data explicitly
-
由 ming1753 提交于
* fused_fc_elementwise_layernorm support fp16 * fused_fc_elementwise_layernorm support double
-
由 houj04 提交于
-
- 28 7月, 2022 23 次提交
-
-
由 heliqi 提交于
-
由 Weilong Wu 提交于
-
由 Ruibiao Chen 提交于
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk
-
由 xiaoxiaohehe001 提交于
* depthwise_fp16 * depthwise_fp16 * depthwise_fp16 * depthwise_fp16
-
由 HongyuJia 提交于
* test_activation_op unitest error, yaml & activation.py in_dygraph_mode incomplete * fix test_activation_op unitest error, add yaml and dygraph test * fix code style with pre-commit * try to fix namespace error of abs in activation_functor.h * fix namespace error of abs
-
由 Xiaoxu Chen 提交于
* migrate dirichlet op kernel to phi * fix dirichlet sample memory leak
-
由 Haohongxiang 提交于
-
由 zyfncg 提交于
* support auto fallback to cpu kernel for cusom device * fix some problem of kernel fallback
-
由 zhaoyingli 提交于
-
由 Chenxiao Niu 提交于
-
由 niuliling123 提交于
-
由 Charles-hit 提交于
* Move frame OP to phi、add frame OP yaml config and supplement single test * add Header file of in_dygraph_mode * Modify variable name and FrameGradInferMeta multiplex UnchangedInferMeta * move seq2col to phi
-
由 Charles-hit 提交于
* Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * modify code style * add x to X mapping * add definition of lgamma * delete redundant lgamma definitions * Modify code comments * Modify ops.py code format * add lgamma single test and lgamma api in fluid * Optimized lgamma unittest
-
由 kuizhiqing 提交于
* add launch test * launch test for cpu * bs 1
-
由 z8hanghuan 提交于
-
由 Weilong Wu 提交于
* refactor general_grad and fix some bugs * add TODO: support prune logic deeper
-
由 LiYuRio 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Fix: Move out_grad to first * Register new kernels * Remove old kernels * Move out_grad to last * Fix bugs * Transfer infermeta * Add yaml files * Add blank line * Fix code style * Optimize directory structure Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
由 Jiabin Yang 提交于
-
由 houj04 提交于
* [XPU] add top_k op. test=kunlun * [XPU] add top_k op. test=kunlun * use PADDLE_ENFORCE_XDNN_NOT_NULL to check pointer. test=kunlun
-
由 Feiyu Chan 提交于
* fix typos in template for codegen of operators * change the way to set attributes for grad op maker
-
由 Yuang Liu 提交于
-
- 27 7月, 2022 6 次提交
-
-
由 shangliang Xu 提交于
-
由 Zhong Hui 提交于
-
由 ykkk2333 提交于
-
由 zhaoyingli 提交于
-
由 qipengh 提交于
-
由 ming1753 提交于
-