- 29 7月, 2022 15 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile
-
由 Haohongxiang 提交于
* migrate lstsq op * update * fix bugs for CIs * update * fix bugs * add uts * update * update * update * fix bugs of jip * fix bugs of hip * update * update according to review * update * update * update * update
-
由 QingshuChen 提交于
* add some fp16 op for kunlun resnet50 model *test=kunlun * tmp *test=kunlun
-
由 zhiboniu 提交于
-
由 Aganlengzi 提交于
* add FLAGS_enable_api_kernel_fallback * deal with more cases * add ut for coverage
-
由 Jacek Czaja 提交于
* - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint
-
由 ccrrong 提交于
* skip cast trt convert when input dtype is bool
-
由 JZ-LIANG 提交于
* fixed bug for pass & engine * fixed bug for benchmark GPT-3 * add tuner & profiler * add algorithms & config
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
由 Allen Guo 提交于
-
由 Jiabin Yang 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * lu_impl.h->lu_kernel_impl.h Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
由 Yulong Ao 提交于
* [Phi] Add yaml for assign_value * [Phi] Fix the bug of the assign api and modify the unittest * [Phi] Fix the bug when the tensor does not have the backend info * [Phi] Replace the functional-style cast init by the brace-init * [Phi] Cast the data explicitly
-
由 ming1753 提交于
* fused_fc_elementwise_layernorm support fp16 * fused_fc_elementwise_layernorm support double
-
由 houj04 提交于
-
- 28 7月, 2022 13 次提交
-
-
由 heliqi 提交于
-
由 HongyuJia 提交于
* test_activation_op unitest error, yaml & activation.py in_dygraph_mode incomplete * fix test_activation_op unitest error, add yaml and dygraph test * fix code style with pre-commit * try to fix namespace error of abs in activation_functor.h * fix namespace error of abs
-
由 Xiaoxu Chen 提交于
* migrate dirichlet op kernel to phi * fix dirichlet sample memory leak
-
由 Haohongxiang 提交于
-
由 Chenxiao Niu 提交于
-
由 niuliling123 提交于
-
由 Charles-hit 提交于
* Move frame OP to phi、add frame OP yaml config and supplement single test * add Header file of in_dygraph_mode * Modify variable name and FrameGradInferMeta multiplex UnchangedInferMeta * move seq2col to phi
-
由 Charles-hit 提交于
* Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * modify code style * add x to X mapping * add definition of lgamma * delete redundant lgamma definitions * Modify code comments * Modify ops.py code format * add lgamma single test and lgamma api in fluid * Optimized lgamma unittest
-
由 z8hanghuan 提交于
-
由 Weilong Wu 提交于
* refactor general_grad and fix some bugs * add TODO: support prune logic deeper
-
由 LiYuRio 提交于
-
由 Lin Manhui 提交于
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Fix: Move out_grad to first * Register new kernels * Remove old kernels * Move out_grad to last * Fix bugs * Transfer infermeta * Add yaml files * Add blank line * Fix code style * Optimize directory structure Co-authored-by: NBobholamovic <linmanhui@baidu.com>
-
由 houj04 提交于
* [XPU] add top_k op. test=kunlun * [XPU] add top_k op. test=kunlun * use PADDLE_ENFORCE_XDNN_NOT_NULL to check pointer. test=kunlun
-
- 27 7月, 2022 11 次提交
-
-
由 qipengh 提交于
-
由 Allen Guo 提交于
-
由 Allen Guo 提交于
* add more loss ops * add authors Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
-
由 pangyoki 提交于
* fix RemoveNode in fuse_elewise_add_act_pass * fix * change pointer to share_ptr * fix * fix * fix format * fix * fix graph_safe_remove_nodes
-
由 freeliuzc 提交于
* move crop_tensor from fluid to phi * delete fluid header files * fix crop_tensor_op dygraph_mode bug * modify header files, add out tensor check
-
由 Hui Zhang 提交于
* jit.save support peropty serilization * extract set property function * fix property test file name * fix typing error * fix typing error * fix test coverage
-
由 WangZhen 提交于
* Remove include fluid head files in JitLayer * Format code * Remove const to fix ci error * Fix param error * Polish jit layer include and cp some headers to python/include * Fix comment
-
由 Yuang Liu 提交于
-
由 Wang Bojun 提交于
* move average_accumulates op to phi kernel
-
由 Allen Guo 提交于
* sync misc changes * add authors Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> * up x * Revert "up x" This reverts commit f3fde458c6cc48613269a643cfe2acf689caccd3. * add guarg for ipu Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
-
由 z8hanghuan 提交于
* fix bug of elementwise_add_grad, *test=kunlun * fix bug, *test=kunlun * rm pooling_t, *test=kunlun * fix bug of ew_add_grad when inplace, *test=kunlun
-
- 26 7月, 2022 1 次提交
-
-
由 chenjian 提交于
* fix new dygraph record event for op * update unit test
-