- 16 12月, 2022 1 次提交
-
-
由 MarDino 提交于
* cast to higher precision type to prevent fp16 diff problem * fix bilinear backward and add more unittest case
-
- 15 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
-
- 12 12月, 2022 2 次提交
- 11 12月, 2022 1 次提交
-
-
由 limingshu 提交于
* first commit. * refine performance with fast_divmod * refine performance with fast_divmod
-
- 09 12月, 2022 2 次提交
-
-
由 niuliling123 提交于
-
由 PuQing 提交于
-
- 08 12月, 2022 3 次提交
-
-
由 201716010711 提交于
-
由 Netpunk 提交于
* remove bbox_util.h from phi * add file bbox_util.h * reframe bbox_util.h
-
由 Netpunk 提交于
-
- 07 12月, 2022 2 次提交
- 06 12月, 2022 1 次提交
-
-
由 xiongkun 提交于
* make bilinear interpolate stable. * fix code
-
- 05 12月, 2022 4 次提交
-
-
由 limingshu 提交于
* first commit * fix bugs according to ci * add some changes * change file name into function.cu.h * remove const_cast
-
由 Roc 提交于
-
由 Ruibiao Chen 提交于
* Replace mutable_data with DeviceContext.Alloc in phi kernels * Fix CI errors * Fix CI errors * Fix CI errors, test=kunlun * Fix CI errors, test=kunlun * Handle rnn_functor * Update approvals
-
由 sneaxiy 提交于
* register more bf16 ops * update to register coresponding backward ops
-
- 02 12月, 2022 2 次提交
-
-
由 Bo Zhang 提交于
* profile reduce kernel for fp16 and reduceHigherdim * use reinterpret_cast * fix for CI on ROCm * add Macro for ROCm * ROCm CI config * ROCm CI config * unit test repair * pull * add common_funcs.h * reduceType * Update reduce_function.h * not higher * rename
-
由 Jiabin Yang 提交于
* [Eager] Fix paddle.grad interface * [Eager] Support minimum SubGraph for GeneralGrad * Add needed_nodes to prune grad graph more thoroughly * [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode * [Eager] Fix paddle.grad interface * Polish code * remove potential_stop_node * Add endding_nodes to enhance genSugraph logic * clear endding_nodes_ * polish code * rename endding_nodes to endding_nades_ * Refactor grad interface * Add register_hook case to fix coverage-ci * Fix code format * Refactor general_grad * Add more code comments * call clear directly to release GradSlotMeta * fix a mistake * fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on. * fix batch_norm_double_grad yaml optional config * fix tanh_triple_grad yaml and kernels * fix MultiplyTripleGradKernel optional logic * fix merge mistake * fix compile error * remove legacy attr for bn * polish code * fix some kernel * merge develop * fix error * remote log * fix kernel with full like * hide value log behind * hide value log behind * fix matmul_triple grad Co-authored-by: NWeilong Wu <veyron_wu@163.com>
-
- 01 12月, 2022 1 次提交
-
-
由 zyfncg 提交于
* rename kernel for top_k, slogdeterminant, generate_proposals_v2 * fix bug
-
- 30 11月, 2022 3 次提交
-
-
由 Netpunk 提交于
* migrate transpose_op.cu.h and gpu_utils.h * format code style * fix some problems * format code * reset tranpose_op.cc * test commit * recover transpose_op.h * delete transpose_op.h * adjust header files order in transpose_op.cc
-
由 Aurelius84 提交于
* [Perf]Fix interploate OutSize data transform problem * fix code style * fix grad * fix phi kernel
-
由 zyfncg 提交于
* rename some kernel name * fix compile problem
-
- 29 11月, 2022 1 次提交
-
-
由 Vvsmile 提交于
Optimize the implementation of the argsort operator
-
- 28 11月, 2022 4 次提交
-
-
由 huangjiyi 提交于
* decouple cudnn_desc.h from fluid * move cudnn_desc.h from fluid to phi * fix bugs * decouple cudnn_helper.h from fluid * fix bugs * move cudnn_helper.h from fluid to phi * add fluid cudnn_helper.h * move miopen_desc.h from fluid to phi * move miopen_helper.h from fluid to phi * fix bugs * move gpu_dnn.h from fluid to phi * fix bugs * update copyright year * simplify gpu_dnn.h in fluid * fix bugs * fix xpu build bug * fix compile bug * fix bug
-
由 Asthestarsfalll 提交于
-
由 zyfncg 提交于
* add fluid_op_name_map * rename some kernel name * add comments for op-kernel map * refine map name of op to kernel
-
由 MarDino 提交于
-
- 25 11月, 2022 2 次提交
-
-
由 Wang Bojun 提交于
* group norm fp16 support
-
由 sneaxiy 提交于
* add bfloat16 support for more ops * fix ci compile * fix windows compile error * fix windows compile error * fix rocm compile error * fix ROCM compile error
-
- 24 11月, 2022 4 次提交
-
-
由 wangxiaoning 提交于
* add index sample fp16 support * remove fluid APIs in distributed_strategy.py and role_maker.py * Revert "remove fluid APIs in distributed_strategy.py and role_maker.py" This reverts commit 223bbee990d3bf69e252fc3c0f19e3873550a264. * remove fluid APIs in distributed_strategy.py and role_maker.py * remove index sample op changes * remove fluid APIs under fleet.base * remove fluid APIs under fleet.layers.mpu * remove fluid APIs under fleet.meta_optimizers * fix fluid error * fix util_factory.py * reset fluid.io.load_inference_model API
-
由 huangjiyi 提交于
* rm dependence to "convert_utils.h" in some files * fix bugs * replace DataType2String with DataTypeToString * replace framework::DataTypeSize with phi::SizeOf * mv convert_function from fluid to phi and rm old map * recommit with pre-commit * repalce ProtoVarType with ProtoDataType and update comment. * fix error about include "dnnl.hpp" * revert add dep mkldnn to convert_utils in phi * add mkldnn deps in convert_utils.h in phi * move deps to convert_utils.h in phi
-
由 PuQing 提交于
-
由 sneaxiy 提交于
-
- 23 11月, 2022 1 次提交
-
-
由 limingshu 提交于
* first commit * 2nd commit
-
- 22 11月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 21 11月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* refine reduce_all
-
由 huangjiyi 提交于
* move cross_entropy from fluid to phi * replace mutable_data with Alloc * use .template
-
- 18 11月, 2022 2 次提交
-
-
由 MarDino 提交于
* Add quick gelu and fused bias add kernel * fix annotation * remove useless code * add fast gelu option and set it in multi transformer op * add flag to restrict if use fast gelu approximate * fix flags conflict * fix use tanh function instead * add cudart version limit * use phi fast tanh func * fix comment
-
由 huangjiyi 提交于
* move "paddle/phi/backends/gpu/gpu_device_function.h" to phi * update copyright years * rm "fluid/platform/device/gpu/gpu_device_function.h" in phi * fix rocm-complie bugs
-