- 22 4月, 2022 5 次提交
-
-
由 Ruibiao Chen 提交于
* Add gpudnn yaml config for some OPs * Add grad gpudnn config * Fix CI errors * Fix CI errors * Fix CI errors * Fix conflicts
-
由 Ming-Xu Huang 提交于
* Fix leading dimension setting error in fused_gemm_epilogue_grad_op. * Add dyload to cuBlasLt functions. * Added cublasLtMatmulAlgoGetHeuristic to improve performance. * Added FLAGS_cublaslt_exhaustive_search_times to cublasLt epilogue * Added UTs to FLAGS_cublaslt_exhaustive_search_times * Added warmup runs in algo searching of Gemm epilogue. * Update copyright and documents. * Fixed error handling.
-
由 zhangkaihuo 提交于
-
由 zyfncg 提交于
* optimiaze performance of PreparePhiData * dygraph performance optimization
-
由 Jiabin Yang 提交于
* fix memory issue for eager * fix bug
-
- 21 4月, 2022 2 次提交
-
-
由 Aganlengzi 提交于
* [CustomDevice] fix macro * fix
-
由 sneaxiy 提交于
* support int16 argmax kernel * add fp16 test
-
- 20 4月, 2022 1 次提交
-
-
由 BrilliantYuKaimin 提交于
* 增加logspace的算子描述 * 增加logspace的形状推断 * 增加logspace核函数实现 * 在python中增加logspace接口 * 增加logspace单测 * 增加logspace * Update logspace_kernel.cu * Update logspace_op.cc * 调整代码格式 * Update doc of logspace * Update tensor.py * Update logspace_op.cc * Update logspace_kernel.cc * Update logspace_kernel.cu * Update test_logspace.py * 调整 logspace 的位置 * 调整代码格式
-
- 19 4月, 2022 8 次提交
-
-
由 Chen Weihang 提交于
-
由 jakpiase 提交于
* changes for md in tensor * ci fix * Temporarily limited dims for test * ci fix * removed unnecessary includes * added reviewers suggestions * checkouted two files to avoid changing more than 19 in single PR * minor fix * reverted one file to reduce files changed to 19
-
由 littletomatodonkey 提交于
* fix pad3d infer shape
-
由 fwenguang 提交于
-
由 YuanRisheng 提交于
[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from ElementwiseKernel(Part1) (#41806) * seperate add/div/sub/mul from elementwise * delete code * fix compile bugs * deal with conflict * fix bugs when compile * fix windows unit test bug * fix ci converage bugs
-
由 Aurelius84 提交于
* [Phi]Fix expand_sig infershape BUG under static graph mode * [Phi]Fix expand_sig infershape BUG under static graph mode * [Phi]Fix unittest * [Phi]Fix unittest
-
由 Aurelius84 提交于
* [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml * add unittest for full_like * fix unittest
-
由 Weilong Wu 提交于
* [Eager] Fix numpy interface for constructing empty tensor * Fix CI, construct empty tensor * Modify empty tensor's shape from [] to [0] * Add more test for constructing empty tensor
-
- 18 4月, 2022 4 次提交
-
-
由 zyfncg 提交于
* create tensor by empty in custom op * fix some bug
-
由 Lijunhui 提交于
-
由 zhangkaihuo 提交于
-
由 Siming Dai 提交于
* add eids result for graph_sample_neighbors * fix bug * move fisher_yates sample to warp * add cpu eid output * delete comment * delete comment * change nullptr placeholder * optimize sample kernel * fix mutable_data
-
- 17 4月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* split phi and fluid infermeta context * resolve conflict * fix type error * optimize scheduling perf * spec small vector size * replace all grad var name * fix test failed * move init defalut signature * polish details * polish details * fix no init bug * init sig for tests * add init sig for infer * fix infrt error * fix infrt failed * fix kunlun error * fix infrt failed
-
由 Chen Weihang 提交于
* fix place type related compat error * fix test failed * remove dll decl * revert place type change * add dll decl
-
- 16 4月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 15 4月, 2022 8 次提交
-
-
由 chentianyu03 提交于
* add adamw yaml * fix test case error * make the name of weight and bias in linear1 and linear2 to be constant
-
由 chentianyu03 提交于
* split reduce_kernel * rm reduce_kernel in cmake * split reduce_grad kernels * fix cmake build error * format code * fix standalone_executor_test error
-
由 Zhanlue Yang 提交于
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures * [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode * [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode * Enabled more test cases * [DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode * Adjusted test_imperative_star_gan_with_gradient_penalty.py
-
由 Jack Zhou 提交于
* Add core.eager.StringTensor __init__ which pyarray args can be passed * Add the numpy method of core.eager.StringTensor * revert tensor.to_string modification * Add ToPyObject for core.eager.StringTensor * Add debug string for core.eager.StringTensor * Remove place args of core.eager.StringTensor temporarily * Fix check string_tensor error * remove dtype of core.eager.StringTensor * add core.eager.StringTensor unittest * remove pstring from VarDesc * Add InitStringTensorWithStringTensor * Remove to_string modification * Remove zero_copy arg from StringTensor creator
-
由 Chen Weihang 提交于
-
由 zhangkaihuo 提交于
-
由 limingshu 提交于
* change cudnn helper for auto-tune * Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm. * Fix the bug in calculating and printing current step cache hit rate. * Improve the autotune cache and fix unittest. * Change the key from AlgorithmType to int64_t. * Fix unittest for cpu-only env. * change ChooseAlgoByWorkspace for heuristic mode Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
由 hong 提交于
* try to fix batch norm memory issue * fix batch norm memroy alloc bug * polish some code
-
- 14 4月, 2022 9 次提交
-
-
由 Lijunhui 提交于
* regist elementwise_xxx
-
由 Chen Weihang 提交于
-
由 YuanRisheng 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * add target dependency * deal with conflict * fix bugs when run unit test * fix unit test bugs
-
由 Jacek Czaja 提交于
* Add UT - Added missed data_layout - Added missing conversions - NDHWC added - NDHWC support in data_transform - another fix - condddate change - fix u- fix - fix - fix - fix - fix - fix to hack - compilation fix - fix to automatic merge * - reduced UT * - fix * - lint * - fix to lint
-
由 zyfncg 提交于
* support some c++ api in paddle namespace * change c++ api namespace in custom op
-
由 Zhanlue Yang 提交于
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures * [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode * [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode * Enabled more test cases * Fixed performance issues * Fixed minor issue
-
由 Aurelius84 提交于
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
* chnage dispatch to visit * resolve conflict
-