- 05 5月, 2022 1 次提交
-
-
由 xiongkun 提交于
-
- 04 5月, 2022 2 次提交
-
-
由 XiaoguangHu 提交于
* fix bug of batch_norm_grad kernel with fp16 * format code
-
由 XiaoguangHu 提交于
-
- 01 5月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
-
- 30 4月, 2022 2 次提交
-
-
由 xiongkun 提交于
* Extend python einsum interface to make einsum_v2 support multi-operands and switch it to default. * add opt_einsum dependence * add yaml and support eager model * fix by code review
-
由 littletomatodonkey 提交于
* fix pad3d infer shape * fix pad3d * fix pad default value * fix order * add unit test * fix unittest for ci coverage * add ndhwc check
-
- 28 4月, 2022 5 次提交
-
-
由 Chen Weihang 提交于
* opt attr eaque perf * opt attr select code * fix one hot infermeta * polish get attr impl * fix tests failed * add testcases
-
由 xiongkun 提交于
* full api fix * when out is None, go old dygraph mode * by static check * first version: support 2-inputs forwards. TODO: 1. backward 2. BroadCast 3. MultiVariable * time out -> 120
-
由 FlyingQianMM 提交于
set device id of Place() to get GPUContext needed by LimitGridDim in ElemwiseGradBroadcast (PaddlePaddle#42320) (#42332)
-
由 zyfncg 提交于
* Optimize performance of dygraph (v4) (#42196) * optimize performance of dygraph * optimize performance of dygraph and elementwise_add * optimize the trace op * fix bug * fix bug * fix unittest bug * fix code format * fix cherry-pick problem
-
由 zyfncg 提交于
* Optimize the performanece of sum api (#42231) * optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log * Add move construct for KernelSignature (#42253) * add move construct for KernelSignature * add noexcept * fix cherry-pick problem
-
- 27 4月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* Remove std::type_index in AttributeArdDef (#42122) * polish some impl * add lost attr type * polish details * fix error type * polish in name lists * add double attr * adapt infrt attr parse * add attr type test (#42263) * opt attr eaque perf (#42272)
-
由 Jiabin Yang 提交于
* fix memory issue for eager * fix bug
-
由 Chen Weihang 提交于
* Change small vector size (#42202) * change samll vector size * Update type_defs.h * Optimize dygraph InferShape perf (#42155) * init commit * remove two hash impl * fix bug * polish details * fix compile failed * fix compile failed * fix compile failed * add default kernel sig cache * fix get kernel arg defs error * remove kernel arg defs cache * fix origin op execute
-
- 26 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* Add paddle::variant and replace paddle::any (#42139) * add variant and replace any * split attribute * Optimize dygraph GetExpectedKernelType perf (#42154) * opt dygraph scheduling * revert part impl * fix variant compile error (#42203) * replace any by variant in infermeta (#42181)
-
- 25 4月, 2022 2 次提交
-
-
由 zyfncg 提交于
* optimiaze performance of PreparePhiData (#42093) * Dygraph performance optimization (v2) (#42103) * optimiaze performance of PreparePhiData * dygraph performance optimization * optimize performance of dygraph (#42137)
-
由 Aurelius84 提交于
[Cherry-Pick][Performance]Remove CudaStreamSychornize in ClipGradByGlobalNorm and fix shape op (#42170) * [Performance]Set ShapeKernel with ALL_BACKEND and ALL_LAYOUT (#42138) * [Performance]Set ShapeKernel with ALL_BACKEND and ALL_LAYOUT * [Performance]Set ShapeKernel with ALL_BACKEND and ALL_LAYOUT * [Performance]Remove CudaStreamSychornize in ClipGradByGlobalNorm (#42132)
-
- 22 4月, 2022 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 21 4月, 2022 4 次提交
-
-
由 zyfncg 提交于
* [PHI] Support some c++ api in paddle namespace (#41778) * support some c++ api in paddle namespace * change c++ api namespace in custom op * [Phi] Support setting size of vector<Tensor> for out in yaml (#41576) * support setting vector out size in yaml * support setting size of vector<tensor> for out in yaml * add data transform config for shape and size (#41909) * fix api_gen bug
-
由 Chen Weihang 提交于
* [Phi] Support setting size of vector<Tensor> for out in yaml (#41576) * support setting vector out size in yaml * support setting size of vector<tensor> for out in yaml * resolve conflict Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
-
由 Jiabin Yang 提交于
* cherry-pick python/paddle/utils/code_gen/backward.yaml * remove unsupported yaml Co-authored-by: NZhanlue Yang <jim19930609@gmail.com>
-
由 Chen Weihang 提交于
* polish tensor api details (#41971) * [CustomOp] Fix custom op pinned input error (#41972) * fix custom op pinned input error * fix compile error * fix inference custom op (#41999) * resolve conflict
-
- 20 4月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * add target dependency * deal with conflict * fix bugs when run unit test * fix unit test bugs
-
由 Aurelius84 提交于
[Cherry-Pick]Fix expand_sig infershape BUG under static graph mode and NeedTransformPlace behavior if set skip_transform in yaml (#41973) * [Phi]Fix expand_sig infershape BUG under static graph mode (#41936) * [Phi]Fix expand_sig infershape BUG under static graph mode * [Phi]Fix expand_sig infershape BUG under static graph mode * [Phi]Fix unittest * [Phi]Fix unittest * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml (#41920) * [Eager]Fix NeedTransformPlace behavior if set skip_transform in yaml * add unittest for full_like * fix unittest
-
- 19 4月, 2022 6 次提交
-
-
由 zyfncg 提交于
* add rsqrt yaml and unittest (#41443) * Add expand equal all yaml (#41540) * add expand, poisson * add poison grad * add expand equal_all poisson triangular solve yaml Co-authored-by: Nhong <43953930+phlrain@users.noreply.github.com>
-
由 Weilong Wu 提交于
* [Eager] Fix numpy interface for constructing empty tensor * Fix CI, construct empty tensor * Modify empty tensor's shape from [] to [0] * Add more test for constructing empty tensor
-
由 Yiqun Liu 提交于
Cherry-pick #40338 #41741 #41313
-
由 Zhanlue Yang 提交于
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures * [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode * [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode * Enabled more test cases * Fixed performance issues * Fixed minor issue
-
由 zhangkaihuo 提交于
cherry-pick the PR#41586 to realese/2.3
-
由 Siming Dai 提交于
* add eids result for graph_sample_neighbors * fix bug * move fisher_yates sample to warp * add cpu eid output * delete comment * delete comment * change nullptr placeholder * optimize sample kernel * fix mutable_data
-
- 18 4月, 2022 4 次提交
-
-
由 chentianyu03 提交于
* split reduce_kernel * rm reduce_kernel in cmake * split reduce_grad kernels * fix cmake build error * format code * fix standalone_executor_test error
-
由 Zhanlue Yang 提交于
[DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad (#41451) (#41893) * [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures
-
由 Jack Zhou 提交于
* Add core.eager.StringTensor __init__ which pyarray args can be passed * Add the numpy method of core.eager.StringTensor * revert tensor.to_string modification * Add ToPyObject for core.eager.StringTensor * Add debug string for core.eager.StringTensor * Remove place args of core.eager.StringTensor temporarily * Fix check string_tensor error * remove dtype of core.eager.StringTensor * add core.eager.StringTensor unittest * remove pstring from VarDesc * Add InitStringTensorWithStringTensor * Remove to_string modification * Remove zero_copy arg from StringTensor creator
-
由 Chen Weihang 提交于
* [Phi&CustomOp] Remove deprecated enum PlaceType for custom op & add warning (#41647) * remove old custom op placetype * replace dist placetype using * add with gpu macro * fix mutable_data error * fix set value error * add comment * remove all is initialized using (#41766) * remove inner_place using (#41768) * polish tensor depreacted method warning (#41807) * [CustomOp] Fix PlaceType related compat error (#41826) * fix place type related compat error * fix test failed * remove dll decl * revert place type change * add dll decl * resolve conflict
-
- 15 4月, 2022 4 次提交
-
-
由 zyfncg 提交于
* fix data transform problem for cudnn backend (#41622) * Fix problem of infermeta with vector output (#41646) * remove stack_grad infershape * fix bug of output with null * fix bug
-
由 zhangkaihuo 提交于
Add paddle.sparse and three Sparse API (#41276) Add Sparse API to_dense, to_sparse_coo and values (#41394)
-
由 zhiboniu 提交于
-
由 YuanRisheng 提交于
* add multi_dot,maxout,multiplex yaml * add code converage
-
- 14 4月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* add new method for custom double grad * add tanh double grad unittest * change year * revert tensor init method
-
由 Chen Weihang 提交于
* add context pool unittests * fix timeout * polish details * change option pos * add dll decl for wndows * fix pre-commit error * move dll_decl and export DeviceContext * replace lost dll_decl.h
-