- 18 5月, 2023 4 次提交
-
-
由 Galaxy1458 提交于
-
由 Hulek 提交于
* Fused elementwises kernels and ops * change fuse pass name * adjust .pbtxt files * adjust quantization attributes * add missing arguments and fix others, review fixed * simplify fused kernel registration * fix elementwise unit tests * reuse one fused elementwise op * adjust proto * Add supported datatypes * Change 'Scale' to 'scale' in tests, change some tests to onednn * Revert breaking changes * Fix unit tests * Delete obsolete test cases * Delete commented out code * Fix codestyle * delete temporary condition * fix conflicts and delete duplicate fusing * Fix code after merge * Move tests to new directory * fix tests volatility * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py * Update CMakeLists.txt add mkldnn op test --------- Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 huangjiyi 提交于
-
由 RedContritio 提交于
* simplify layer_norm_op.cc * support auto generate for op layer_norm * update unittest for composite_layer_norm * remove layer_norm_op.cc from scripts * replace layer_norm_op with generated_op * add get_expected_kernel for layer_norm * update cmake kernel register function for layer_norm_mkldnn_op
-
- 16 5月, 2023 6 次提交
-
-
由 huangjiyi 提交于
* update * fix bug * test * test * update * update mutable_data * fix bug * update * fix bug * update output type reg * update * update
-
由 Sonder 提交于
* trans fused batch norm Compute function * trans batch norm register info to phi * trans fused batch norm grad Compute * trans batch norm grad register info * add sig file * update sig file * Update fused_bn_activation_kernel.cu * Update fused_bn_activation_grad_kernel.cu * fix * Rename fused_bn_activation_kernel_grad.cu to fused_bn_activation_kernel.cu * fix * fix * fix CudnnDataType error * fix * fix include * update * add #if * add fused bn act to cmakelist.txt * update cmakelist * fix #ifdef error * add timeout set * add env set * fix * fix * Update fused_bn_activation_sig.cc
-
由 Wang Xin 提交于
* static graph autogen code support for softmax op * bug fixed * fix PR-CI-Windows error * fix CI error * bug fixed * fix conflicts
-
由 cyberslack_lee 提交于
-
由 张春乔 提交于
* mv InstanceNorm * modify op_version.yaml * modify add Operator:: in get_expected_kernel_func.cc * rm gradexpectedkernel * add extra * add float epsilon=1e-5
-
由 gouzil 提交于
* [phi]mv StftKernel to phi * [phi] fix KernelSignature * [phi]fix arr error * [phi] Disable check_dygraph * [phi]fix include * [phi] rewrite mutable_data, add output register * [phi] fix Alloc * [phi] fix Alloc again * [phi] fix mutable_data * [phi] fix onesided_out Resize
-
- 15 5月, 2023 1 次提交
-
-
由 huangjiyi 提交于
* update * fix bug * fix output type def
-
- 12 5月, 2023 2 次提交
- 10 5月, 2023 2 次提交
- 09 5月, 2023 2 次提交
- 06 5月, 2023 1 次提交
-
-
由 zhangyuqin1998 提交于
* Rename randint_raw and move it to legacy * Update fetch_v2_op.cc * Update randint_kernel.cc * Update randint_kernel.cu * Empty Commit to setup deployments
-
- 28 4月, 2023 2 次提交
- 27 4月, 2023 3 次提交
-
-
由 gouzil 提交于
* [phi] move sequence_pool kernel to phi * mv kernels impl * fix parameter error * clean include * fix compat filename * [phi] move fluid sequence_pool_grad to phi * [phi][compat] sig rm GradVarName * [phi] fix sequence_pool out type * [phi] rm impl, add const string * [phi] fix const str * fix sequence_pooling cmake * [phi] mv sequence_pooling_test * [phi] fix grad sig * [phi] fix sequence_pool is_test error * [phi] fix sequence_pooling gpu include * [phi] mv to impl * [phi] fix SequencePoolFunctor cu include * [phi] modify out max_index int32_t * [phi] add pooltype mapping determine * [phi] fix sequence_pool_sig * [phi] fix sequence_pool_sig sum * [phi] try ci * [phi] fix max_index optional
-
由 gouzil 提交于
* [static op generation] triangular_solve * [phi] mv triangular_solve_grad to static_backward * [phi] fix import * [phi] mv to ops.yaml、 backward.yaml * fix forward attr * [phi] fix triangular_solve_grad args
-
由 Wang Xin 提交于
-
- 26 4月, 2023 1 次提交
-
-
由 Wang Xin 提交于
-
- 25 4月, 2023 1 次提交
-
-
由 lzydev 提交于
* support register single .cu file * add register GPU kernel function
-
- 24 4月, 2023 2 次提交
-
-
由 Sonder 提交于
* add sig file * trans fused feedforward compute function to phi * remove fluid include * delete old register info * fix build error * trans fused feedforward grad xpu to phi
-
由 LoneRanger 提交于
* support the 'generate_proposals' for generating static graph ops * fix bug * fix bug * Update ops.yaml fix bug * remove the generate_proposals_v2_op.cc from CMakeLists.txt * fix bug * fix bug
-
- 23 4月, 2023 2 次提交
-
-
由 zhangyuqin1998 提交于
* delete overwrite from gather_grad * fix * Update gather_grad_kernel.cc
-
由 zhangyuqin1998 提交于
* remove axis from elementwise_grad * Update elementwise_sig.cc
-
- 21 4月, 2023 1 次提交
-
-
由 RedContritio 提交于
* support auto generate for op adam optimizer * remove unnecessary files * support auto generate for op adamw optimizer * support auto generate for op merged_adam optimizer * use manual_signature in adam_
-
- 19 4月, 2023 2 次提交
-
-
由 Wang Xin 提交于
* add autogen code support for mean_all op * bug fixed * bug fixed * bug fixed
-
由 zhangyuqin1998 提交于
* fix graph_reindex * fix * Update op_compat.yaml
-
- 18 4月, 2023 4 次提交
-
-
由 zhangyuqin1998 提交于
* reorder MatrixRank * fix * fix * fix * fix * fix
-
由 Zhenghai Zhang 提交于
* add autogen code support for rnn op * fix bug * fix bug
-
由 LoneRanger 提交于
* add autogen code support for lu * fix bug * fix bug * fix bug * fix bug
-
由 zhangyuqin1998 提交于
* reorder_prior_box * fix
-
- 17 4月, 2023 3 次提交
-
-
由 Sanbu 提交于
-
由 lzydev 提交于
* fix bug in parse args * fix bug * recover legacy_*.yaml * change 'Out' to Output
-
由 LoneRanger 提交于
-
- 14 4月, 2023 1 次提交
-
-
由 Feiyu Chan 提交于
1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributs of various types; (#52408) 2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats(the differences are mainly some operators with incompatible updates in the definition); 3. program version and operator version map now are always saved when serializing `paddle::framework::ProgramDesc` to identify the version; 3. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, it decided whether to convert ProgramDesc back to a legacy format, which is compatible for paddle 2.4.2 or earlier versions to load and execute; 4. deserialization of `paddle::framework::ProgramDesc` is now automatically detecting whether the bytes it receives is in legacy format(contains any of the operators that has been incompatibly updated and have any attribute of type `Scalar`) and convert it to new format. But if you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for the purpose of testing.
-