- 27 Apr, 2023 1 commit
-
-
Committed by ShenLiang
add utest fix utest
-
- 26 Apr, 2023 4 commits
-
-
Committed by Chitsing KUI
* print modified flags * fix ref, opt print * fix default getter * fix ut
-
Committed by sneaxiy
-
Committed by wuhuachaocoding
Co-authored-by: gongweibao <gongweibao@baidu.com>
-
Committed by shaojie_wang
-
- 25 Apr, 2023 1 commit
-
-
Committed by Zhang Zheng
* Fix the calculation of layer_norm_bwd * fix
-
- 24 Apr, 2023 2 commits
-
-
Committed by Chitsing KUI
* save env log for each worker * fix ut
-
Committed by Chitsing KUI
Co-authored-by: tianshuo78520a <707759223@qq.com>
-
- 22 Apr, 2023 1 commit
-
-
Committed by Tian
-
- 21 Apr, 2023 2 commits
-
-
Committed by zhouweiwei2014
-
Committed by Yuang Liu
-
- 17 Apr, 2023 4 commits
-
-
Committed by sneaxiy
-
Committed by sneaxiy
-
Committed by Haohongxiang
-
Committed by sneaxiy
-
- 14 Apr, 2023 25 commits
-
-
Committed by Zhang Zheng
-
Committed by jjyaoao
* delete SupportNPU(), SupportMLU() * delete npu branch
-
Committed by cyberslack_lee
-
Committed by cyberslack_lee
-
Committed by chenxujun
* Add digamma, dirichlet tests * Fix code
-
Committed by superwinner1
* add erf FP16 test
-
Committed by duanyanhui
-
Committed by chenxujun
-
Committed by umiswing
-
Committed by risemeup1
* apply gcc12 to py3-ci * apply gcc12 to py3-ci * apply gcc12 to py3-ci * test * test * test * test * make mirror * test * test * test * test debug * test * update cuda to 12 * update cuda to 12 * update cuda to 12 * apply gcc12 to py3 * fix gcc12 problem * test * apply gcc12 to py3 * test * test * test * apply gcc12 to py3
-
Committed by YangQun
[Zero-Dim] support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussian onednn kernels (#52185) * support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussian ops * fix gaussian random mkldnn op ut
-
Committed by HongyuJia
* [Decouple enforce.h] Move LOG from enforce.h to enforce.cc * update cmake of device_context.cc, solve cuda_device_context_allocator.h compile error * add namespace inside macro
-
Committed by HongyuJia
-
Committed by Feiyu Chan
1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributes of various types; (#52408) 2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats (the differences are mainly some operators with incompatible updates in the definition); 3. program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc` to identify the version; 4. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, which decides whether to convert ProgramDesc back to a legacy format that paddle 2.4.2 or earlier versions can load and execute; 5. deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in legacy format (contain any of the operators that have been incompatibly updated and have any attribute of type `Scalar`) and converts them to the new format. If you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for the purpose of testing.
-
Committed by gouzil
* [phi] move sequence_pool kernel to phi * [phi] mv sequence_pooling to phi funcs * [phi] mv sequence_pooling_test * [phi] RollBACK `paddle/fluid/operators/sequence_ops/sequence_pool_op.cc` * [phi][funcs] fix mutable_data * [phi][funcs] fix mutable_data
-
Committed by Sonder
* add kernel functions * update kernel functions * update func parameters' name * create codes for gpu device * adjust file locations * fix include error * remove dependent files to phi/ * restore fused_attention_op.cu * fix dependence errors * fix dependence errors * fix include error * fix all dependence errors[build success] * remove useless include * recover useless include * use phi::ToNCCLDataType * fix namespace * update new register code * fix error in fused_gemm_epilogue_utils * fix error in FusedAttentionKernel param * finish fused_attention register code[build success] * add paddle::optional * add sig file * fix build error * fix an include error * restore forward code * update CMakeLists * trans Compute function to phi [build success] * add register code and fix include error [build success] * fix parameter sequence * add include file * update #if before include * update #if before include * fix grammar error * update codes for DropoutParam * remove const cast * trans some fluid api to phi api * remove const cast * trans some fluid api to phi api * add #if * update test code * update test codes * recover test codes * fix namespace and remove fluid include * recover random seed * remove fluid quant_helper * fix include error * include utils in funcs * change include file * move grad codes back to fluid folder * move grad codes back to fluid folder * fix sig file error * update include * recover codes to develop * update register codes * fix build error * recover fluid include * remove some fluid include * remove some fluid include * Update fused_attention_op.cu * remove fluid include * add some fluid include * Update fused_attention_op.cu * Update fused_attention_op.cu * Update fused_attention_op.cu * Update fused_attention_op.cu * remove useless include
-
Committed by Galaxy1458
* test,test=develop * test,test=develop * test,test=develop
-
Committed by Jiabin Yang
* add more infer var type * fix split error * fix ut * fix top_k infer vartype * fix top_k infer vartype
-
Committed by lzydev
-
Committed by sneaxiy
-
Committed by zhupengyang
-
Committed by duanyanhui
-
Committed by 骑马小猫
* support uint16 python op in d2s * convert uint16 -> bfloat16 in docstring
-
Committed by Kim Yann
-
Committed by Yiqun Liu
* Unify the static amp codes of fp16 and bf16. * Polish apis and add unittest. * Add operator stats collecting tools for program. * Add the check of number of bfloat16 operators in unittest. * Add warning for operator not supported for amp. * Add testing of BF16 O1 and O2.
-