- 06 May, 2023 4 commits
-
-
Committed by Yuang Liu
* use int64 to calc dim for c softmax * fix complie bug
-
Committed by zhenhailiu
* polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish
-
Committed by Zhang Jun
-
Committed by Wilber
* Add trt pow converter. * update to use AddConstantLayer * add dims=0 ut
-
- 05 May, 2023 10 commits
-
-
Committed by Galaxy1458
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
Committed by shentanyue
-
Committed by sprouteer
-
Committed by haosicheng
-
Committed by xiaoguoguo626807
* modify concat_grad add sum comp rule * modify cast
-
Committed by xiaoguoguo626807
* modify concat_grad add sum comp rule * cast and by_pass modify * only modify by_pass * modify by_pass
-
Committed by gouzil
* [test]mv fluid *test* to test/cpp/fluid * [phi] fix link error
-
Committed by gouzil
* [test]mv fluid op pscore to test/cpp/fluid/pscore * [test]add -faligned-new * [test] fix brpc link
-
Committed by gouzil
-
Committed by gouzil
-
- 04 May, 2023 4 commits
-
-
Committed by gouzil
-
Committed by weishengying
-
Committed by gouzil
* [test]mv fluid reader to test/ * [test]mv fluid op prim_ops to test/cpp/fluid/prim_ops * [test]mv fluid op nccl to /test/cpp/fluid/nccl/ * [test]mv fluid op reduce_ops to test/cpp/fluid/reduce_ops * [test]mv fluid op lite to test/cpp/fluid/lite * [test]fix lite * [test]fix prim op path * [fluid]clean prim ops cmakelists
-
Committed by Yuanle Liu
-
- 30 Apr, 2023 1 commit
-
-
Committed by zhouweiwei2014
-
- 29 Apr, 2023 1 commit
-
-
Committed by gouzil
* [tests]mv fluid benchmark to tests * [test]Add placeholder * [test]Add placeholder
-
- 28 Apr, 2023 9 commits
-
-
Committed by HongyuJia
-
Committed by Bo Zhang
* change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * clean ElementwiseT and InT for BroadcastKernel * default axis and clean inT * remove redundant fast divmod computation * optimize drop_nd & drop_nd_grad * optimize BroadcastDataLoader bf16 fp16 * rm InT etc. after merge develop * delete constexpr for windows ci * fix conflict * fix conflic with develop * fix conflic * new clean * clean
-
Committed by gouzil
-
Committed by huangjiyi
* update * fix bug * support parsing fixed kernel data_type * update op_compat * update
-
Committed by Sanbu
-
Committed by jjyaoao
-
Committed by Zhang Jun
* trt support 0 dim * trt support 0 dim * update activation ut
-
Committed by sneaxiy
-
Committed by xiaoguoguo626807
* add mul doubel grad * add sub_double_grad * add add sub high test * add mutiply test * modify other unsqueeze * delete api.yaml * only for make ci run * midify unsqueeze * modify unsqueeze * tmp * modify operants gen
-
- 27 Apr, 2023 11 commits
-
-
Committed by gouzil
* [phi] move sequence_pool kernel to phi * mv kernels impl * fix parameter error * clean include * fix compat filename * [phi] move fluid sequence_pool_grad to phi * [phi][compat] sig rm GradVarName * [phi] fix sequence_pool out type * [phi] rm impl, add const string * [phi] fix const str * fix sequence_pooling cmake * [phi] mv sequence_pooling_test * [phi] fix grad sig * [phi] fix sequence_pool is_test error * [phi] fix sequence_pooling gpu include * [phi] mv to impl * [phi] fix SequencePoolFunctor cu include * [phi] modify out max_index int32_t * [phi] add pooltype mapping determine * [phi] fix sequence_pool_sig * [phi] fix sequence_pool_sig sum * [phi] try ci * [phi] fix max_index optional
-
Committed by Yuanle Liu
-
Committed by zhupengyang
-
Committed by WangZhen
[Dy2St]Get grad names when call append backward to fix high order gradient (#53250)
-
Committed by wuhuachaocoding
-
Committed by houj04
-
Committed by gouzil
* [static op generation] triangular_solve * [phi] mv triangular_solve_grad to static_backward * [phi] fix import * [phi] mv to ops.yaml, backward.yaml * fix forward attr * [phi] fix triangular_solve_grad args
-
Committed by HongyuJia
* [CINN Support 0D-Tensor] CINN supports 0D-Tensor with trick temporarily * Add unittest
-
Committed by Wang Xin
-
Committed by Sonder
* trans fused_feedward Compute function to phi * add register info * remove maxfunctor * move fused feedward to phi * remove sig file * remove fliud include * add include * add include * add sig file * add output register info * fix sig file * Update fused_feedforward_sig.cc * fix grad kernel * update output register info * fix * open fused_feedforward static build * add optional and fix code style * fix output info for fused attention * add optional param * merge
-
Committed by Zhang Ting
* support OD level and skip dynamic loss scaling for bf16
-