- 11 10月, 2022 6 次提交
-
-
由 傅剑寒 提交于
* add fp16 data type for set_value * cancel flip modification * add fp16 dtype support for set_value
-
由 HongyuJia 提交于
* solve transpose2, follow #22402 * fix CI cmake * update REGISTER_OP_KERNEL of transpose2
-
由 HongyuJia 提交于
-
由 Zhen Wang 提交于
* Fix some bugs hidden in build_cinn_pass. * Update codes about OpTransInfo. * Only support for the static reshape/reshape2 op.
-
由 niuliling123 提交于
-
由 wuyefeilin 提交于
-
- 10 10月, 2022 22 次提交
-
-
由 tianshuo78520a 提交于
* Add libpaddle.so log * Add libpaddle.so log
-
由 Tony Cao 提交于
* Update README.md * Update README.md * Fix F541 by converting f-string to normal strings
-
由 YuanRisheng 提交于
* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta * WIP: move rnn kernrl to phi * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types. This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types. * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results. * fix ci bugs * fix ci bugs * fix ci bugs * modify code according comment Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
-
由 Leo Chen 提交于
* reduce time cost on atomic in interpretercore * clear code of PrepareAtomic in interpretercore * refine threadpool cache
-
由 Zhang Jun 提交于
-
由 Sylwester Fraczek 提交于
* fix fc pattern remove use_bias add residual input switch fix references to pattern * review fixes
-
由 Rayman 提交于
-
由 Sylwester Fraczek 提交于
* Add methods that find input or output name by var name * kind of bugfix - initialize variables * ci fix * review fixed
-
由 zhoutianzi666 提交于
-
由 Wangzheee 提交于
* fix embedding fused
-
由 Wang Bojun 提交于
* preln_res_bias_layernorm bugfix unroll opt * code style refine * NOLINT for codestyle
-
由 carryyu 提交于
make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches. (#46777) * make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches.
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
* delete_quantize_headerfile * delete_requantize_headerfile
-
由 HongyuJia 提交于
-
由 Paulina Gacek 提交于
* op migrated, Copy(OneDNNContext, ...) added * mutable_data & op registration in fluid removed * refactoring * OneDNNGetDataType to uppercase * missing cpu check added, handler moved to .h file * name changed to transpose_grad * Copy changed back to TensorCopy * Resizing corrected, Copy(OneDNNContext) removed
-
由 LiYuRio 提交于
-
由 feng_shuai 提交于
* fix:gather op * add ut
-
由 Rayman 提交于
-
由 Rayman 提交于
support fp16 for deformable conv
-
由 risemeup1 提交于
* it is a test * it is a test * it is a test,test=coverage * optimizing the proceess of generating ut_file_map.json * optimizing the proceess of generating ut_file_map.json * optimizing the proceess of generating ut_file_map.json * optimizing the proceess of generating ut_file_map.json
-
- 09 10月, 2022 9 次提交
-
-
由 zhangkaihuo 提交于
-
由 Wang Bojun 提交于
* fix fp16 * remove debug info * code style refine
-
由 zhangkaihuo 提交于
-
由 zmxdream 提交于
-
由 Sławomir Siwek 提交于
-
由 Sławomir Siwek 提交于
* enable hard_swish_grad unit test * remove unused argument
-
由 Haohongxiang 提交于
-
由 zhangbo9674 提交于
-
由 ronnywang 提交于
-
- 08 10月, 2022 3 次提交
-
-
由 HongyuJia 提交于
-
由 Haohongxiang 提交于
-
由 cifar10 提交于
-