- 04 2月, 2023 1 次提交
-
-
由 Huihuang Zheng 提交于
As the title
-
- 03 2月, 2023 8 次提交
-
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
由 Paulina Gacek 提交于
* conv_bias_mkldnn_fuse_pass_tester rewritten * conv_concat_relu_mkldnn_fuse_pass_tester rewritten * conv_elementwise_add_fuse_pass_tester rewritten * mkldnn changed to onednn * tests added to cmakeLists, style fix * got rid of unnecessary UT, some style changes * changes in naming convention * max_examples reduced * time out added
-
由 RedContritio 提交于
-
由 HappyHeavyRain 提交于
* generate some static graph ops * fix the bug of pow * add REGISTER_ACTIVATION_OP in operators.cmake * modify the file operators.cmake
-
由 Yuang Liu 提交于
-
由 Ruibiao Chen 提交于
* Reduce time cost of BuildOpHappensBefore * Update code * Update code * Improve data struct
-
由 Jiabin Yang 提交于
* optimize log * fix type error * fix type error2
-
由 Jiabin Yang 提交于
* refactor dir for prim * support blacklist for bwd comp * fix type error * remove additional file * fix git ignore * add more test * merge develop
-
- 02 2月, 2023 3 次提交
-
-
由 Xiaoxu Chen 提交于
【PRIM】Support use operator's output metadata info in constructing static backward composite (#50043) * [prim] support custom target_gradients * support infershape after append one gradop * [prim] add simple net test * fix test_loop segment fault bug * [prim] fix infer shape segment fault bug when output of grad_op_desc is empty
-
由 YuanRisheng 提交于
* fix bugs * fix ci bugs
-
由 Hui Zhang 提交于
* jit layer support multi thread
-
- 01 2月, 2023 7 次提交
-
-
由 Yuang Liu 提交于
-
由 RedContritio 提交于
* add stride check for MaxPool * add unittests
-
由 Wang Bojun 提交于
* preln_residual 2 fused_bias_residual * skip layernorm fix and ut * code refine * code style refine * fix ut * fix output * add trt layer fall back info * refine op teller and ut * DropoutMaskOut output fix
-
由 Hui Zhang 提交于
* jit layer support multi thread * fix bug * clone prediector not do graph optimizer * format * fix comment and format * fix override and fromat * fix * fix
-
由 zhoutianzi666 提交于
-
由 LiYuRio 提交于
-
由 Aurelius84 提交于
* [PrimCinn]Fix some vars are wrongly gc in CINN+InterpreterCore * fix baseline unittest config * fix code style
-
- 31 1月, 2023 15 次提交
-
-
由 wenbin 提交于
* gn_silu * add ut * set TIMEOUT * correct comments * comments * disable windows ut * rename parameter
-
由 wenbin 提交于
* disable integer * disable integer * add cast layer
-
由 Zhang Jun 提交于
-
由 niuliling123 提交于
-
由 Charles-hit 提交于
* polish static grad op maker gen * fix some bugs * fix static code gen * solve conflict * modify composite grad maker name * integrate phi and fluid info in static code gen * rename some composite maker * modify static code gen format
-
由 Zhang Jun 提交于
-
由 PuQing 提交于
* add FP16 dtype for CastNumpy2Scalar * fix throw message * add test * fix SyntaxWarning * test skip for float16 * fix dtype mistakes
-
由 ronnywang 提交于
* [CustomDevice] add custom device api * update * update * test=document_fix * update * update * add examples
-
由 Roc 提交于
-
由 Yuanle Liu 提交于
-
由 TeFeng Chen 提交于
* support inplaced variable in cinn_launch * fix error hint when compiling * fix inplaced output variable of the subgraph * skip CinnCompiler check * using existed definition * fix namespace reference error * modify error message * update cinn tage * fix namespace * skip enforce check * fix unittest attribute throw
-
由 pangyoki 提交于
-
由 HongyuJia 提交于
* decouple phi custom_op * decouple phi custom_op, remove codes * delete custom symbol of inference
-
由 LiYuRio 提交于
-
由 姜永久 提交于
* rm flags_retain grad in pybind * retain grads for xpu test * set retain grad for xpu * rm flag * lint --------- Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com>
-
- 30 1月, 2023 5 次提交
-
-
由 jiangcheng 提交于
-
由 engineer1109 提交于
replace all TensorFromVector & TensorToVector AssignKernel async copy
-
由 Ruibiao Chen 提交于
* Support stream priority for standalone executor * Fix compile error * Fix compile error * Fix compile error * Fix compile error * Fix compile error
-
由 zmxdream 提交于
* add set slot_num for psgpuwraper (#177) * add set slot_num_for_pull_feature for psgpuwarper * Add get_epoch_finish python interface (#182) * add get_epoch_finish interface * add return * delete return * add unzip op (#183) * fix miss key for error dataset (#186) * fix miss key for error dataset * fix miss key for error dataset Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * add excluded_train_pair and infer_node_type (#187) * support return of degree (#188) * fix task stuck in barrier (#189) Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * check node/feature format when loading (#190) * check node&feature format when loading * check node&feature format when loading (2£ (2) * degrade log (#191) * [PGLBOX]fix conflict * [PGLBOX]fix conflict * [PGLBOX]replace LodTensor with phi::DenseTensor * [PGLBOX]fix gpu_primitives.h include path * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip ut * [PGLBOX]fix unzip ut * [PGLBOX]fix code style * [PGLBOX]fix code style * [PGLBOX]fix code style * fix code style * fix code style * fix unzip ut * fix unzip ut * fix unzip ut * fix unzip * fix code stype * add ut * add c++ ut & fix train_mode_ set * fix load into memory * fix c++ ut * fix c++ ut * fix c++ ut * fix c++ ut * fix code style * fix collective * fix unzip_op.cc * fix barrier * fix code style * fix barrier * fix barrier * fix code styple * fix unzip * add unzip.py * add unzip.py * fix unzip.py --------- Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: NSiming Dai <908660116@qq.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
由 gem5 提交于
-
- 29 1月, 2023 1 次提交
-
-
由 jiangcheng 提交于
-