- 17 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* split phi and fluid infermeta context * resolve conflict * fix type error * optimize scheduling perf * spec small vector size * replace all grad var name * fix test failed * move init defalut signature * polish details * polish details * fix no init bug * init sig for tests * add init sig for infer * fix infrt error * fix infrt failed * fix kunlun error * fix infrt failed
-
- 14 4月, 2022 5 次提交
-
-
由 Jacek Czaja 提交于
* Add UT - Added missed data_layout - Added missing conversions - NDHWC added - NDHWC support in data_transform - another fix - condddate change - fix u- fix - fix - fix - fix - fix - fix to hack - compilation fix - fix to automatic merge * - reduced UT * - fix * - lint * - fix to lint
-
由 Sławomir Siwek 提交于
* Change tensor name to match activation * declare fc_eltwise_add pass * merge conv_eltwise refactor PR * first compilable draft * unittest feedback tools * Fuse pass tester * Move IsReachable() to shared file * 100% coverage of fuse_pass_tester.cc * register pass * Add bias node * Improve unit tests / remove bias node from pattern * improve fc_eltwiseadd_unittest * cancel eltwise_add fuse if act is already fused * Add elementwise_input scale * Residual MVP * Add new FC attrs * Add more test cases * Add missing op attrs * Adapt code to new Elementwise pattern * reuse existing fcpattern * improve code style * remove unused arguments * fix typo * remove whitespace * remove int8 related code * Remove attributes from base ops * style * style check * Remove input from base op * Set attribute during fuse * ut timeout * download and test model * DRY * apply feedback from review * Style check * fix typo * cosmetic changes * explicitly set residual as output * VIT-OCR accuracy check * trigger CI * remove whitespaces * fix missing data file
-
由 Sing_chan 提交于
-
由 baoachun 提交于
* add mkldnn int8 pass [step3] * Add test for compute_propagate_scales_mkldnn_pass * update pass * update api comment and python api Co-authored-by: Nwozna <joanna.wozna@intel.com>
-
由 jakpiase 提交于
* added shuffle_channel bf16/fp32 fwd kernel * added missing files * CI fix * changed from pten to phi * tmp save * added reviewers suggestions * fix for test
-
- 13 4月, 2022 1 次提交
-
-
由 feng_shuai 提交于
* init roll convert * add ut for roll convert * roll convert don't support trt6.0 * fix: change ut for trt 7.0.0.1
-
- 12 4月, 2022 4 次提交
-
-
由 feng_shuai 提交于
* strided_slice * fix: compiler error because of size() * fix: warning * fix : warning * init input_shape * fix:forget punctuation
-
由 JingZhuangzhuang 提交于
* add python share_data interface * Update inference_api.cc * Update inference_api.cc * add python share_data interface
-
由 feng_shuai 提交于
* add trt supoort for slice op * fix:output dims bug * fix: test * fix:for c++ coverage * fix:c++ coverage * fix: fix test bug * fix: CI test
-
由 joanna.wozna.intel 提交于
-
- 07 4月, 2022 3 次提交
-
-
由 Sing_chan 提交于
* change inference demo_test build method to ninja to choose visual studio version automaticly * notest;test=windows_ci_inference * set cuda of demo_ci by arg,fix bug of ninja compile,test=document_fix;test=windows_ci;test=windows_ci_inference * fix bug;test=document_fix;test=windows_ci;test=windows_ci_inference * fix bug;test=document_fix;test=windows_ci_inference" * set lib_path according to generator
-
由 Zhang Jun 提交于
-
由 JingZhuangzhuang 提交于
* modify infer gpu memory strategy * modify infer gpu memory strategy
-
- 06 4月, 2022 2 次提交
-
-
由 feng_shuai 提交于
-
由 Allen Guo 提交于
* remove paddle_ipu shared library * fix unique_name
-
- 05 4月, 2022 1 次提交
-
-
由 wangxinxin08 提交于
* add fake index and unittest for multiclass_nms3 trt * modify unittest
-
- 02 4月, 2022 2 次提交
-
-
由 Wangzheee 提交于
* paddle inference support new quant_model
-
由 wangxinxin08 提交于
* filter unsupported inputs for elementwise op in op teller * add unittest for corner case
-
- 01 4月, 2022 3 次提交
-
-
由 xiaoxiaohehe001 提交于
fix_reshape: for paddle-trt
-
由 wanghuancoder 提交于
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
由 Jacek Czaja 提交于
-
- 31 3月, 2022 4 次提交
-
-
由 wangxinxin08 提交于
* add multiclass_nms3 converter
-
由 TeslaZhao 提交于
* Pass compat of conv_transpose_bias_mkldnn_fuse_pass * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of transpose op, about accessing memory out of bounds of the perm param * op:transpose_op supports bool type * op:transpose_op supports bool type * Keep strided_slice op behavior consistent with slice op when starts input is less than -rank * Using DistConfig in inference
-
由 heliqi 提交于
* add flatten2,reshape2,squueze2_trt_fuse_pass test cast * add flatten2,reshape2,squueze2_trt_fuse_pass test cast * add flatten2,reshape2,squueze2_trt_fuse_pass test cast
-
由 wenbin 提交于
-
- 30 3月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
-
由 heliqi 提交于
-
- 29 3月, 2022 1 次提交
-
-
由 wangxinxin08 提交于
* add elementwise sub and elementwise div in tensorrt op teller * add unittest of elementwise mul, sub and div
-
- 24 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
- 21 3月, 2022 1 次提交
-
-
由 From00 提交于
* Move conv-transpose OPs to phi * Fix CI errors * Fix CI errors
-
- 18 3月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 17 3月, 2022 5 次提交
-
-
由 heliqi 提交于
* add onnxruntime predictor * Add code comments * support link paddle2onnx onnxruntime * support onnxruntime with python * support onnxruntime with python * support onnxruntime with windows * paddle2onnx compile with windows * supoort windows compile * supoort windows compile with onnxruntime * supoort windows compile with paddle2onnx * supoort mac compile * compile with mac * compile with mac * add code comments * fix remind word * code optimization * add test case * add test case * add inference demo_ci test case * fix compile paddle2onnx with no python * add inference demo_ci test case * add inference demo_ci test case * add inference infer_ut test case * support c go api and test cases * add converage test case * add converage test case * add capi test case * add capi test case * fix onnxruntime copyfromcpu and copytocpu * fix goapi * modify code
-
由 hong 提交于
* update * fix bugs; test=develop * update; test=develop * fix test compile error; test=develop * fix cpu compile error; test=develop * fix test error; test=develo * fix layer_norm_op plugin error; test=develop * fix error; test=develop * fix test bug; test=develop * update; test=develop * polish code; test=develop * fix bugs; test=develop * remove unused depency; test=develop * polish code; test=develop
-
由 YuanRisheng 提交于
-
由 Yuang Liu 提交于
-
由 baoachun 提交于
-
- 15 3月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move activation op * adjust code format * fix compile bugs * fix ci bugs * code format adjust * code format adjust2 * activate ci status * modify according to comment * move activation kernel * revert relu6 * reduce add code * perfect use_phi_functor * completing func name * fix bugs when run ci * fix bugs when run infr * modifpy infrt get kernel signature
-
- 14 3月, 2022 2 次提交
-
-
由 Tomasz Socha 提交于
* Add elementwise add and activation fuse pass * Fix copy ellision * More flexible pattern detector * More flexible fusion pass * Update lists for pass * Add support for Pow operator * Add support for more activation types * Style * Rename fusion pass * First version of tests * Dirty version of pass * Polished version * Update pbtxt * Style * Update names * Style * Use PADDLE_ENFORCE_EQ * Save error message to variable * WO for error checks * CR * Static style check * Add missing 'activation_scale' attribute * Add relu6 and sigmoid activations * Style * Fix fuse list formating * Sync filenames for fuse pass files * Fix cmake after move * Fix registration * Fix pass name in tests * Add missing activations to checker * WIPS * Working mul op * Working sub * Working Add * Remove pten includes * Remove some forward declarations * Remove Includes * Fixes * Remove default kernels * Add check if post_ops attributes are avaliable * Style * Code adjustment * Register default kernels * We have year 2022 not 2021... Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com> * Fast review fixes Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com> * Review Fix * Rename one_dnn -> onednn * Style after review * Fast and dirty fix for quantization * Update tests * Style * Fix mkldnn_quantizer config * Add Joanna's suggestion. * Check if operator is explicitly disables on OneDNN * Try to use unregistered attributes * Style * Test new framework * FXI * FXII * Update test * Style Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
-
由 From00 提交于
* Move Pool OPs to phi * Fix CI error * Fix conflicts
-