- 15 Aug, 2022 1 commit
-
-
Committed by Yuanle Liu
-
- 14 Aug, 2022 1 commit
-
-
Committed by xiaoxiaohehe001
This reverts commit 84bf5c31.
-
- 10 Aug, 2022 1 commit
-
-
Committed by xiaoxiaohehe001
* cuda_graph * cuda_graph_ * cuda_graph_ * cuda_graph_
-
- 05 Aug, 2022 2 commits
-
-
Committed by Sławomir Siwek
* remove v2_transpose_reshape * matmul_transpose_reshape * reshape_transpose_matmul * restore ut * adjust old ut * restore parallel UT rules * feedback from review
-
Committed by Zhang Jun
* update trt workspace size param * update * update * update * use int64_t * use int64_t * update * update
-
- 04 Aug, 2022 3 commits
-
-
Committed by Sławomir Siwek
* Add unit tests * matmul_v2 + activation * matmuls + elementwise_add * matmul_v2 postops * transform matmul to v2 * opcompat * fix fusing matmul with multiple outs * add shape constraints * remove unused vars * change pass order * - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint * add alpha constraint * merge matmul refactor * trigger CI * - fix * - another fix * code style * add support for matmul+elementwise_add+activation * code style * fix bfloat16 bugs * change append_binary to append_sum Co-authored-by: Jacek Czaja <jacek.czaja@intel.com>
-
Committed by zhoutianzi666
* add rnn
-
Committed by Wilber
* convert support multi block. * update
-
- 02 Aug, 2022 1 commit
-
-
Committed by Wilber
* multihead matmul add fp16 * fix windows error * fix rocm error * fix rocm error
-
- 01 Aug, 2022 3 commits
- 29 Jul, 2022 1 commit
-
-
Committed by ming1753
* fused_fc_elementwise_layernorm support fp16 * fused_fc_elementwise_layernorm support double
-
- 28 Jul, 2022 1 commit
-
-
Committed by heliqi
-
- 26 Jul, 2022 1 commit
-
-
Committed by Wilber
* multi stream support handle lazy init. * support eigen lazy init * update * fix ci problem
-
- 22 Jul, 2022 1 commit
-
-
Committed by Wilber
-
- 21 Jul, 2022 2 commits
-
-
Committed by ming1753
* fc support fp16 * add a ‘,’ on paddle_pass_builder.cc * fc support fp16 on non-cuda.
-
Committed by xiaoxiaohehe001
* convfusionfp16 * convfusionfp16 * convfusionfp16
-
- 19 Jul, 2022 3 commits
-
-
Committed by Ruibiao Chen
* Rename BOOST_GET macros * Fix conflicts
-
Committed by zhoutianzi666
* shape sum * add shape, sum trt layer
-
Committed by Wilber
-
- 18 Jul, 2022 1 commit
-
-
Committed by zhoutianzi666
* reshape fill_constant * commit * commit
-
- 15 Jul, 2022 1 commit
-
-
Committed by RichardWooSJTU
* add fused token prune op and plugin
-
- 13 Jul, 2022 1 commit
-
-
Committed by ronnywang
-
- 12 Jul, 2022 1 commit
-
-
Committed by Sławomir Siwek
* add method for post ops * format code * gpd * format style * add matmul+act test * implement matmul+activation * whitespaces * code style * python code format * Increase UT timeout * code format * update style * generalize activation fuse passes * change order * Unify activation GPD * Revert changes with op_act * remove softmax mkldnn attrs * set common name for act attributes * whitespace * append postops by helper function * ut style * revert changes related to quantization * Reduce redundancy * reduce number of parameters * trigger CI * validate attribute * trim unit test
-
- 11 Jul, 2022 2 commits
-
-
Committed by Zuza Gawrysiak
* Quantize shape operator * Add shape op to propagate scales pass
-
Committed by heliqi
* add ort clone interface * paddle2onnx update to 1.0.0rc * ort input_tensor use mutable data of scope
-
- 08 Jul, 2022 1 commit
-
-
Committed by Wilber
-
- 07 Jul, 2022 1 commit
-
-
Committed by Sing_chan
* copy onnxruntime.dll to c++ test folder in windows * remove ut that failed due to onnxruntime.dll * test_api_impl failed of diff * use TARGET to make sure if the test exist; use POST_BUILD to add copy command
-
- 30 Jun, 2022 1 commit
-
-
Committed by JingZhuangzhuang
* modify graph_pattern to thread_local * modify graph_pattern to thread_local
-
- 29 Jun, 2022 2 commits
- 28 Jun, 2022 2 commits
-
-
Committed by Tomasz Socha
* Remove output arguments from functions. Replace pointers with references * Name used bool flags * Reorder functions * Enable bfloat16 data type * Give declarations some space * Style * Style
-
Committed by 石晓伟
-
- 26 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 24 Jun, 2022 1 commit
-
-
Committed by Wilber
* revert 40531 * update
-
- 23 Jun, 2022 2 commits
-
-
Committed by ccrrong
* add cast trt converter
-
Committed by Sylwester Fraczek
* sylwek prototype params to int8 pass * trying to make warmup work * wip * wip * change test to cpp test * review fixes, refactoring * more refactoring * add erasevars * change test to fixture * rename pass and reorder erasevars and graphsaferemovenodes * fix * more refactoring and fixed bug * formatting * remove scale count * enforce message too short * remove erasevars erasevars could be cause of memory issues some other fixes * add count of successful fuses to name of new nodes * FindVar -> GetVar and use ConvResidual pattern * use tensor->clear() instead of new variable * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/inference/tests/api/analyzer_lexical_analysis_gru_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * add log (review fix)c * review fix (2 functions to one) * code review: Conv->QuantizeConv * revert * fix formatting * remove unused functions * add paddle enforce Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
-
- 22 Jun, 2022 2 commits