- 06 8月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 12 6月, 2021 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Small changes related to BF16 fusion_gru and fusion_lstm * Correct to pass arg by value * Add conditions to rnn op * Correct the spelling mistake * Improving the test with checking activation * Trigger CI
-
- 03 6月, 2021 1 次提交
-
-
由 王明冬 提交于
-
- 26 5月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
* optimize OP's compilation time, test=develop * add more op and run ci test, test=develop * CUDA Kernel register in cc file, test=develop * fix macros, test=develop * fix undefined symbol error, test=develop * fix compilation error and undefined symbol, test=develop * fix compilation error on Windows, test=develop * fix compilation error on Windows, test=develop
-
- 25 5月, 2021 1 次提交
-
-
由 王明冬 提交于
-
- 21 5月, 2021 1 次提交
-
-
由 王明冬 提交于
-
- 28 4月, 2021 1 次提交
-
-
由 denglin-github 提交于
* Add dlnne engine runtime * Fix log * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format * Fix CMakeList format error * Add copyright message * Fix dlnne CMakeList.txt * Add some paddlepaddle_pass to support more networks * Fix some format bug * Add delete dropout_op pass * Fix some format bug * Fix format bug
-
- 22 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 03 2月, 2021 1 次提交
-
-
由 Adam Osewski 提交于
-
- 16 1月, 2021 1 次提交
-
-
由 Adam Osewski 提交于
* Move pass tester helper functions to single common place. * Use helper functions in two more fuse pass tests.
-
- 13 1月, 2021 1 次提交
-
-
由 alncat 提交于
* added support for inference using qunatization aware trained dygraph * added support for inference using qunatization aware trained dygraph correct boost get usage * Delete incorrect warning message (#30196) * fix warning and no grad * clean redundant API alias in 2.0 - part 2 (#30013) * delete paddle.nn.functional.assign * fix dynamic to static error * just add the op error message for the matmul xpu (#30246) add the op error message for the matmul xpu * Add Static Variable Clone (#30208) Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat * use wget to replace curl to download the lcov file (#30229) * use wget to replace curl to download the lcov file * add cache for lcov * fix test_pool3d_op timeout issue (#30248) * Fix unittests bugs. (#30250) * modify error message based on comments (#30189) * modify error message based on comments * edit code according to review. * Correct spelling according to review. * Fix bug for 'save mutiple method' (#30218) * Fix bug for 'save mutiple method' * To pass coverage. * edit code to pass coverage. * edit code to pass coverage. * add unittest for coverage. * change for coverage. * edit for coverage. * added support for inference using qunatization aware trained dygraph * Alias from paddle.fluid.layers.auc to paddle.static.auc (#30206) * add alias from fluid.layers.auc to static.auc * Update __init__.py * added support for inference using qunatization aware trained dygraph correct boost get usage * corrected boost get usage * corrected naming issues and enforcing zero check * correct paddle enforce message * added more error checkings * corrected error report message and optimized code * corrected findvar usage * corrected paddle_enforce in scope * correct error messages * correct error reporting format Co-authored-by: NLielinJiang <50691816+LielinJiang@users.noreply.github.com> Co-authored-by: NXiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com> Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com> Co-authored-by: NHuihuang Zheng <zhhsplendid@gmail.com> Co-authored-by: NYUNSHEN XIE <1084314248@qq.com> Co-authored-by: NBai Yifan <me@ethanbai.com> Co-authored-by: Ngongweibao <weibao.gong@gmail.com> Co-authored-by: NWeiXin <weixin10@baidu.com> Co-authored-by: NJiaqi Liu <liujiaqi06@baidu.com>
-
- 31 12月, 2020 1 次提交
-
-
由 cc 提交于
* Add mkldnn nearest_interp and bilinear_interp op * don't run mkldnn interpolate in default * add interpolate_mkldnn_pass
-
- 29 12月, 2020 1 次提交
-
-
由 cc 提交于
* map matmul/squeeze2+matmul/reshape2+matmul to mul
-
- 28 12月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 24 12月, 2020 1 次提交
-
-
由 jakpiase 提交于
-
- 23 12月, 2020 1 次提交
-
-
由 YUNSHEN XIE 提交于
* remove duplicate ut reload * remove duplicate ut define in cmakelist
-
- 25 11月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
* Add multi_gru_fuse_pass and tests * fix date * cleaned up headers
-
- 24 11月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
* Add multi_gru_seq_fuse_pass and tests * fix date * removed unused functions
-
- 04 11月, 2020 1 次提交
-
-
由 石晓伟 提交于
* enhance the op_version_registry, test=develop * add unittests, test=develop * enhance the op_version_registry, test=develop * fix bugs, test=develop * revert pybind_boost_headers.h, test=develop * fix a attribute bug, test=develop
-
- 27 10月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* add fuse_bn_add_act pass
-
- 26 10月, 2020 1 次提交
-
-
由 Adam Osewski 提交于
-
- 14 9月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 29 7月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 21 6月, 2020 1 次提交
-
-
由 Shibo Tao 提交于
* don't re-generate header file if content doesn't change. test=develop * add copy_if_different function. test=develop
-
- 09 6月, 2020 1 次提交
-
-
由 Sylwester Fraczek 提交于
* remove gmock from ut test=develop * coverage enabled for r+t+m fuse pass test=develop
-
- 03 6月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 08 5月, 2020 1 次提交
-
-
由 Huihuang Zheng 提交于
1. To make ProgramTranslator to support `assert` grammar, this PR adds `assert` python API and C++ code. 2. Fix a bug: graph_pattern_detector.h #include <gtest/gtest_prod.h> but didn't declared dependency at CMakeLists, which can cause single build failure. 3. Refactoring `Formatter` in print_op to make it reusable and reuse the formatter to print in assert op.
-
- 28 4月, 2020 1 次提交
-
-
由 Sylwester Fraczek 提交于
-
- 24 4月, 2020 2 次提交
-
-
由 Jacek Czaja 提交于
-
由 arlesniak 提交于
-
- 22 4月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 13 4月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 09 4月, 2020 1 次提交
-
-
由 mozga-intel 提交于
* Remove the NGraph engine from PDPD repository 1. Each operator was removed from the operator's directory 2. Each test was removed from the unittest directory 3. The parallel executor support was removed from the PDPD 4. The CMake file was removed from the PDPD 5. The NG flags were removed from the repository test=develop * Remove ngraph from: 1. Cmake file 2. Python file test=develop
-
- 01 4月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 27 3月, 2020 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 20 3月, 2020 1 次提交
-
-
由 Wilber 提交于
update embedding_eltwise_layernorm fuse pass and fused kernel, to support multi input
-
- 11 3月, 2020 2 次提交
-
-
由 Wilber 提交于
* add skip_layernorm pass. test=develop
-
由 Zhaolong Xing 提交于
* 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop
-
- 14 2月, 2020 1 次提交
-
-
由 Wilber 提交于
当一个模型中有多个fc_lstm子图的时候,且其中fc共用了同一个persistable的bias,此时不应该将bias节点删除,只将非persistable的节点去除即可。
-
- 07 2月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
* Add the first implememtation of fusion_group op #19621 (#3) * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc. test=develop * Call CUDA driver api to launch the kernel compiled by nvrtc. test=develop * Disable for mac and windows. test=develop * Refine the codes to support manually specified num_threads and workload_per_thread. test=develop * Refine the CUDA kernel to support large dims. test=develop * Add DeviceCodePool to manage all device codes. * Add the first implementation fusion_group op. * Add unit-test for fusion_group op. * Add the check of result. * Add the check of nvrtc in unit-test. test=develop * Add comment to explain the inputs, outputs and features of fusion_group op. test=develop * Disable fusion_group op for mac and windows. test=develop * Make the compiling of device code return status instead of hanging up. test=develop * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API. * Unify fusion_group_op's input and output names. test=develop * Add the check of CUDA driver library in unittest. test=develop * Enable generating code for a given subgraph. #21126 (#4) * Enable generating code for a given subgraph. * Support sorting the subgraph. * Remove the rearange of expressions because we use the sorted subgraph directly. * Enable generating code for a subgraph which is composed of grad ops. * Use expression information to check the accuracy in unittest. * Separate load and store from computation expressions. test=develop * Improve the loading statements in generated codes. test=develop * Remove unused arguments from formal list. test=develop * Enable the detection of subgraph of grad ops. * Generate code for detected subgraph in fusion_group_pass. * Add an option in BuildStrategy to enable fusion_group_pass and add unittest. test=develop * Fix a bug when checking whether the shape of all inputs are the same. * Add debug information. * Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#5) test=develop * Call subgraph_detector in fusion_group pass. test=develop * Disable fusion_group when WITH_GPU is OFF. test=develop * Refine all PADDLE_ENFORCE message. test=develop * Fix the case that some inputs are not defined in grad ops, and set op_role for fused op. test=develop * Follow review comments. test=develop
-