- 20 10月, 2021 2 次提交
-
-
由 Huihuang Zheng 提交于
Add CINN compile option in CMake. Now you can use CINN in Paddle by `-DWITH_CINN=ON` when `cmake` To test it, you can run `make cinn_lib_test -j` and `ctest -R cinn_lib_test`. Note: 1. You should set ``` export runtime_include_dir=${CINN_SOURCE_DIR}/cinn/runtime/cuda ``` When run test, the `${CINN_SOURCE_DIR}` should be set based on your CINN directory. 2. CINN is under developing now, you may have to change `CINN_GIT_TAG` to the git commit you need.
-
由 Aurelius84 提交于
-
- 19 10月, 2021 3 次提交
-
-
由 danleifeng 提交于
-
由 jiangcheng 提交于
* add feed op and new var for the generated subgraph * perfect the test script of build_cinn_pass * remove useless clear and perfect some annotation
-
由 WangXi 提交于
-
- 18 10月, 2021 1 次提交
-
-
由 xiaoxiaohehe001 提交于
* add_quant_axis * add_quant_axis * --amend * Update quant_conv2d_dequant_fuse_pass.cc
-
- 15 10月, 2021 2 次提交
-
-
由 Zhang Zheng 提交于
-
由 jiangcheng 提交于
* Add CinnSubgraphSearchPass * solve CI problem of subgraph order not same * fix some bug by review advices * ensure the independently of subgraph, that mean the subgraph should not have link to out-graph * rename cinn_subgraph_search_pass to build_cinn_pass and delete paddle_to_cinn_pass * add flag to control wheter append build cinn pass * remove AppendPass at ParallelExecutorPassBuilder * rename paddle_to_cinn_pass to build_cinn_pass in build_strategy and close test_run_from_cinn
-
- 14 10月, 2021 3 次提交
- 13 10月, 2021 3 次提交
-
-
由 wuhuanzhou 提交于
Check detail PR description at https://github.com/PaddlePaddle/Paddle/pull/36116
-
由 Huihuang Zheng 提交于
Remove RunFromCinn method in PE because We Will Call CinnRunner in Compute method of SubgraphOp
-
由 Wangzheee 提交于
* add_int_pass * add_int8_flag_pass * add_int8_flag_pass * fix CMakeLists.txt * fix test_trt_fc_fuse_quant_dequant_pass.py * fix python/paddle/fluid/tests/unittests/ir/inference/test_trt_fc_fuse_quant_dequant_pass.py * fix test_trt_fc_fuse_quant_dequant_pass.py
-
- 12 10月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
This reverts commit 7e60cc63.
-
- 11 10月, 2021 5 次提交
-
-
由 Zeng Jinle 提交于
* add FLAGS_allreduce_record_one_event * add more comments * fix ut * improve coverage * fix ut, improve coverage
-
由 jakpiase 提交于
-
由 yaoxuefeng 提交于
-
由 wangxinxin08 提交于
* add mish trt plugin, compile & install success, run error. test=develop * modify code according to review * add TRT_NOEXCEPT for mish trt plugin * add unittest for mish trt plugin * remove unnecessary check of mish in op_teller.cc * fix some problem of trt8 * add check and modify unittest while converting mish to trt plugin Co-authored-by: Ndengkaipeng <dengkaipeng@baidu.com>
-
由 Huihuang Zheng 提交于
Add use_cinn flag and use it to control whether we run PaddlePaddle using CINN. Also add: Replace PaddlePaddle graph with a CINN graph in a pass PE Method to feed data and run the graph by CINN
-
- 09 10月, 2021 2 次提交
-
-
由 Zeng Jinle 提交于
* add const OpDesc id() * add const for VarDesc::id()
-
由 wuhuanzhou 提交于
支持C++开发注册GeneratePass,简化针对fusion等子图优化场景开发方式。
-
- 08 10月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* support CUDA Graph on PE * add ut, fix CI compile * reduce memory consumption * fix CUDA 10 CI * improve coverage * improve python coverage
-
- 30 9月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 29 9月, 2021 6 次提交
-
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge spinlock
-
由 yaoxuefeng 提交于
-
由 Yiqun Liu 提交于
-
由 Zeng Jinle 提交于
-
由 baoachun 提交于
-
由 Zeng Jinle 提交于
-
- 28 9月, 2021 6 次提交
-
-
由 Thunderbrook 提交于
* ps gpu dump * remove log
-
由 Jiabin Yang 提交于
* fix dygraph double grad dtype error when calling for high differential senario * reinvoke ci * add test for partial_engine.cc
-
由 Leo Chen 提交于
-
由 Zeng Jinle 提交于
-
由 Huihuang Zheng 提交于
* Add Basic CINN Runner Class * Add CinnCacheKey * Add Cache logic and improve CinnCacheKey * Modify as reviewer commented * Implement hash_combine to fix MAC build.
-
由 Siming Dai 提交于
-
- 27 9月, 2021 2 次提交
-
-
由 xiaoxiao-luomu 提交于
* gloo hdfs set check & gloo connect retry * add vlog * print gloo connect addr & add vlog * . * modify vlof * modify vlog * modify vlog
-
由 Aurelius84 提交于
* Polish multi-thread schedule strategy * fix atomic_deps * modify into lambda function * add and run
-
- 26 9月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* set file_num in one shard * format
-
- 24 9月, 2021 1 次提交
-
-
由 baoachun 提交于
* add multihead_matmul trt converter test case * move attribute check to op_teller
-