- 14 Oct, 2021 2 commits
- 13 Oct, 2021 3 commits
-
-
Committed by wuhuanzhou
See the detailed PR description at https://github.com/PaddlePaddle/Paddle/pull/36116
-
Committed by Huihuang Zheng
Remove the RunFromCinn method in PE because we will call CinnRunner in the Compute method of SubgraphOp
-
Committed by Wangzheee
* add_int_pass
* add_int8_flag_pass
* add_int8_flag_pass
* fix CMakeLists.txt
* fix test_trt_fc_fuse_quant_dequant_pass.py
* fix python/paddle/fluid/tests/unittests/ir/inference/test_trt_fc_fuse_quant_dequant_pass.py
* fix test_trt_fc_fuse_quant_dequant_pass.py
-
- 12 Oct, 2021 1 commit
-
-
Committed by Zeng Jinle
This reverts commit 7e60cc63.
-
- 11 Oct, 2021 5 commits
-
-
Committed by Zeng Jinle
* add FLAGS_allreduce_record_one_event
* add more comments
* fix ut
* improve coverage
* fix ut, improve coverage
-
Committed by jakpiase
-
Committed by yaoxuefeng
-
Committed by wangxinxin08
* add mish trt plugin, compile & install success, run error. test=develop
* modify code according to review
* add TRT_NOEXCEPT for mish trt plugin
* add unittest for mish trt plugin
* remove unnecessary check of mish in op_teller.cc
* fix some problems of trt8
* add check and modify unittest while converting mish to trt plugin
Co-authored-by: dengkaipeng <dengkaipeng@baidu.com>
-
Committed by Huihuang Zheng
Add the use_cinn flag and use it to control whether we run PaddlePaddle using CINN. Also add:
* A pass that replaces the PaddlePaddle graph with a CINN graph
* A PE method to feed data and run the graph with CINN
-
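- 09 Oct, 2021 2 commits
-
-
Committed by Zeng Jinle
* add const OpDesc id()
* add const for VarDesc::id()
-
Committed by wuhuanzhou
Support developing and registering GeneratePass in C++, simplifying development for subgraph optimization scenarios such as fusion.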
-
- 08 Oct, 2021 1 commit
-
-
Committed by Zeng Jinle
* support CUDA Graph on PE
* add ut, fix CI compile
* reduce memory consumption
* fix CUDA 10 CI
* improve coverage
* improve python coverage
-
- 30 Sep, 2021 1 commit
-
-
Committed by yaoxuefeng
-
- 29 Sep, 2021 6 commits
-
-
Committed by liutiexing
* add align for WorkQueue
* add spinlock
* merge spinlock
-
Committed by yaoxuefeng
-
Committed by Yiqun Liu
-
Committed by Zeng Jinle
-
Committed by baoachun
-
Committed by Zeng Jinle
-
- 28 Sep, 2021 6 commits
-
-
Committed by Thunderbrook
* ps gpu dump
* remove log
-
Committed by Jiabin Yang
* fix dygraph double grad dtype error when calling for high differential scenario
* reinvoke ci
* add test for partial_engine.cc
-
Committed by Leo Chen
-
Committed by Zeng Jinle
-
Committed by Huihuang Zheng
* Add Basic CINN Runner Class
* Add CinnCacheKey
* Add Cache logic and improve CinnCacheKey
* Modify as reviewer commented
* Implement hash_combine to fix MAC build.
-
Committed by Siming Dai
-
- 27 Sep, 2021 2 commits
-
-
Committed by xiaoxiao-luomu
* gloo hdfs set check & gloo connect retry
* add vlog
* print gloo connect addr & add vlog
* .
* modify vlog
* modify vlog
* modify vlog
-
Committed by Aurelius84
* Polish multi-thread schedule strategy
* fix atomic_deps
* modify into lambda function
* add and run
-
- 26 Sep, 2021 1 commit
-
-
Committed by Thunderbrook
* set file_num in one shard
* format
-
- 24 Sep, 2021 1 commit
-
-
Committed by baoachun
* add multihead_matmul trt converter test case
* move attribute check to op_teller
-
- 23 Sep, 2021 1 commit
-
-
Committed by liutiexing
* add align for WorkQueue
* WorkQueue update
* Revert "WorkQueue update"
  This reverts commit 14ce793dbb204f8ddec63c34b3b72a73c7cdb93a.
* optimize WorkQueue
-
- 22 Sep, 2021 6 commits
-
-
Committed by Tomasz Socha
* Fix copy elision warning
* Remove redundant code
-
Committed by wanghuancoder
-
Committed by Wangzheee
-
Committed by wanghuancoder
-
Committed by wanghuancoder
* refine gc for new_executor, test=develop
* refine, test=develop
* refine, test=develop
* merge, test=develop
-
Committed by Aurelius84
* Modify H2D and D2H as kQueue::Sync
* fix interface error
-
- 18 Sep, 2021 2 commits
-
-
Committed by Huihuang Zheng
Add a basic Cost Model that uses the executor to run a program and profile it to get op times. This is an early basic version; we will add more functions in the future.
-
Committed by Wilber
-