- 25 10月, 2021 2 次提交
-
-
由 Zhen Wang 提交于
* Init the functions of CinnCompiler. * Add the unit test for CinnCompiler. * Fix some compilation errors. * Update the UT of cinn_compiler. * Use Decomposer&OpFusion passes in CinnCompiler::CompileGraph. * Update some comments. * Uncomment some includes in build_cinn_pass.cc. * Use refs instead of ptrs as returned types of FindGraph & Compile in CinnCompiler. * Use the merged CinnGraphSymbolization functions in CinnCompiler.
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * update * update * update Error MSG * update EventsWaiter
-
- 24 10月, 2021 1 次提交
-
-
由 Zhen Wang 提交于
-
- 23 10月, 2021 3 次提交
-
-
由 jiangcheng 提交于
* add cinn graph symbolization * fix some bug * add paddle scope to cinn scope * add paddle scope to CINN scope in Symbolization, and add feed op when build cinn pass * fix some bug * fix some bug by review advices * optimize code problem * revert build_cinn_pass and move the change to https://github.com/PaddlePaddle/Paddle/pull/36503 * fix some bug after co-compilation * perfect single test script * remove scope and rename feed_target to input_tensor * using std::unordered_map instead of absl::flat_hash_map * fix single test bug * revert to preverion for WITH_CINN has add in later PR * full error information for CI * full enfore information for CI pass
-
由 jiangcheng 提交于
* add transformer of paddle desc and cinn desc * change LOG(FATAL) to PADDLE_THROW for ci * full error imformation for ci * fix some problem as review advice * fix some bug * move vat type utils to tansform_desc header file * add if NOT WITH_CINN control whether compile * build_strategy check whether open WITH_CINN * add control WITH_CINN in cmake
-
由 Huihuang Zheng 提交于
This PR added some changes to match the CINN change for compilation. It also tried to fix JiangCheng's Problem in PR: https://github.com/PaddlePaddle/Paddle/pull/36100 These changes include: 1. Set `CINN_GIT_TAG` to a newer tag 2. CINN now just `make cinnapi -j` 3. We have to add `-DPY_VERSION=${PY_VERSION} -DWITH_TESTING=ON` to CINN cmake args 4. For CINN's third party dependencies, we could just include headers without target_link_libraries 5. Moved `cinn.cmake` from `paddle/cmake` to `paddle/cmake/external` to match old style. External folder contains `lite`, which is the same level of `cinn` 6. CINN added `-DNAMESPACE=cinn_gflags` in `gflags.cmake` to have different gflag namespaces between CINN and Paddle. It solved re-define problem. 7. Change namespace of `::google::` in gflags to `::GFLAGS_NAMESPACE`
-
- 22 10月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* [hapi] support dygrapg amp O2 * fix problem of static pure fp16 in hapi * fix bug * fix format * fix ut * follow comments * update ut * update amp save/load * fix ut * refine code format
-
- 21 10月, 2021 4 次提交
-
-
由 jakpiase 提交于
* added base changes for matmul_v2+trans+resh fuse pass * added full matmul_v2+transpose+reshape pass * removed a file added by mistake * added reviewers suggestions * Changed ops type in checking capatibility version * Deteled one statement
-
由 xiongkun 提交于
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * adjust multithread using, fix flame graph * update
-
由 Aurelius84 提交于
* Add kQueueSync.synchronize_run_ logic * Support No DataTransform From GetKernelTypeForVar
-
- 20 10月, 2021 5 次提交
-
-
由 danleifeng 提交于
* split into PreBuildTask and BuildPull; slove endpass bug;test=develop * change buildcpu into prebuild and buildcpu into build;test=develop
-
由 Wilber 提交于
-
由 Steffy-zxf 提交于
Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent. * support the text string as an input Tensor * support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens * Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization. * It first applies basic tokenization, followed by wordpiece tokenization.
-
由 Huihuang Zheng 提交于
Add CINN compile option in CMake. Now you can use CINN in Paddle by `-DWITH_CINN=ON` when `cmake` To test it, you can run `make cinn_lib_test -j` and `ctest -R cinn_lib_test`. Note: 1. You should set ``` export runtime_include_dir=${CINN_SOURCE_DIR}/cinn/runtime/cuda ``` When run test, the `${CINN_SOURCE_DIR}` should be set based on your CINN directory. 2. CINN is under developing now, you may have to change `CINN_GIT_TAG` to the git commit you need.
-
由 Aurelius84 提交于
-
- 19 10月, 2021 3 次提交
-
-
由 danleifeng 提交于
-
由 jiangcheng 提交于
* add feed op and new var for the generated subgraph * perfect the test script of build_cinn_pass * remove useless clear and perfect some annotation
-
由 WangXi 提交于
-
- 18 10月, 2021 1 次提交
-
-
由 xiaoxiaohehe001 提交于
* add_quant_axis * add_quant_axis * --amend * Update quant_conv2d_dequant_fuse_pass.cc
-
- 15 10月, 2021 2 次提交
-
-
由 Zhang Zheng 提交于
-
由 jiangcheng 提交于
* Add CinnSubgraphSearchPass * solve CI problem of subgraph order not same * fix some bug by review advices * ensure the independently of subgraph, that mean the subgraph should not have link to out-graph * rename cinn_subgraph_search_pass to build_cinn_pass and delete paddle_to_cinn_pass * add flag to control wheter append build cinn pass * remove AppendPass at ParallelExecutorPassBuilder * rename paddle_to_cinn_pass to build_cinn_pass in build_strategy and close test_run_from_cinn
-
- 14 10月, 2021 3 次提交
- 13 10月, 2021 3 次提交
-
-
由 wuhuanzhou 提交于
Check detail PR description at https://github.com/PaddlePaddle/Paddle/pull/36116
-
由 Huihuang Zheng 提交于
Remove RunFromCinn method in PE because We Will Call CinnRunner in Compute method of SubgraphOp
-
由 Wangzheee 提交于
* add_int_pass * add_int8_flag_pass * add_int8_flag_pass * fix CMakeLists.txt * fix test_trt_fc_fuse_quant_dequant_pass.py * fix python/paddle/fluid/tests/unittests/ir/inference/test_trt_fc_fuse_quant_dequant_pass.py * fix test_trt_fc_fuse_quant_dequant_pass.py
-
- 12 10月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
This reverts commit 7e60cc63.
-
- 11 10月, 2021 5 次提交
-
-
由 Zeng Jinle 提交于
* add FLAGS_allreduce_record_one_event * add more comments * fix ut * improve coverage * fix ut, improve coverage
-
由 jakpiase 提交于
-
由 yaoxuefeng 提交于
-
由 wangxinxin08 提交于
* add mish trt plugin, compile & install success, run error. test=develop * modify code according to review * add TRT_NOEXCEPT for mish trt plugin * add unittest for mish trt plugin * remove unnecessary check of mish in op_teller.cc * fix some problem of trt8 * add check and modify unittest while converting mish to trt plugin Co-authored-by: Ndengkaipeng <dengkaipeng@baidu.com>
-
由 Huihuang Zheng 提交于
Add use_cinn flag and use it to control whether we run PaddlePaddle using CINN. Also add: Replace PaddlePaddle graph with a CINN graph in a pass PE Method to feed data and run the graph by CINN
-
- 09 10月, 2021 2 次提交
-
-
由 Zeng Jinle 提交于
* add const OpDesc id() * add const for VarDesc::id()
-
由 wuhuanzhou 提交于
支持C++开发注册GeneratePass,简化针对fusion等子图优化场景开发方式。
-
- 08 10月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* support CUDA Graph on PE * add ut, fix CI compile * reduce memory consumption * fix CUDA 10 CI * improve coverage * improve python coverage
-
- 30 9月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 29 9月, 2021 2 次提交
-
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge spinlock
-
由 yaoxuefeng 提交于
-