- 30 3月, 2022 1 次提交
-
-
由 Zhen Wang 提交于
-
- 27 3月, 2022 1 次提交
-
-
由 TeFeng Chen 提交于
* support check data type and mutable_data with compiled infos in paddle with cinn * update cinn_instruction_run_op_test with multi data type
-
- 24 3月, 2022 2 次提交
-
-
由 jiangcheng 提交于
* fix build_cinn_pass internal var may be control var problem * add annotation and vlog by review advice
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
- 12 3月, 2022 1 次提交
-
-
由 jiangcheng 提交于
* fix NetBuilder API Name bug in cinn_lib_test * update cinn version to newest
-
- 08 3月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
[Phi]Move Relu/Cos/Sin/Tan/Acos/Asin/Atan/Sinh/Cosh/Asinh/Acosh/Atanh kernels in Activation to Phi (#40175) * move activation op * adjust code format * fix compile bugs * fix ci bugs * code format adjust * code format adjust2 * activate ci status * modify according to comment
-
- 03 3月, 2022 1 次提交
-
-
由 TeFeng Chen 提交于
* swith to PE execution in cinn launch * fix outer variables erased * skip the map bug temporarily for test * temporary solution for batch_norm bug * update comment * fix compile error * cinn_instruction_run_op_test: update code to skip external alloc/free instructions generated
-
- 28 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
- 24 2月, 2022 1 次提交
-
-
由 TeFeng Chen 提交于
* build a Paddle Graph from CINN compiled program for execution with PE * update names of some variables * fix random fail in build_cinn_pass_test and update some comments * fix compiler error by merging phi pr
-
- 22 2月, 2022 1 次提交
-
-
由 TeFeng Chen 提交于
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 17 2月, 2022 1 次提交
-
-
由 TeFeng Chen 提交于
* save the name lists of the input,internal and output variables of a subgraph as its attribute * fix compile error
-
- 16 2月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* remove reshape and elementwise_add registry * delete code * fix bugs when run ci ut * remove log * fix bugs when run unit test * fix bugs when run unit test * fix bugs when run cinn * fix bugs when run ci-mac-python3 * fix compile bugs * fix compile bugs * fix compile bugs * fix bugs when run kunlun * fix bugs when compile * update code according comment
-
- 15 2月, 2022 2 次提交
-
-
由 TeFeng Chen 提交于
* add cinn_instruction_run_op for launching execution of a cinn instruction * fix multi definition compilation error * update cmake * fix bug at infershape * fix compile error due to lacking header file
-
由 Aurelius84 提交于
* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 10 2月, 2022 1 次提交
-
-
由 TeFeng Chen 提交于
* add a graph pass to share MemOptVarInfos of external variables into subgraph * update pass name * fix compile failed * add share_mem_opt_info_to_subgraph_pass test * share_mem_opt_info_to_subgraph_pass_test pass * modify some codes for better style and more robust * update cmake
-
- 25 1月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again
-
- 17 1月, 2022 1 次提交
-
-
由 CtfGo 提交于
collecting input variables that only shape needed of each subgraph that compiled by CINN in build_cinn_pass, and expose them to memory optimization of framework passes by declaringDECLARE_INPLACE_OP_INFERER in cinn_launch op.
-
- 13 12月, 2021 1 次提交
-
-
由 CtfGo 提交于
complement deps of cmake files under WITH_CINN compilation
-
- 10 12月, 2021 1 次提交
-
-
由 CtfGo 提交于
-
- 09 12月, 2021 2 次提交
-
-
由 jiangcheng 提交于
* optimize cache key * add cinn cache key by graph address * perfect cache key test script * rename GraphHashProto to GraphHashStrategy * optimize graph_serialize_str_ to graph_hash_val_ and other change by review advices
-
由 CtfGo 提交于
cinn_launch_op: cache scope and place on CinnLaunchContext to skip duplicate alloc/free callback construction
-
- 08 12月, 2021 1 次提交
-
-
由 CtfGo 提交于
1. add a subdirectory named `cinn` in `paddle/fluid/operators` directory and move releated files into it 2. seperate CinnLaunchContext class from `cinn_launch_op.h` and put it in a new independent file named `cinn_launch_context.h`, so that it can be included by others clearly.
-
- 06 12月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
1. Modify git tag for CINN 2. Support compile option "-DWITH_CINN=ON, -DWITH_TESTING=OFF"
-
- 02 12月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
-
- 01 12月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
Modify ShareTensorWithCinnBuffer by callback to save memory
-
- 25 11月, 2021 1 次提交
-
-
由 Zhen Wang 提交于
-
- 19 11月, 2021 1 次提交
-
-
由 jiangcheng 提交于
* optimize cache-key by replace GraphToProgram to Dot string * fix compile failure bug
-
- 18 11月, 2021 1 次提交
-
-
由 Zhen Wang 提交于
* Add the `GetFetchNames` method in CinnGraphSymbolization. * Use unordered_set instead vector as the type of fetch_var_names. * Reuse the definition of kCompilationKey. * Use CompileOptions to set fetch_var_ids. * Update the argument passing of GraphCompiler.Build. * Fix some bugs in CinnGraphSymbolization::GetFetchIds.
-
- 15 11月, 2021 2 次提交
-
-
由 jiangcheng 提交于
-
由 jiangcheng 提交于
-
- 12 11月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
Fix Paddle-CINN CI
-
- 11 11月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
Add test property RUN_TYPE=CINN to CINN unit tests. It will restrict Paddle-CINN CI to run these unit tests only.
-
- 06 11月, 2021 2 次提交
-
-
由 Zhen Wang 提交于
* Update the batch size used in test_resnet50_with_cinn.py. * Enable more debug info.
-
由 jiangcheng 提交于
* add deny param list to solve unuse param cannot found the problem * enclosure deny list in a function * update by review advice
-
- 05 11月, 2021 1 次提交
-
-
由 Zhen Wang 提交于
* Use a more appropriate `Compile` method in cinn_launch_op. * Update the VizGraph method of CinnCompiler. * Add resnet50 model training with CINN.
-
- 03 11月, 2021 1 次提交
-
-
由 Zhen Wang 提交于
Add FLAGS_allow_cinn_ops & FLAGS_deny_cinn_ops for controlling op types used in training with CINN. (#36842) * Update UT test_parallel_executor_run_cinn.py. * Add FLAGS_allow_cinn_ops & FLAGS_deny_cinn_ops & FLAGS_cinn_ops_delim. * Use the custom StringSplit function and remove the FLAGS_cinn_ops_delim flag. * Add FlagController test. * Apply lock to the cache_ only in CinnCompiler. * Add VizGraph & ReadableKey method for CinnCompiler. * Update the dot style of VizGraph in CinnCompiler.
-
- 01 11月, 2021 2 次提交
-
-
由 jiangcheng 提交于
-
由 CtfGo 提交于
增加CinnLaunchOp,负责执行Cinn子图编译的结果,要点如下: 1. 在子图划分的BuildCinnPass中,每个子图在原图中会被替换为该CinnLaunchOp,由它来调用Cinn进行子图编译、执行的功能。 2. CinnLaunchOp的输入/输出即为子图的输入和输出,另外增加`compilation_key`属性,它可由该属性key从全局Cache中获取子图对象、编译结果,该属性由BuildCinnPass在创建Op时进行设置 3. CinnLaunchOp功能实现的流程为: - 从全局Cache中获取子图对象 - 从全局Cache中获取子图编译结果,未命中cache时进行即时编译 - 根据编译结果的变量信息(数据类型、shape)初始化运行时数据,分配内存/显存 - 将运行时数据打包为参数,调用cinn的可执行对象runtime program进行计算 - 子图运行结果通过参数指针同步到paddle侧的tensor
-