- 10 Dec, 2021 14 commits
-
-
Committed by chentianyu03
* fix pten::Copy use error in reduce_impl
* remove in_dtype args in reduce kernel
* fix copy error
* fix copy error
-
Committed by sneaxiy
-
Committed by Jiabin Yang
-
Committed by Yuang Liu
-
Committed by CtfGo
-
Committed by Tao Luo
-
Committed by heliqi
* add fc_elementwise_layernorm_fuse_pass
* fix name conflict
* rebuild CI
* fix Ran Programs=0 bug
-
Committed by Leo Chen
-
Committed by zyfncg
-
Committed by jianghaicheng
-
Committed by jianghaicheng
-
Committed by jianghaicheng
-
Committed by Guanghua Yu
* Support sub graph quant-post
-
Committed by Guanghua Yu
-
- 09 Dec, 2021 21 commits
-
-
Committed by jiangcheng
* optimize cache key
* add cinn cache key by graph address
* perfect cache key test script
* rename GraphHashProto to GraphHashStrategy
* optimize graph_serialize_str_ to graph_hash_val_ and other changes per review advice
-
Committed by CtfGo
cinn_launch_op: cache scope and place on CinnLaunchContext to skip duplicate alloc/free callback construction
-
Committed by wuhuanzhou
-
Committed by Chen Weihang
* refine the kernel register impl
* fix cmake and symbol error
* remove overload macro
* polish details
-
Committed by Haohongxiang
* merge latest develop branch
* fix bugs
* update
* fix bugs for unittest
* modify for less use of gpu mem
* fix bugs of using _reset_grad_inplace_version
* update
* update
* modify for CI-Coverage
* retrick all CIs
-
Committed by heliqi
* add transpose_flatten_concat pass
* modify skip func to ignore_pass_case func
* delete input_shape limit
* modify get node order
-
Committed by Yuang Liu
-
Committed by Chen Weihang
-
Committed by Aganlengzi
-
Committed by sneaxiy
* fix cuda atomicAdd for FP16
* try to fix ci
-
Committed by pangyoki
-
Committed by Zhanlue Yang
* Fixed eager compilation issues by temporarily turning off AutoCodeGen for specific ops
* Removed op_types
-
Committed by jianghaicheng
-
Committed by Roc
-
Committed by Feng Xing
-
Committed by Baibaifan
-
Committed by wangguanqun
* default accessor and multi table config
* add unittest
* add unittest
* delete print
-
Committed by Wangzheee
* fix_ele_convert: IElementWiseLayer can broadcast
* fix_ele_convert
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-
Committed by Zhanlue Yang
Adjusted Eager AutoCodeGen to Support Operators with Multiple OpBases & Enable Passing Output Tensor as Input Argument (#37943)
* Rearranged Eager AutoCodeGen directory structure
* Removed USE_OP in Eager AutoCodeGen
* Enabled generation for Operators without Grad/Inputs/Outputs
* Resolved operators without input
* Fixed merge conflicts
* Enabled Eager AutoCodeGen for 10+ more operators
* Refactored Eager AutoCodeGen with more organized helper objects
* Enabled Eager AutoCodeGen for operators with multiple OpBases
* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
-
- 08 Dec, 2021 5 commits
-
-
Committed by wanghuancoder
* refine a test case, test=develop
* publish python c api for eager, test=develop
* revert modify about test_allclose_layer.py, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* delete numpy includes, use pybind11 numpy.h, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* support eager error msg, and add grad test case, test=develop
* refine, test=develop
* refine, test=develop
* generate eager core ops, only 4 ops, test=develop
* Rearranged Eager AutoCodeGen directory structure
* Removed USE_OP in Eager AutoCodeGen
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* open 500 list
* refine, test=develop
* refine, test=develop
* refine, test=develop
* fix auto code gen, test=develop
* Enabled generation for Operators without Grad/Inputs/Outputs
* refine, test=develop
* refine, test=develop
* refine, test=develop
* add to pyobject, test=develop
* Resolved operators without input
* merge pr 37837
* refine
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
Co-authored-by: jim19930609 <jim19930609@gmail.com>
-
Committed by zmxdream
-
Committed by chentianyu03
This reverts commit a73064f2.
-
Committed by CtfGo
1. add a subdirectory named `cinn` in the `paddle/fluid/operators` directory and move related files into it
2. separate the CinnLaunchContext class from `cinn_launch_op.h` and put it in a new independent file named `cinn_launch_context.h`, so that it can be included cleanly by others.
-
Committed by liutiexing
* add align for WorkQueue
* add spinlock
* merge develop
* merge
* Add EventsWaiter
* Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
* Fix RecordEvent
Co-authored-by: liutiexing <liutiexing@google.com>
-