- 13 12月, 2021 5 次提交
-
-
由 zyfncg 提交于
* add variadic_args kernel in pten * merge develop code * add variadic_args kernel and benchmark * change dynamic_cast to static_cast for DeviceContext * merge the code * modify code format * refactor variadic kernel function
-
由 Shang Zhizhou 提交于
* fix reduce_max bug * add unittest
-
由 zlsh80826 提交于
-
由 0x45f 提交于
* allow ifelse return `int` in paddle cond * add test and refine code * polish code, add test * code format
-
由 zlsh80826 提交于
* add restriction on plugin supportsFormat to eliminate errors from TensorRT8 * ernie-varlen is only supported on architecture >= sm75
-
- 12 12月, 2021 1 次提交
-
-
由 沉潜的鱼儿 提交于
* dist matmul op compatible * dist op unittest * modify dist matmul * modify dist reshape * modify dist reshape * add a space * add a space * delete dist matmul op * modify reshape * add dist op unittest * modify dist op unittest
-
- 11 12月, 2021 1 次提交
-
-
由 Tao Luo 提交于
-
- 10 12月, 2021 25 次提交
-
-
由 liu zhengxi 提交于
* promote to v2 * alter
-
由 Leo Chen 提交于
-
由 pangyoki 提交于
-
由 chentianyu03 提交于
-
由 zhaocaibei123 提交于
* fix * modify log * fix batch_size
-
由 kuizhiqing 提交于
-
由 0x45f 提交于
* remove outer comment when dy2stat * remove all comment * add unit test
-
由 YuanRisheng 提交于
* add alias kernel name * modify code as suggestions * add alias name for matmul and remove redundant member in kernel factory
-
由 Feiyu Chan 提交于
* add as_complex and as_real op
-
由 Leo Chen 提交于
* git ignore eager_op_function_impl.h * test=document_fix
-
由 沉潜的鱼儿 提交于
* dist matmul op compatible * modify common dist op * modify common * add a space
-
由 chentianyu03 提交于
* fix pten::Copy use error in redcue_impl * remove in_dtype args in reduce kernel * fix copy error * fix copy error
-
由 sneaxiy 提交于
-
由 Jiabin Yang 提交于
-
由 Yuang Liu 提交于
-
由 CtfGo 提交于
-
由 Tao Luo 提交于
-
由 heliqi 提交于
* add fc_elementwise_layernorm_fuse_pass * fix name conflictn * rebuild CI * fix Ran Programs=0 bug
-
由 Leo Chen 提交于
-
由 zyfncg 提交于
-
由 jianghaicheng 提交于
-
由 jianghaicheng 提交于
-
由 jianghaicheng 提交于
-
由 Guanghua Yu 提交于
* Support sub graph quant-post
-
由 Guanghua Yu 提交于
-
- 09 12月, 2021 8 次提交
-
-
由 jiangcheng 提交于
* optimize cache key * add cinn cache key by graph address * perfect cache key test script * rename GraphHashProto to GraphHashStrategy * optimize graph_serialize_str_ to graph_hash_val_ and other change by review advices
-
由 CtfGo 提交于
cinn_launch_op: cache scope and place on CinnLaunchContext to skip duplicate alloc/free callback construction
-
由 wuhuanzhou 提交于
-
由 Chen Weihang 提交于
* refine the kernel register impl * fix cmake and symbol error * remove overload marco * polish details
-
由 Haohongxiang 提交于
* merge latest develop branch * fix bugs * update * fix bugs for unittest * modify for less use of gpu mem * fix bugs of using _reset_grad_inplace_version * update * update * modify for CI-Coverage * retrick all CIs
-
由 heliqi 提交于
* add transpose_flatten_concat pass * modify skip func to ignore_pass_case func * delete input_shape limit * modify get node order
-
由 Yuang Liu 提交于
-
由 Chen Weihang 提交于
-