- 14 12月, 2021 11 次提交
-
-
由 baoachun 提交于
* add conv_gelu_mkldnn_fuse_pass * add post ops
-
由 Aurelius84 提交于
-
由 weishengying 提交于
-
由 YuanRisheng 提交于
-
由 Yuang Liu 提交于
-
由 YuanRisheng 提交于
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile
-
由 feng_shuai 提交于
* test_mkldnn_depthwise_conv_pass * test: add TimeOut * sset TIMEOUT * fix:add random num for dilation and group
-
由 Zhanlue Yang 提交于
* Rearranged Eager AutoCodeGen directory structure * Removed USE_OP in Eager AutoCodeGen * Enabled generation for Operators without Grad/Inputs/Outputs * Resolved operators without input * Fixed merge conflicts * Enabled Eager AutoCodeGen for 10+ more operators * Refactored Eager AutoCodeGen with more organized helper objects * Enabled Eager AutoCodeGen for operators with multiple OpBases * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
-
由 heliqi 提交于
* add layer_norm_fuse_pass test case * restore cmakelist code * Merge branch 'develop' into layer_norm_fuse_pass * Merge branch 'develop' into layer_norm_fuse_pass * add bad case test
-
由 wangguanzhong 提交于
-
由 Sylwester Fraczek 提交于
* reshape+transpose+matmul_v2 * in_name->input_name * fix pr-ci-static-check
-
- 13 12月, 2021 14 次提交
-
-
由 zhenlin 提交于
* update 3 tests * fix typo error
-
由 wenbin 提交于
* disabled bad case * int to size_t
-
由 jianghaicheng 提交于
-
由 taixiurong 提交于
-
由 xiongkun 提交于
* fix single card 8 unittests in new executor * fix * fix
-
由 Noel 提交于
-
由 Chen Weihang 提交于
-
由 wanghuancoder 提交于
-
由 wangzhen38 提交于
* add Logit API * add unittest * conflict * pull conflit * pull conflit logit * fix unititest * fix code style * update docs style of * update en doc * fix docs en style * fix docs en style1 * fix docs en style2 * fix docs en style3 * fix docs en style4 * fix docs en style5 * fix docs en style6 * fix docs en style7 * fix docs en style8 * update by review * fix nan bug
-
由 CtfGo 提交于
complement deps of cmake files under WITH_CINN compilation
-
由 zyfncg 提交于
* add variadic_args kernel in pten * merge develop code * add variadic_args kernel and benchmark * change dynamic_cast to static_cast for DeviceContext * merge the code * modify code format * refactor variadic kernel function
-
由 Shang Zhizhou 提交于
* fix reduce_max bug * add unittest
-
由 zlsh80826 提交于
-
由 zlsh80826 提交于
* add restriction on plugin supportsFormat to eliminate errors from TensorRT8 * ernie-varlen is only supported on architecture >= sm75
-
- 10 12月, 2021 15 次提交
-
-
由 Leo Chen 提交于
-
由 pangyoki 提交于
-
由 chentianyu03 提交于
-
由 zhaocaibei123 提交于
* fix * modify log * fix batch_size
-
由 kuizhiqing 提交于
-
由 YuanRisheng 提交于
* add alias kernel name * modify code as suggestions * add alias name for matmul and remove redundant member in kernel factory
-
由 Feiyu Chan 提交于
* add as_complex and as_real op
-
由 Leo Chen 提交于
* git ignore eager_op_function_impl.h * test=document_fix
-
由 chentianyu03 提交于
* fix pten::Copy use error in redcue_impl * remove in_dtype args in reduce kernel * fix copy error * fix copy error
-
由 sneaxiy 提交于
-
由 Jiabin Yang 提交于
-
由 Yuang Liu 提交于
-
由 CtfGo 提交于
-
由 heliqi 提交于
* add fc_elementwise_layernorm_fuse_pass * fix name conflictn * rebuild CI * fix Ran Programs=0 bug
-
由 Leo Chen 提交于
-