- 15 Dec, 2021 3 commits
-
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-
- 14 Dec, 2021 3 commits
-
-
Committed by Chen Weihang
* polish register macro * resolve compile failed * revert needless change * revert eager related change * revert eager related change * change register macro name * polish details
-
Committed by YuanRisheng
-
Committed by YuanRisheng
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile
-
- 13 Dec, 2021 3 commits
-
-
Committed by Chen Weihang
-
Committed by zyfncg
* add variadic_args kernel in pten * merge develop code * add variadic_args kernel and benchmark * change dynamic_cast to static_cast for DeviceContext * merge the code * modify code format * refactor variadic kernel function
-
Committed by Shang Zhizhou
* fix reduce_max bug * add unittest
-
- 10 Dec, 2021 4 commits
-
-
Committed by chentianyu03
-
Committed by YuanRisheng
* add alias kernel name * modify code as suggestions * add alias name for matmul and remove redundant member in kernel factory
-
Committed by chentianyu03
* fix pten::Copy use error in reduce_impl * remove in_dtype args in reduce kernel * fix copy error * fix copy error
-
Committed by zyfncg
-
- 09 Dec, 2021 3 commits
-
-
Committed by Chen Weihang
* refine the kernel register impl * fix cmake and symbol error * remove overload macro * polish details
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-
- 08 Dec, 2021 1 commit
-
-
Committed by YuanRisheng
* add alias kernel name * modify code as suggestions
-
- 07 Dec, 2021 4 commits
-
-
Committed by zyfncg
-
Committed by wanghuancoder
* refine a test case, test=develop * rm python, test=develop * refine, test=develop * fix cmake generate error, and fix circular import, test=develop
-
Committed by YuanRisheng
* add inplace op adaptation * optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor> * move func in kernel_context.h into kernel_context.cc * refactor logic that transform variable to densetensor * fix bugs when compile * update func name * fix bugs when run windows-ci
-
Committed by zyfncg
-
- 06 Dec, 2021 2 commits
-
-
Committed by Chen Weihang
-
Committed by Chen Weihang
* fix reshape move storage error * remove needless set type * alloc tensor by shared storage
-
- 03 Dec, 2021 4 commits
-
-
Committed by ronnywang
* refine structure for cuda and rocm * update * update * update * update
-
Committed by zyfncg
-
Committed by wanghuancoder
* refine a test case, test=develop * publish python c api for eager, test=develop * revert modify about test_allclose_layer.py, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * delete numpy includes, use pybind11 numpy.h, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * support eager error msg, and add grad test case, test=develop * refine, test=develop * refine, test=develop
-
Committed by zyfncg
* add code-gen file * add api-gen in cmake * adjust the code format * temp save the code * add genen-api module into pten * merge the develop code * fix merge conflict * fix code conflict with develop * support reduce_mean/sum * fix the CI requirement * fix requirement problem of CI * remove original api code * fix bug caused by removing original api
-
- 02 Dec, 2021 1 commit
-
-
Committed by YuanRisheng
* add inplace op adaptation * optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor> * refactor logic that transform variable to densetensor * update func name
-
- 30 Nov, 2021 3 commits
-
-
Committed by chentianyu03
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-
- 29 Nov, 2021 2 commits
-
-
Committed by chentianyu03
* add pten reduce kernel * add reduce_sum kernel * update attribute args and order * make out dtype undefined * fix empty input error * merge develop branch * rename sum as reduce function * rename sum as reduce function * fix reducekernelImpl args error * add reduce cuda kernel * modify dims type to const & * remove unused log * fix reduce_all out eigen function error * remove unused codes * add the missing sum api define and testcase * merge develop branch * fix sum test axis value error * replace pten mean kernel with reduce_mean * recover mean cuda to original implementation
-
Committed by chentianyu03
* add cuda implement of cast kernel * remove bfloat16 when defined paddle_with_hip
-
- 26 Nov, 2021 2 commits
-
-
Committed by YuanRisheng
* Support parse kernel key by multi-inputs * optimize code according to reviewer
-
Committed by Chen Weihang
-
- 25 Nov, 2021 2 commits
-
-
Committed by zyfncg
* add scalar and scalar_array * remove DenseTensor include from Scalar and ScalarArray * remove inner header from scalar_array * refactor the method of fill_constant and add some comment * add fill_constant kernel using ScalarArray * modify some prompt * remove fill_constant kernel with no shape
-
Committed by Chen Weihang
-
- 24 Nov, 2021 3 commits
-
-
Committed by YuanRisheng
* elementwise_mul refactor * perfect code in test * delete redundant code * fix bugs when run test_multiply * adjust the location of macro * fix bugs when run ci
-
Committed by zyfncg
* add scalar and scalar_array * remove DenseTensor include from Scalar and ScalarArray * remove inner header from scalar_array * refactor the method of fill_constant and add some comment
-
Committed by Chen Weihang
* standardize unittest namespace * fix detail error
-