- 21 Dec, 2021 3 commits
-
-
Committed by Chen Weihang
* rename cuda to gpu * revert CMake change * resolve conflict * rename other cuda to gpu * polish details
-
Committed by chentianyu03
* fix when out_dtype is same with x.dtype and still transform type error * fix spell error
-
Committed by Chen Weihang
* remove eigen and blas dir * fix declare error
-
- 20 Dec, 2021 3 commits
-
-
Committed by chentianyu03
* add pten conj kernel * modify conj_kernel file path * add defined cuda macro to cuda/conj_kernel.h
-
Committed by 石晓伟
-
Committed by zyfncg
-
- 17 Dec, 2021 4 commits
-
-
Committed by Chen Weihang
-
Committed by chentianyu03
* modify sum mean args * add GetExpectedPtenKernelArgs for reduce_op * modify kernel args number * modify kernel args number
-
Committed by Chen Weihang
-
Committed by limingshu
* fix_bugs_for_elementwise_branch_selection * fix merge_dims bugs * fix all influenced file
-
- 16 Dec, 2021 3 commits
-
-
Committed by Chen Weihang
* unify device context entrance * move all_context include to header * polish cmake relay for device_context * fix npu compile failed * fix npu compile failed
-
Committed by Chen Weihang
* add register_ctx_kernel and move scale kernel * polish details by reviewer comment * fix xpu compile failed * fix cmake error
-
Committed by YuanRisheng
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile * modify register name * fix compile bugs
-
- 15 Dec, 2021 2 commits
-
-
Committed by Yiqun Liu
test=document_fix
-
Committed by Chen Weihang
-
- 14 Dec, 2021 2 commits
-
-
Committed by Chen Weihang
* polish register macro * resolve compile failed * revert needless change * revert eager related change * revert eager related change * change register macro name * polish details
-
Committed by YuanRisheng
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile
-
- 13 Dec, 2021 1 commit
-
-
Committed by Shang Zhizhou
* fix reduce_max bug * add unittest
-
- 10 Dec, 2021 3 commits
-
-
Committed by chentianyu03
-
Committed by YuanRisheng
* add alias kernel name * modify code as suggestions * add alias name for matmul and remove redundant member in kernel factory
-
Committed by chentianyu03
* fix pten::Copy use error in reduce_impl * remove in_dtype args in reduce kernel * fix copy error * fix copy error
-
- 09 Dec, 2021 2 commits
-
-
Committed by Chen Weihang
* refine the kernel register impl * fix cmake and symbol error * remove overload macro * polish details
-
Committed by Chen Weihang
-
- 08 Dec, 2021 1 commit
-
-
Committed by YuanRisheng
* add alias kernel name * modify code as suggestions
-
- 07 Dec, 2021 1 commit
-
-
Committed by YuanRisheng
* add inplace op adaptation * optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor> * move func in kernel_context.h into kernel_context.cc * refactor logic that transform variable to densetensor * fix bugs when compile * update func name * fix bugs when run windows-ci
-
- 03 Dec, 2021 1 commit
-
-
Committed by ronnywang
* refine structure for cuda and rocm * update * update * update * update
-
- 02 Dec, 2021 1 commit
-
-
Committed by YuanRisheng
* add inplace op adaptation * optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor> * refactor logic that transform variable to densetensor * update func name
-
- 30 Nov, 2021 2 commits
-
-
Committed by chentianyu03
-
Committed by Chen Weihang
-
- 29 Nov, 2021 2 commits
-
-
Committed by chentianyu03
* add pten reduce kernel * add reduce_sum kernel * update attribute args and order * make out dtype undefined * fix empty input error * merge develop branch * rename sum as reduce function * rename sum as reduce function * fix reducekernelImpl args error * add reduce cuda kernel * modify dims type to const & * remove unused log * fix reduce_all out eigen function error * remove unused codes * add the missing sum api define and testcase * merge develop branch * fix sum test axis value error * replace pten mean kernel with reduce_mean * recover mean cuda to original implementation
-
Committed by chentianyu03
* add cuda implement of cast kernel * remove bfloat16 when defined paddle_with_hip
-
- 25 Nov, 2021 2 commits
-
-
Committed by zyfncg
* add scalar and scalar_array * remove DenseTensor include from Scalar and ScalarArray * remove inner header from scalar_array * refactor the method of fill_constant and add some comment * add fill_constant kernel using ScalarArray * modify some prompt * remove fill_constant kernel with no shape
-
Committed by Chen Weihang
-
- 24 Nov, 2021 2 commits
-
-
Committed by YuanRisheng
* elementwise_mul refactor * perfect code in test * delete redundant code * fix bugs when run test_multiply * adjust the location of macro * fix bugs when run ci
-
Committed by zyfncg
* add scalar and scalar_array * remove DenseTensor include from Scalar and ScalarArray * remove inner header from scalar_array * refactor the method of fill_constant and add some comment
-
- 23 Nov, 2021 1 commit
-
-
Committed by YuanRisheng
* elementwise_div refactor * fix compile bugs in windows ci
-
- 22 Nov, 2021 2 commits
-
-
Committed by chentianyu03
* add cast kernel * add cast cuda kernel * add cast kernel * make cast kernel output dtype undefined * get cast dtype from vardesc * move cast to manipulation and add test case * add castinfershape * avoid reinitialize variable * InitializeVariable support datatype * merge develop branch * fix merge bug * revert modify initializeVariable * revert modify on InitializeVariable * revert modify on InitializeVariable * mutable support reset dtype * enable make pten tensor from variable when def_arg.type is undefined * fix build pten ctx start_idx error * copy pten out tensor to variable * merge develop branch * fix non pten kernel cast failed * add reset allocation place for remake tensor * fix inplace realloc error * add mutable on pten kernels and remove unused cast files * rename function names * fix output type error * fix conflict with develop branch * set data type to variable with pten's dtype * fix test_cast_api type mismatch * DenseTensor mutable_data support 0 bytes value * fix the inplace bug of reshape kernel * fix pten.backend != variable.place when moving storage, place mismatch bug * fix conflict with develop branch * Fix bug of paddle::experimental::MovesStorage * fix ReMakePtenDenseTensor place mismatch bug * Revert "fix ReMakePtenDenseTensor place mismatch bug" This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a. * fix ReMakePtenDenseTensor place mismatch bug * reverts the set_lod interface, test=develop * modify by the review options * modify error message * add & for const input arguments * add reference in params * elementwise_sub add mutable_data * fix ResetHolderWithType check size bug * add dependence pten_tensor to test_cast_api object * remove unused code to pass ci coverage Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: YuanRisheng <yuanrisheng@baidu.com> Co-authored-by: shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
Committed by niuliling123
* modified the elementwise_op_broadcast and elementwise_op_impl for xpu2
-
- 19 Nov, 2021 1 commit
-
-
Committed by Chen Weihang
* add copy_to and to method for Tensor * polish msg format * fix details error * fix copy_to test compile failed * fix typo
-
- 18 Nov, 2021 1 commit
-
-
Committed by YuanRisheng
* elementwise_add kernel refactor * fix compile bugs in elementwise_add refactor * fix compile bugs when run in npu/xpu * fix bugs when run unit test * fix bugs when run ci-windows * modify code as recommended * code format adjust * fix bugs when run ci * fix compile bug when run in ci-windows * elementwise_sub refactor * add PD_DLL_DECL for elementwise_sub * fix bugs when compile
-