- 18 5月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* matmul refactor * remove UT which only check ENFORCE output * code format * improve memory usage
-
- 22 2月, 2022 1 次提交
-
-
由 xiongkun 提交于
* change Vector to std::vector and provide MixVector class as a helper wrapper class * solve the multi-gpu hang problem * remove the duplicate template instantialize * Copy vector to cpu * add CopyToCPU * xxx * final version: fix the problem of all reduce * remove mixvector dependence * fix * merge * fix code * fix by CI
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 29 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* open header for custom kernel * add core utils * tidy core code * tify header * tidy include * tidy namespace * resolve conflit * fix unittest and coverage * remove platform using * resolve conflict * resolve conflict * fix digamma namespace error * fix xpu full kernel error * fix xpu full kernel error * polish details * add place for lib storage
-
- 21 1月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Migrate Dim and DDim from paddle::framework into pten namespace * fix paddle::framework::Array * fix framework::Array
-
- 18 9月, 2021 1 次提交
-
-
由 From00 提交于
* Add linalg.eigvals API * pre-commit check * Adjust code style * Fix conflict * Improve code style * Modify the test code to ignore testing CUDA kernel * Sort ouput data before checking in test code * Set timeout value for UT * Improve API example code to pass CI * Fix bug for None fetch_list in Windows * Delete grad Op
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 24 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* polish framework error meg part2 * polish details
-
- 24 4月, 2020 1 次提交
-
-
由 arlesniak 提交于
-
- 12 4月, 2020 1 次提交
-
-
由 liuwei1031 提交于
-
- 11 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
remove unused accuracy-diff warpctc-cudnn implementation test=develop
-
- 04 9月, 2019 1 次提交
-
-
由 Adam 提交于
test=develop
-
- 10 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* clean code of dim and place, test=develop * fix failed unittests, test=develop
-
- 08 5月, 2019 1 次提交
-
-
由 Hongyu Liu 提交于
* fix shape_check; test=develop * fix format; test=develop * fix format; test=develop * fix ddim bug; test=develop * fix c++ format; test=develop * change function name; test=develop
-
- 27 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 26 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 25 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 24 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 21 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 20 12月, 2018 2 次提交
- 19 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 18 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 28 2月, 2018 1 次提交
-
-
由 xuwei06 提交于
Dim<0> is for scalar (rank-0 tensor). Adding Dim<0> can simplify a lot of code.
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 08 2月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 08 11月, 2017 2 次提交
-
-
由 Yu Yang 提交于
* Add LoDRankTable LoD Rank Table stores the `level` of `lod` which is ordered by sequence length in descending order. It is useful when implement dynamic RNN and is shared by dynamic RNN memory, dynamic RNN slice input and dynamic RNN slice output operators. * Add skeleton for array_to_lod_tensor and lod_tensor_to_array * Add VarType::LoDTensorArray * Add PyBind of LoDTensorArray * Add InferVarType * Add first unittest * Add ut * Add unittest * Add unittest * Add unittests * update * init * add infershape for lod_tensor_to_array_op * compelete array_to_lod_tensor_op * copy data * clean code * clean code * Fix unittest data * fix bugs * fix compile error * Refine TensorToArrayOp * refactor array_to_lod_tensor * Unittest * fix bugs * Fix unittest * Fix unittest * debug * Debug * Fix unittest * clean code * refactor * use ostream * update test * fix gpu build error * make gpu test pass
-
由 Yu Yang 提交于
-
- 27 10月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 21 9月, 2017 1 次提交
-
-
由 wanghaoshuang 提交于
-
- 08 9月, 2017 1 次提交
-
-
由 fengjiayi 提交于
-
- 07 9月, 2017 2 次提交
- 06 9月, 2017 3 次提交
- 05 9月, 2017 1 次提交
-
-
由 fengjiayi 提交于
-