- 02 Mar, 2022 1 commit
  - Committed by zhangbo9674
    * add softmax log_softmax
    * refine rocm
    * refine unittest
- 25 Feb, 2022 1 commit
  - Committed by Chen Weihang
    * support cudnn kernel moving
    * polish cmake rules
    * add unittest for coverage
    * remove orig kernel
    * remove softmax cudnn kernel
    * fix softmax test failed
    * fix npu func error
    * resolve conflict
    * rename gpu dnn kernels
    * fix name rule error
    * fix compile error
    * update fp16 namespace
- 20 Feb, 2022 1 commit
  - Committed by Chen Weihang
    * rename pten dir to phi
    * rename namespace to phi
    * rename infrt pten dir to phi
    * resolve conflict
    * rename pten to phi in cmake
    * revert all infrt change
    * change needed files
    * fix infrt failed
    * fix inference failed
- 19 Feb, 2022 1 commit
  - Committed by Aurelius84
    * Unify paddle/pten::framework::ddim into pten::ddim
    * fix paddle namespace
    * compile successfully
    * fix npu src file
    * fix conflict
    * fix conflict
    * fix tensorrt compiler error
    * fix conflict
    * fix conflict
    * fix test file conflict
    * fix conflict
    * fix mlu file conflict
    * fix mlu file conflict
    * fix cinn header file conflict
    * fix conflict
    * fix conflict
    * fix conflict
    * fix conflict
- 11 Feb, 2022 1 commit
  - Committed by Feiyu Chan
    * move operators/math/math_function_* to pten/kernels/func
    * namespace from `paddle::operators::math` to `pten::funcs`
- 03 Dec, 2021 1 commit
  - Committed by ronnywang
    * refine structure for cuda and rocm
    * update
    * update
    * update
    * update
- 02 Apr, 2021 1 commit
  - Committed by ronnywang
- 03 Mar, 2021 1 commit
  - Committed by Qi Li
- 07 Jan, 2020 1 commit
  - Committed by Chen Weihang
- 05 Sep, 2019 1 commit
  - Committed by Tao Luo
    test=develop
- 11 Dec, 2018 1 commit
  - Committed by Yu Yang
    The macro should be defined by the compiler rather than in the source. test=develop
- 14 Nov, 2018 2 commits
  - Committed by Jacek Czaja
    test=develop
  - Committed by Tao Luo
- 09 Nov, 2018 2 commits
  - Committed by Jacek Czaja
    test=develop
  - Committed by Jacek Czaja
    test=develop
- 07 Nov, 2018 1 commit
  - Committed by chengduo
    * add fp16 backward support test=develop
    * add sum_op fp16 test
    * disable test_dist_save_load test=develop
    * add check_grad for sum
    * add unit test for softmax_grad fp16 test=develop
    * add scale_op unit test
    * add mul_grad_op unit test for fp16
    * add cross_entropy_grad and eman_grad unit test for fp16 test=develop
    * fix cross_entropy unit test
    * add pool2d fp16 unit test
    * refine conv2d fp16 unit test test=develop
    * refine activation unit test test=develop
    * fix ci test=develop
    * follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796 test=develop
- 17 Aug, 2018 1 commit
- 16 Aug, 2018 1 commit
  - Committed by dzhwinter
    * "cherry picked operators changes"
    * "remove duplicated code"
    * "add constant setter"
    * "add get expected kernel"
    * "fix ci"
    * "add fill constant"
- 26 Jul, 2018 1 commit
  - Committed by Wu Yi
    * refine conv cudnn enforce
    * update
    * update all cudnn ops
    * fix
- 07 Apr, 2018 1 commit
  - Committed by Kexin Zhao
    * initial commit
    * fix error
    * fix typo and order
- 21 Mar, 2018 1 commit
  - Committed by Kexin Zhao
- 15 Mar, 2018 1 commit
  - Committed by dzhwinter
    * "add softmax cudnn functor support"
    * "add testing"
    * "refine cmakelist"
    * "sequence softmax forward speed up"
    * "add softmax grad"
    * "fix sequence softmax test"
    * "add double precision"
    * "fix softmax test"
    * "add softmax cudnn support"
    * "fix softmax cudnn test"
    * "add softmax to nn.py"
    * "fix compile bug"
    * "refine cmakelist"
    * "fix ci"
    * "fix based on comment"
    * "fix based on comments"
    * "fix ci"
- 12 Feb, 2018 1 commit
  - Committed by qingqing01
- 10 Feb, 2018 2 commits
- 12 Dec, 2017 1 commit
  - Committed by QI JUN
    The main fixes are:
    - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
    - remove `eigen_device` interface in base class `DeviceContext`
    - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
    - remove unused `platform::EigenDeviceConverter`
    - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
    - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
- 13 Nov, 2017 1 commit
  - Committed by dangqingqing
- 28 Sep, 2017 1 commit
  - Committed by Liu Yiqun
- 22 Sep, 2017 1 commit
  - Committed by caoying03
- 13 Sep, 2017 1 commit
  - Committed by caoying03
- 25 Jul, 2017 1 commit
  - Committed by liaogang
- 17 Jul, 2017 2 commits
  - Committed by Yu Yang
  - Committed by Yan Chunwei
    * add NDEBUG switch to PADDLE_ENFORCE
- 11 Jul, 2017 2 commits
  - Committed by dongzhihong
  - Committed by dongzhihong
- 06 Jul, 2017 2 commits
- 05 Jul, 2017 1 commit
  - Committed by liaogang
- 04 Jul, 2017 2 commits