- 21 3月, 2022 1 次提交
-
-
由 From00 提交于
* Move conv-transpose OPs to phi * Fix CI errors * Fix CI errors
-
- 04 3月, 2022 1 次提交
-
-
由 hong 提交于
* move conv to pten * move conv to pten; test=develop * fix bug; * add conv cudnn impl; test=develop * update * update operator; test=develop * fix bug; test=develop * move operator and prepared_operator to develop; test=develop * resolve conflict; test=develop * remove useless code;test=develop * add depency ; test=develop * fix bug; * add sig.cc ; test=develop * fix use_op error; test=develop * fix bug; test=develop * fix bug; test=develop * add conv3d register; test=develop * fix star gan and conv_nn_grad test failed; test=develop * add header; test=develop * manul to recover to develop; * resolve confilct; test=develop * remove useless code * fix bug; * remove conv2d_cudnn; test=develop * fix bugs; test=develop * fix cpu rocm compile bugs; test=develop * fix blas error; test=develop * fix compile bug; test=develop * fix windows compile error; test=develop * fix windows error; test=develop * resolve confilct; test=develop
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 18 2月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* move blas related files * move lapack related files
-
- 11 2月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* move operators/math/math_function_* to pten/kernels/func * namespace from `paddle::operators::math` to `pten::funcs`
-
- 26 5月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
* optimize OP's compilation time, test=develop * add more op and run ci test, test=develop * CUDA Kernel register in cc file, test=develop * fix macros, test=develop * fix undefined symbol error, test=develop * fix compilation error and undefined symbol, test=develop * fix compilation error on Windows, test=develop * fix compilation error on Windows, test=develop
-
- 21 3月, 2021 1 次提交
-
-
由 Ouyang Chao 提交于
-
- 21 12月, 2020 1 次提交
-
-
由 LielinJiang 提交于
* add double grad for conv_transpose
-
- 24 9月, 2020 1 次提交
-
-
由 Kaipeng Deng 提交于
* fix error message in conv/conv_transpose. test=develop
-
- 03 12月, 2019 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 08 11月, 2019 1 次提交
-
-
由 liym27 提交于
fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997) * fix bug in pool/conv/conv_transpose: 1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation; 2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py; 3. fix bug of the computation process in function conv2dtranspose_forward_naive. test=develop * change test to make the data of different dimensions different. test=develop
-
- 31 10月, 2019 1 次提交
-
-
由 Zhang Ting 提交于
-
- 13 10月, 2019 1 次提交
-
-
由 Zhang Ting 提交于
-
- 07 10月, 2019 1 次提交
-
-
由 Zhang Ting 提交于
-
- 29 9月, 2018 1 次提交
-
-
由 Dun 提交于
* refine reduce by cub * optimize KernelDepthwiseConvFilterGrad * optimize depthwise conv and reduce mean and reduce sum * fix bug: dilation * cuda arch and cuda 8 compatible
-
- 27 6月, 2018 1 次提交
-
-
由 qingqing01 提交于
* Speed depthwise transposed conv2d.
-
- 22 5月, 2018 1 次提交
-
-
由 Yibing Liu 提交于
-
- 08 5月, 2018 2 次提交
-
-
由 Yu Yang 提交于
Do not use ctor * Reduce line of codes. * We can use virtual function for Maker now. * The implementation does not care what maker holds, it is easier to refactor later.
-
由 chengduoZH 提交于
-
- 04 5月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 03 5月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 11 4月, 2018 1 次提交
-
-
由 Siddharth Goyal 提交于
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 23 1月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine conv2d_transpose layer doc * fix conv2d_transpose doc * fix doc
-
- 15 1月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 14 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "unified operators" * "add CUDNN register" * "add use cudnn attribute" * "add attribute" * "test conv tranpose op" * "remove duplicated attr" * "fix op test" * "add attribute to set cudnn" * "add more log" * "need layout op register support" * "add more log" * "change GetExpectedKernelType " * "fix Get attr in conv_op" * "fix CI" * "fix tests" * "removed kernel priority fallback" * "fix CI" * "fix stack pointer bug" * "refine buggy interface" * "add const cast to save life" * "fix get_output_with_grad" * "fix op test with dataformat" * ""fix pooling * "fix pooling test" * "fix CI" * "fix with_gpu error" * "add transform needed functional check" * "fix unpack list error" * "comment out parallel.do temporary" * "fix CI" * "fix compile doc error" * "make threshold larger"
-
- 20 12月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Move framework.proto to proto namespace * Fix compile * Fix compile * Fix Compile
-
- 14 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 11 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 08 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 27 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 22 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 21 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 17 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 15 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-