- 07 9月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 01 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 09 4月, 2022 1 次提交
-
-
由 limingshu 提交于
* Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode. * Use the system cudaMalloc and cudaFree to allocate workspace during searching. * Enable switch of two kind of workspace setting methods. Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 15 4月, 2021 1 次提交
-
-
由 AshburnLee 提交于
-
- 11 1月, 2021 1 次提交
-
-
由 AshburnLee 提交于
-
- 05 8月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* cunn8 support test=develop * fix ci error test=develop
-
- 07 1月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
-
- 05 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 23 4月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* make_conv_cudnn_ws_size_configurable, test=develop * change std::max to std::min test=develop
-
- 25 1月, 2019 1 次提交
-
-
由 chengduo 提交于
* Revert "set constant for loss" This reverts commit 167933f678ccbb3563e949710279efe004a27731. * Revert "remove workspace_handle" test=develop This reverts commit b4aca8ede9e685bce1dfb1c59e63919f33432572.
-
- 22 1月, 2019 1 次提交
-
-
由 chengduo 提交于
* remove workspace_handle test=develop * set constant for loss test=develop
-
- 03 1月, 2019 1 次提交
-
-
由 qingqing01 提交于
test=develop
-
- 28 12月, 2018 1 次提交
-
-
由 qingqing01 提交于
* Inception fusion operator. * Support horizontal layer fusion in conv_fusion_op. * Search conv algo strategy for variable-length input. search N times and cache the searched algos. For other input, choose the algo of input whose area is closest to this input.
-