- 25 8月, 2022 1 次提交
-
-
由 hong 提交于
* optimizer conv alog speed * code polish * remove useless code * fix compile error * fix cpu compile error * not use cudnn alog t * add search cache max number * polish code * fix cache test bug * add groups data format to conv args * fix cache test bug * fix cudnn_deterministic bug * fix test switch auto tune bug * fix test swith autotune bug; * fix conv cache bug * fix cache test error * fix cache test bug * fix windows mac compile error * fix workspace search error * update cudnn cache * fix cache test bug; test=develop * fix autotune swith test error * polish code * oplish code
-
- 23 8月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 03 8月, 2022 1 次提交
-
-
由 Thomas Young 提交于
* save change * save change by YSL * save change by YSL * change by YSL * test pre commit * Revert "test pre commit" This reverts commit eee5e116331186cc544de871b4a5174a6431f17c. * fix code style * fix ctest * temp save * save change * change by YSL * final change by ysl * fix ci * fix code style * delete unuse code * change by ysl
-
- 21 6月, 2022 2 次提交
-
-
由 Sing_chan 提交于
resort .cu headers, set clang-format not sort include block and consider .cu as main source file (#43633)
-
由 Zhang Ting 提交于
-
- 10 6月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* fix depthwise conv yaml error * fix depthwise conv double grad error
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 01 6月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add conv3d yaml * add conv3d_grad, conv3d_double_grad * add final_state_conv3d test case * add conv3d double test case * add depthwise_conv2d grad yaml * add depthwise_conv2d double grad test case * modify the order of args * add depthwise_conv2d_grad_grad config
-
- 30 5月, 2022 1 次提交
-
-
由 crystal 提交于
-
- 27 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor the optional tensor * remove optiona<MetaTensor> in InferMeta * fix bug * fix optional<vector<Tensor>> * fix bug * fix rmsprop * fix amp of eager_gen * polish code * fix deleted code * fix merge conflict * polish code * remove is_nullopt_ * fix merge conflict * fix merge conflict
-
- 15 4月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures * [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode * [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode * Enabled more test cases * [DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode * Adjusted test_imperative_star_gan_with_gradient_penalty.py
-
- 12 4月, 2022 1 次提交
-
-
由 hong 提交于
-
- 09 4月, 2022 2 次提交
-
-
由 hong 提交于
-
由 limingshu 提交于
* Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode. * Use the system cudaMalloc and cudaFree to allocate workspace during searching. * Enable switch of two kind of workspace setting methods. Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
- 06 4月, 2022 1 次提交
-
-
由 hong 提交于
* update * add conv yaml * add backward * remove useless code * fix bug * fix bug * revert fluid dygraph conv2d * remove useless infermeta function * fix meta fn deluplicat error * conv using custom impl * remove amp include * fix bug * use cudnn = true * fix test mkldnn caching bug
-
- 22 3月, 2022 1 次提交
-
-
由 hong 提交于
* move mutable_data to context alloc * move mutable_data to context alloc * remvoe duplicate code
-
- 21 3月, 2022 1 次提交
-
-
由 From00 提交于
* Move conv-transpose OPs to phi * Fix CI errors * Fix CI errors
-
- 16 3月, 2022 1 次提交
-
-
由 Zhang Zheng 提交于
* Optimize the computation of log_softmax * modify the var name
-
- 14 3月, 2022 2 次提交
-
-
由 Zhang Zheng 提交于
* Optimize performance of log_softmax * delete unity build * modify to phi * fix * fixfixfixfix * fix * fix * fix * fix * simplify * fix * fix enforce
-
由 From00 提交于
* Move Pool OPs to phi * Fix CI error * Fix conflicts
-
- 12 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename softmax kernel name * move softmax infershape * fix failed test
-
- 11 3月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 08 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove gpudnn suffix & polish cmake * fix typo
-
- 04 3月, 2022 1 次提交
-
-
由 hong 提交于
* move conv to pten * move conv to pten; test=develop * fix bug; * add conv cudnn impl; test=develop * update * update operator; test=develop * fix bug; test=develop * move operator and prepared_operator to develop; test=develop * resolve conflict; test=develop * remove useless code;test=develop * add depency ; test=develop * fix bug; * add sig.cc ; test=develop * fix use_op error; test=develop * fix bug; test=develop * fix bug; test=develop * add conv3d register; test=develop * fix star gan and conv_nn_grad test failed; test=develop * add header; test=develop * manul to recover to develop; * resolve confilct; test=develop * remove useless code * fix bug; * remove conv2d_cudnn; test=develop * fix bugs; test=develop * fix cpu rocm compile bugs; test=develop * fix blas error; test=develop * fix compile bug; test=develop * fix windows compile error; test=develop * fix windows error; test=develop * resolve confilct; test=develop
-
- 02 3月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add softmax log_softmax * refine rocm * refine unittest
-
- 25 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* support cudnn kernel moving * polish cmake rules * add unittest for coverage * remove orig kernel * remove softmax cudnn kernel * fix softmax test failed * fix npu func error * resolve conflict * rename gpu dnn kernels * fix name rule error * fix compile error * update fp16 namespace
-