- 05 7月, 2022 3 次提交
-
-
由 zhangbo9674 提交于
* refine tensor.dtype for bloat16 * refine test * revert * refine bfloat16 print
-
由 Ruibiao Chen 提交于
-
- 04 7月, 2022 4 次提交
-
-
由 Chenxiao Niu 提交于
-
由 zhaoying9105 提交于
-
由 Huihuang Zheng 提交于
-
- 02 7月, 2022 1 次提交
-
- 01 7月, 2022 12 次提交
-
-
由 Chen Weihang 提交于
-
由 Aurelius84 提交于
* [Dy2Stat]Polish break/continue statement transformer logic
-
由 enzodechine 提交于
* re-write the unit tests for compare xpu op *test=kunlun * re-write the unit tests for compare xpu op *test=kunlun Co-authored-by: Nrunzhech <runzh_chen@sjtu.edu.cn>
-
由 Aurelius84 提交于
* [Dy2Stat]Enhance nonlocal machanism while returning single var * [Dy2Stat]Enhance nonlocal machanism while returning single var
-
由 Chenxiao Niu 提交于
-
由 Weilong Wu 提交于
- 30 6月, 2022 10 次提交
-
-
由 zhoutianzi666 提交于
-
由 zhaoying9105 提交于
-
由 chentianyu03 提交于
* add relu6 kernel and yaml * format files * format code and fix bug * fix build failed
-
由 Chenxiao Niu 提交于
- 29 6月, 2022 4 次提交
-
-
由 QingshuChen 提交于
* skip xpu conv2d fp16 unitest *test=kunlun * minor *test=kunlun
- 28 6月, 2022 6 次提交
-
-
由 Aurelius84 提交于
-
由 Aurelius84 提交于
* [Dy2Stat]Polish all API name of _jst
-
由 zhaoying9105 提交于
-
由 Ming-Xu Huang 提交于
1. test_parallel_executor_seresnext_base_gpu failed on 2 P100 GPUs with `470.82` driver. ``` ====================================================================== FAIL: test_seresnext_with_learning_rate_decay (test_parallel_executor_seresnext_base_gpu.TestResnetGPU) ---------------------------------------------------------------------- Traceback (most recent call last): File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/test_parallel_executor_seresnext_base_gpu.py", line 32, in test_seresnext_with_learning_rate_decay self._compare_result_with_origin_model( File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/seresnext_test_base.py", line 56, in _compare_result_with_origin_model self.assertAlmostEquals( AssertionError: 6.8825445 != 6.882531 within 1e-05 delta (1.335144e-05 difference) ---------------------------------------------------------------------- ``` 2. To be more accuracte on evaluating loss convergence, we proposed to apply IOU as metric, instead of comparing first and last loss values. 3. As offline discussion, we also evaluated convergence on P100 and A100 in 1000 interations to make sure this UT have the same convergence property on both devices. The curves are showed below. data:image/s3,"s3://crabby-images/53b04/53b041b393852c9f42f1e4f942bf7fce21fb5b24" alt="A100-Single, P100-Single and Diff (1)"