- 01 7月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
-
- 30 6月, 2022 10 次提交
-
-
由 zyfncg 提交于
* move standard apis to api.yaml * revert erfinv * delete dot_op.h * fix dot * rerun ci
-
由 zhoutianzi666 提交于
-
由 Hui Zhang 提交于
* save multi program into one param and seperate model * export class property
-
由 光明和真理 提交于
-
由 zhaoying9105 提交于
-
由 chenjian 提交于
* add code * add unit test
-
由 Leo Chen 提交于
* support scope_guard * fix test
-
由 xiongkun 提交于
* merge and add base support for non-local for * for and while non-local support * fix ci errors: v1 * fix bug * fix * fix code * fix * fix * fix
-
由 chentianyu03 提交于
* add relu6 kernel and yaml * format files * format code and fix bug * fix build failed
-
由 Chenxiao Niu 提交于
-
- 29 6月, 2022 4 次提交
-
-
由 zyfncg 提交于
* support complexd selected_rows kernel in yaml * support configuring optimizer api in yaml * fix data transform bug
-
由 Wilber 提交于
-
由 ccrrong 提交于
* add comparisons trt converter
-
由 QingshuChen 提交于
* skip xpu conv2d fp16 unitest *test=kunlun * minor *test=kunlun
-
- 28 6月, 2022 11 次提交
-
-
由 Yuang Liu 提交于
-
由 Aurelius84 提交于
-
由 Aurelius84 提交于
* [Dy2Stat]Polish all API name of _jst
-
由 xiongkun 提交于
* add unittest for PR43688
-
由 zhaoying9105 提交于
-
由 Ming-Xu Huang 提交于
1. test_parallel_executor_seresnext_base_gpu failed on 2 P100 GPUs with `470.82` driver. ``` ====================================================================== FAIL: test_seresnext_with_learning_rate_decay (test_parallel_executor_seresnext_base_gpu.TestResnetGPU) ---------------------------------------------------------------------- Traceback (most recent call last): File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/test_parallel_executor_seresnext_base_gpu.py", line 32, in test_seresnext_with_learning_rate_decay self._compare_result_with_origin_model( File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/seresnext_test_base.py", line 56, in _compare_result_with_origin_model self.assertAlmostEquals( AssertionError: 6.8825445 != 6.882531 within 1e-05 delta (1.335144e-05 difference) ---------------------------------------------------------------------- ``` 2. To be more accuracte on evaluating loss convergence, we proposed to apply IOU as metric, instead of comparing first and last loss values. 3. As offline discussion, we also evaluated convergence on P100 and A100 in 1000 interations to make sure this UT have the same convergence property on both devices. The curves are showed below. ![A100-Single, P100-Single and Diff (1)](https://user-images.githubusercontent.com/13541238/175461920-25df6101-6dd8-4387-862c-d1c8e9299c57.png)
-
由 fuyou765 提交于
-
由 zhouweiwei2014 提交于
* [Sparse]add SparseTensor mv kernel(csr*dense_vec->dence_vec, coo*dense_vec->dense_vec) * fix CI
-
由 minghaoBD 提交于
-
由 zhangxiaoci 提交于
-
由 Xiaoxu Chen 提交于
* enable Jacobian,Hessian supporting new autograd * fix prim mode failed in PR-CI-Windows * add forward_gradients api * add forward_gradients api * skip test_autograd_functional_prim in windows ci * fix test_autograd_funciton_prim timeouot * remove the block parameter in prim2orig method * remove duplicate to_tensors code snippet # test=allcases
-
- 27 6月, 2022 7 次提交
-
-
由 Aurelius84 提交于
* [Dy2Stat]Refactor convert_shape transformer logic * clean usless unittest
-
由 wanghuancoder 提交于
* rename eagerpylayer
-
由 Aganlengzi 提交于
* [CustomDevice]add custom place supports * sync format
-
由 Guanghua Yu 提交于
-
由 Aurelius84 提交于
-
由 Jackwaterveg 提交于
* Update kaiming.py * Update initializer.py * fix doc bug;test=document_fix * fix doc;test=document_fix * Update initializer.py * Update kaiming.py * for ci;test=document_fix Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
-
由 zhangbo9674 提交于
-
- 24 6月, 2022 7 次提交
-
-
由 gongweibao 提交于
* tmp fix * init * compile ok * compile ok * add vlogs * add test * fix termination error * add testfile * add * fix window compile * fix window compile * fix windows compile * fix windows compile * fix windows compile * fix windows compile * fix windows compile * fix windows compile * fix kunlun compile * fix compilation * fix compilation * fix compilation * tmp fix * add windows * add windows * add more logs * change timeout to protected * SB * add * add * fix timeout * add * fix test * fix test * fix test * fix ut * fix ut * fix ut
-
由 Guanghua Yu 提交于
-
由 xiongkun 提交于
* add closure analysis for control flow and add some unittest * finetune the design of FunctionScopeVisitor * fix * fix python check * fix code by code review
-
由 ccrrong 提交于
* add slice plugin int32 support
-
由 zhouweiwei2014 提交于
-
由 fuyou765 提交于
-
由 z8hanghuan 提交于
* modify xpu unittest to support fp64, *test=kunlun * modify xpu unittest to support fp64 for KL2, *test=kunlun * modify xpu unittest to support fp64, *test=kunlun * modify xpu unittest to support fp64, *test=kunlun
-