提交 · 7499f9613114b903866ef105489b79a41bf1b837 · BaiXuePrincess / Paddle

01 7月, 2022 1 次提交
- W
  
  Switch eager mode to default dygraph mode (#43767) · 7499f961
  由 Weilong Wu 提交于 7月 01, 2022
  
  7499f961
30 6月, 2022 10 次提交
- Z
  Move apis(digamma, dist, dot) from legacy_api.yaml to api.yaml (#43956) · f33763e3
  由 zyfncg 提交于 6月 30, 2022
```
* move standard apis to api.yaml

* revert erfinv

* delete dot_op.h

* fix dot

* rerun ci
```
  f33763e3
- Z
  
  remove decrease_axis in op_teller.cc , support them in slice (#43963) · 56ddd7c2
  由 zhoutianzi666 提交于 6月 30, 2022
  
  56ddd7c2
- H
  [jit] save multi program into one param and seperate model (#43686) · 23f18f46
  由 Hui Zhang 提交于 6月 30, 2022
```
* save multi program into one param and seperate model

* export class property
```
  23f18f46
- [MLU] add mlu kernel for masked_select (#43816) · d29a1214
  由光明和真理提交于 6月 30, 2022
  
  d29a1214
- Z
  
  [MLU] add exp and exp_grad kernel (#43852) · 59d50468
  由 zhaoying9105 提交于 6月 30, 2022
  
  59d50468
- C
  Add statistic code for memory (#43960) · 52d43ca2
  由 chenjian 提交于 6月 30, 2022
```
* add code

* add unit test
```
  52d43ca2
- L
  [new-exec] support runing with different scope and the same program using scope_guard (#43962) · 99a4ff8f
  由 Leo Chen 提交于 6月 30, 2022
```
* support scope_guard

* fix test
```
  99a4ff8f
- X
  [Dy2Static] Add non-local for while and for. (#43864) · 8279dfea
  由 xiongkun 提交于 6月 30, 2022
```
* merge and add base support for non-local for

* for and while non-local support

* fix ci errors: v1

* fix bug

* fix

* fix code

* fix

* fix

* fix
```
  8279dfea
- C
  [phi]add relu6 kernel and yaml (#43549) · a9bba5ba
  由 chentianyu03 提交于 6月 30, 2022
```
* add relu6 kernel and yaml

* format files

* format code and fix bug

* fix build failed
```
  a9bba5ba
- C
  
  [MLU] add rnn forward kernel. (#43894) · 2616d51a
  由 Chenxiao Niu 提交于 6月 30, 2022
  
  2616d51a
29 6月, 2022 4 次提交
- Z
  Support code auto-gene for optimizer api in yaml (#43915) · aa45f931
  由 zyfncg 提交于 6月 29, 2022
```
* support complexd selected_rows kernel in yaml

* support configuring optimizer api in yaml

* fix data transform bug
```
  aa45f931
- W
  
  convert to mixed model python api (#43881) · cbaebb04
  由 Wilber 提交于 6月 29, 2022
  
  cbaebb04
- C
  add equal trt converter (#43461) · 1dbbe20e
  由 ccrrong 提交于 6月 29, 2022
```
* add comparisons trt converter
```
  1dbbe20e
- Q
  skip xpu conv2d fp16 unitest (#43547) · bceca47a
  由 QingshuChen 提交于 6月 29, 2022
```
* skip xpu conv2d fp16 unitest
*test=kunlun

* minor
*test=kunlun
```
  bceca47a
28 6月, 2022 11 次提交

Y

[fused_transformer] update transformer fustion for dygraph, test=allcases (#43858) · 99b3727d
由 Yuang Liu 提交于 6月 28, 2022

99b3727d
A

[Dy2Stat]Enhance Python if-else by pruning usless no_return variable (#43880) · 6e0aa776
由 Aurelius84 提交于 6月 28, 2022

6e0aa776
A
[Dy2Stat]Unify all API name in_jst import path to improve readablity (#43868) · 6cb24967
由 Aurelius84 提交于 6月 28, 2022
```
* [Dy2Stat]Polish all API name of _jst
```
6cb24967
X
add unittest for PR43688 (#43747) · 13451615
由 xiongkun 提交于 6月 28, 2022
```
* add unittest for PR43688
```
13451615
Z

[MLU]: add roi_align and roi_align_grad kernel (#43757) · 99ea0a9c
由 zhaoying9105 提交于 6月 28, 2022

99ea0a9c

Apply IOU to test_parallel_executor_seresnext_base_gpu (#43812) · c0cf5cb7

由 Ming-Xu Huang 提交于 6月 28, 2022

1. test_parallel_executor_seresnext_base_gpu failed on 2 P100 GPUs with `470.82` driver.
```
======================================================================
FAIL: test_seresnext_with_learning_rate_decay (test_parallel_executor_seresnext_base_gpu.TestResnetGPU)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/test_parallel_executor_seresnext_base_gpu.py", line 32, in test_seresnext_with_learning_rate_decay
    self._compare_result_with_origin_model(
  File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/seresnext_test_base.py", line 56, in _compare_result_with_origin_model
    self.assertAlmostEquals(
AssertionError: 6.8825445 != 6.882531 within 1e-05 delta (1.335144e-05 difference)
----------------------------------------------------------------------
```
2. To be more accuracte on evaluating loss convergence, we proposed to apply IOU as metric, instead of comparing first and last loss values.
3. As offline discussion, we also evaluated convergence on P100 and A100 in 1000 interations to make sure this UT have the same convergence property on both devices. The curves are showed below.
![A100-Single, P100-Single and Diff (1)](https://user-images.githubusercontent.com/13541238/175461920-25df6101-6dd8-4387-862c-d1c8e9299c57.png)

c0cf5cb7

F

[MLU]add mlu kernel for where_index op (#43720) · ec5f8cfd
由 fuyou765 提交于 6月 28, 2022

ec5f8cfd
【Sparse】add SparseTensor mv kernel(csr*dense_vec->dence_vec, coo*dense_vec->dense_vec) (#43668) · 5161a047
由 zhouweiwei2014 提交于 6月 28, 2022
```
* [Sparse]add SparseTensor mv kernel(csr*dense_vec->dence_vec, coo*dense_vec->dense_vec)

* fix CI
```
5161a047
M

[ASP] fix some bugs of asp (#43853) · 6aeb60aa
由 minghaoBD 提交于 6月 28, 2022

6aeb60aa
Z

fix squeeze2/unsqueeze2 unittest, *test=kunlun (#43859) · b34b54db
由 zhangxiaoci 提交于 6月 28, 2022

b34b54db

Add forward_gradients api and enable high-order differentiation for Jacobian/Hessian (#43354) · a97a8dd1

由 Xiaoxu Chen 提交于 6月 28, 2022

* enable Jacobian,Hessian supporting new autograd

* fix prim mode failed in PR-CI-Windows

* add forward_gradients api

* add forward_gradients api

* skip test_autograd_functional_prim in windows ci

* fix test_autograd_funciton_prim timeouot

* remove the block parameter in prim2orig method

* remove duplicate to_tensors code snippet # test=allcases

a97a8dd1

27 6月, 2022 7 次提交
- A
  [Dy2Stat]Refactor convert_shape transformer logic (#43846) · d82d5b8c
  由 Aurelius84 提交于 6月 27, 2022
```
* [Dy2Stat]Refactor convert_shape transformer logic

* clean usless unittest
```
  d82d5b8c
- W
  [Eager] Rename EagerPyLayer to PyLayer (#43696) · a5dc0a79
  由 wanghuancoder 提交于 6月 27, 2022
```
* rename eagerpylayer
```
  a5dc0a79
- A
  [CustomDevice]add custom place supports (#43813) · 7f22ef54
  由 Aganlengzi 提交于 6月 27, 2022
```
* [CustomDevice]add custom place supports

* sync format
```
  7f22ef54
- G
  
  fix post_training_quantization typo (#43845) · b848bd37
  由 Guanghua Yu 提交于 6月 27, 2022
  
  b848bd37
- A
  
  [Dy2Stat]Enhance nonlocal machanism while nonlocal vars is empty (#43848) · 40a77319
  由 Aurelius84 提交于 6月 27, 2022
  
  40a77319
- J
  [Docs] Fix doc of kaiming initializer (#43823) · e6e1c5e7
  由 Jackwaterveg 提交于 6月 27, 2022
```
* Update kaiming.py

* Update initializer.py

* fix doc bug;test=document_fix

* fix doc;test=document_fix

* Update initializer.py

* Update kaiming.py

* for ci;test=document_fix
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
```
  e6e1c5e7
- Z
  
  amp.decorate save_dtype hook skil int (#43824) · dfdcfb94
  由 zhangbo9674 提交于 6月 27, 2022
  
  dfdcfb94
24 6月, 2022 7 次提交

Fix hang bug of TCPStore (#43724) · 4c9330d6

由 gongweibao 提交于 6月 24, 2022

* tmp fix

* init

* compile ok

* compile ok

* add vlogs

* add test

* fix termination error

* add testfile

* add

* fix window compile

* fix window compile

* fix windows compile

* fix windows compile

* fix windows compile

* fix windows compile

* fix windows compile

* fix windows compile

* fix kunlun compile

* fix compilation

* fix compilation

* fix compilation

* tmp fix

* add windows

* add windows

* add more logs

* change timeout to protected

* SB

* add

* add

* fix timeout

* add

* fix test

* fix test

* fix test

* fix ut

* fix ut

* fix ut

4c9330d6

G

fix quantization clip and round Attribute (#43764) · 491b87b4
由 Guanghua Yu 提交于 6月 24, 2022

491b87b4

[ Dy2Static ] Add closure analysis for control flow and add some unittest (#43713) · 69717717

由 xiongkun 提交于 6月 24, 2022

* add closure analysis for control flow and add some unittest

* finetune the design of FunctionScopeVisitor

* fix

* fix python check

* fix code by code review

69717717

C
add slice plugin int32 support (#43808) · af97b310
由 ccrrong 提交于 6月 24, 2022
```
* add slice plugin int32 support
```
af97b310
[Sparse] support batch compute of SparseTensor matmul/masked_matmul/softmax (#43703) · eec4e034
由 zhouweiwei2014 提交于 6月 24, 2022

eec4e034
F

[MLU]add mlu kernel for set_value op (#43687) · fa9586a7
由 fuyou765 提交于 6月 24, 2022

fa9586a7

modify xpu unittest to support fp64, *test=kunlun (#43772) · 89c783db

由 z8hanghuan 提交于 6月 24, 2022

* modify xpu unittest to support fp64, *test=kunlun

* modify xpu unittest to support fp64 for KL2, *test=kunlun

* modify xpu unittest to support fp64, *test=kunlun

* modify xpu unittest to support fp64, *test=kunlun

89c783db

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致