Commits · 5fbc26e282928d0bff08551d00cd0dd92cd3db07 · BaiXuePrincess / Paddle

05 Jul, 2022 · 3 commits
- refine tensor.dtype print format for bfloat16 (#44055) · b918d063
  Committed by zhangbo9674 2 years ago
```
* refine tensor.dtype for bfloat16

* refine test

* revert

* refine bfloat16 print
```
- Remove header file including for boost (#44052) · 52607cf8
  Committed by Ruibiao Chen 2 years ago
- [Dy2St]Add BaseTransformer for dy2st error message (#44054) · c10aa24f
  Committed by WangZhen 2 years ago
```
* Add BaseTransformer for dy2st error message

* Fix return_transformer error

* Polish dy2st error info in runtime

* Fix UT error

* Polish runtime error code
```
04 Jul, 2022 · 4 commits
- [MLU] uncomment some interp_v2 tests. (#44053) · 91c0f727
  Committed by Chenxiao Niu 2 years ago
- [MLU]: add hard_sigmoid,hard_sigmoid_grad,hard_swish,hard_swish_grad kernel (#44044) · bd06a828
  Committed by zhaoying9105 2 years ago
- Modify the unittests of the conv2d_transpose, gaussian_random op. test=kunlun (#43961) · 2b0c22ad
  Committed by Leo Guo 2 years ago
- [CINN] Enable test_resnet50_with_cinn (#44017) · cf8e86df
  Committed by Huihuang Zheng 2 years ago
02 Jul, 2022 · 1 commit

- unify cpu context, part2 (#44012) · 755438a7
  Committed by Leo Chen 2 years ago

  * fix init()
  * delete test_device_context
  * replace CPUDeviceContext with CPUContext
  * fix test_scalar
  * remove dot_op.cc
  * fix compile

01 Jul, 2022 · 12 commits
- convert graph to program to let StandaloneExecutor support CompiledProgram (#43448) · 8d9f00a8
  Committed by pangyoki 2 years ago
```
* convert graph to program to let StandaloneExecutor support CompiledProgram

* skip case that compiled_program._program is None

* execute CompiledProgram._compile to apply build_strategy
```
- add clip_extra and change use_combine_name (#44008) · f1ffd59a
  Committed by Chen Weihang 2 years ago
- update new unittests of flatten ops and layernorm, *test=kunlun (#43895) · 9a1fdad3
  Committed by ykkk2333 2 years ago
- fixes a bug, test=develop (#43970) · f3bdabc1
  Committed by 石晓伟 2 years ago
- Addition of switch_auto_tune option for transpose op (#43310) · 53d5abe3
  Committed by limingshu 2 years ago
```
* 2nd part of transpose update

* add switch_auto_tune option.

* add some changes according to Ci

* refine the structure of auto_tune_base.

* merge develop changes

* reset the switch_set_range and change unittest of transpose auto-tune

* change the kernel auto-tune logits
```
- Process sub-node in tensor_shape_transformer (#43998) · fac6a5f0
  Committed by WangZhen 2 years ago
- [Dy2Stat]Polish break/continue statement transformer logic (#43489) · cf8d42bb
  Committed by Aurelius84 2 years ago
```
* [Dy2Stat]Polish break/continue statement transformer logic
```
- Re-write the unit tests for compare xpu op (#43460) · 267d3191
  Committed by enzodechine 2 years ago
```
* re-write the unit tests for compare xpu op

*test=kunlun

* re-write the unit tests for compare xpu op

*test=kunlun
Co-authored-by: Nrunzhech <runzh_chen@sjtu.edu.cn>
```
- [Dy2Stat]Enhance nonlocal mechanism while returning single var (#43957) · 8571833f
  Committed by Aurelius84 2 years ago
```
* [Dy2Stat]Enhance nonlocal mechanism while returning single var

* [Dy2Stat]Enhance nonlocal mechanism while returning single var
```
- [MLU] add mlu kernel for fill_constant_batch_size_like (#43820) · 88e27a07
  Committed by 光明和真理 2 years ago
- [MLU] add rnn backward kernel. (#43969) · 3a59ede9
  Committed by Chenxiao Niu 2 years ago
- Switch eager mode to default dygraph mode (#43767) · 7499f961
  Committed by Weilong Wu 2 years ago
30 Jun, 2022 · 10 commits
- Move apis(digamma, dist, dot) from legacy_api.yaml to api.yaml (#43956) · f33763e3
  Committed by zyfncg 2 years ago
```
* move standard apis to api.yaml

* revert erfinv

* delete dot_op.h

* fix dot

* rerun ci
```
- remove decrease_axis in op_teller.cc , support them in slice (#43963) · 56ddd7c2
  Committed by zhoutianzi666 2 years ago
- [jit] save multi program into one param and separate model (#43686) · 23f18f46
  Committed by Hui Zhang 2 years ago
```
* save multi program into one param and separate model

* export class property
```
- [MLU] add mlu kernel for masked_select (#43816) · d29a1214
  Committed by 光明和真理 2 years ago
- [MLU] add exp and exp_grad kernel (#43852) · 59d50468
  Committed by zhaoying9105 2 years ago
- Add statistic code for memory (#43960) · 52d43ca2
  Committed by chenjian 2 years ago
```
* add code

* add unit test
```
- [new-exec] support running with different scope and the same program using scope_guard (#43962) · 99a4ff8f
  Committed by Leo Chen 2 years ago
```
* support scope_guard

* fix test
```
- [Dy2Static] Add non-local for while and for. (#43864) · 8279dfea
  Committed by xiongkun 2 years ago
```
* merge and add base support for non-local for

* for and while non-local support

* fix ci errors: v1

* fix bug

* fix

* fix code

* fix

* fix

* fix
```
- [phi]add relu6 kernel and yaml (#43549) · a9bba5ba
  Committed by chentianyu03 2 years ago
```
* add relu6 kernel and yaml

* format files

* format code and fix bug

* fix build failed
```
- [MLU] add rnn forward kernel. (#43894) · 2616d51a
  Committed by Chenxiao Niu 2 years ago
29 Jun, 2022 · 4 commits
- Support code auto-gene for optimizer api in yaml (#43915) · aa45f931
  Committed by zyfncg 2 years ago
```
* support complexd selected_rows kernel in yaml

* support configuring optimizer api in yaml

* fix data transform bug
```
- convert to mixed model python api (#43881) · cbaebb04
  Committed by Wilber 2 years ago
- add equal trt converter (#43461) · 1dbbe20e
  Committed by ccrrong 2 years ago
```
* add comparisons trt converter
```
- skip xpu conv2d fp16 unittest (#43547) · bceca47a
  Committed by QingshuChen 2 years ago
```
* skip xpu conv2d fp16 unittest
*test=kunlun

* minor
*test=kunlun
```
28 Jun, 2022 · 6 commits

- [fused_transformer] update transformer fusion for dygraph, test=allcases (#43858) · 99b3727d
  Committed by Yuang Liu 2 years ago
- [Dy2Stat]Enhance Python if-else by pruning useless no_return variable (#43880) · 6e0aa776
  Committed by Aurelius84 2 years ago
- [Dy2Stat]Unify all API name in_jst import path to improve readability (#43868) · 6cb24967
  Committed by Aurelius84 2 years ago
```
* [Dy2Stat]Polish all API name of _jst
```
- add unittest for PR43688 (#43747) · 13451615
  Committed by xiongkun 2 years ago
```
* add unittest for PR43688
```
- [MLU]: add roi_align and roi_align_grad kernel (#43757) · 99ea0a9c
  Committed by zhaoying9105 2 years ago

- Apply IOU to test_parallel_executor_seresnext_base_gpu (#43812) · c0cf5cb7
  Committed by Ming-Xu Huang 2 years ago

1. test_parallel_executor_seresnext_base_gpu failed on 2 P100 GPUs with `470.82` driver.
```
======================================================================
FAIL: test_seresnext_with_learning_rate_decay (test_parallel_executor_seresnext_base_gpu.TestResnetGPU)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/test_parallel_executor_seresnext_base_gpu.py", line 32, in test_seresnext_with_learning_rate_decay
    self._compare_result_with_origin_model(
  File "/opt/paddle/paddle/build/python/paddle/fluid/tests/unittests/seresnext_test_base.py", line 56, in _compare_result_with_origin_model
    self.assertAlmostEquals(
AssertionError: 6.8825445 != 6.882531 within 1e-05 delta (1.335144e-05 difference)
----------------------------------------------------------------------
```
2. To evaluate loss convergence more accurately, we propose applying IOU as the metric instead of comparing the first and last loss values.
3. As discussed offline, we also evaluated convergence on P100 and A100 over 1000 iterations to make sure this UT has the same convergence property on both devices. The curves are shown below.
![A100-Single, P100-Single and Diff (1)](https://user-images.githubusercontent.com/13541238/175461920-25df6101-6dd8-4387-862c-d1c8e9299c57.png)
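
The commit message above proposes IOU as the convergence metric but does not say how it is computed. The following is only a sketch under one assumed reading: each loss curve is treated as an area, the intersection is the pointwise minimum and the union is the pointwise maximum. `loss_curve_iou` is a hypothetical helper for illustration, not a function from Paddle's test suite.

```python
def loss_curve_iou(losses_a, losses_b):
    """Intersection-over-union of two non-negative loss curves.

    Assumption (not stated in the commit): both curves are sampled at the
    same iterations, so they can be compared point by point.
    """
    if len(losses_a) != len(losses_b):
        raise ValueError("loss curves must have the same length")
    if any(x < 0 for x in losses_a) or any(x < 0 for x in losses_b):
        raise ValueError("loss values must be non-negative")

    # Area shared by both curves: pointwise minimum.
    intersection = sum(min(a, b) for a, b in zip(losses_a, losses_b))
    # Area covered by either curve: pointwise maximum.
    union = sum(max(a, b) for a, b in zip(losses_a, losses_b))

    # Two all-zero (or empty) curves are treated as identical.
    return 1.0 if union == 0 else intersection / union


# The two loss values from the failing assertion in the traceback differ by
# about 1.3e-05 in absolute terms, yet their IoU stays very close to 1.
print(loss_curve_iou([6.8825445], [6.882531]))
```

Under this reading, a test would assert that the IoU stays above a chosen threshold, which tolerates small per-device numeric differences over the whole curve better than an absolute delta on single loss values.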
