Commits · 56dc8c798267e7b1e82f312651b4699fea15a99a · Crayon鑫 / Paddle

Mar 28, 2022 · 6 commits
- Enabled eager_mode for complex unit tests, except for test_complex_op.py and... · 56dc8c79
  Committed by Zhanlue Yang on Mar 28, 2022
```
Enabled eager_mode for complex unit tests, except for test_complex_op.py and test_complex_view_op.py (#40887)
```
  56dc8c79
- Launch fix port (#40936) · 8fe8039e
  Committed by kuizhiqing on Mar 28, 2022
  
  8fe8039e
- [Dy2Stat] Fix ForLoop Transformation with single return (#40683) · 287cbde8
  Committed by Aurelius84 on Mar 28, 2022
```
* [Dy2Stat] Fix ForLoop Transformation with single return

* [Dy2Stat] Fix ForLoop Transformation with single return
```
  287cbde8
- Refine test_lac.py for eager mode (#40951) · c03186f9
  Committed by 0x45f on Mar 28, 2022
```
* Refine test_lac.py for eager mode

* refine code

* Fix test_program_translator for eager
```
  c03186f9
- Fix bug while specifying target grad in high order gradient (#40940) · 0d0d76eb
  Committed by Aurelius84 on Mar 28, 2022
```
* Fix bug while specifying target grad in high order gradient

* add more unittest

* add more unittest
```
  0d0d76eb
- Bug fix for intermediate support in Yaml (#40935) · 3f4099ee
  Committed by Zhanlue Yang on Mar 28, 2022
  
  3f4099ee
Mar 27, 2022 · 5 commits

[ Optest ] refactor optest check_output_with_place logic (#40928) · 37f914c8
Committed by xiongkun on Mar 27, 2022
```
* first version, maybe many errors

* refactor op_test

* fix compare list

* fix bg

* fix bugs
```
37f914c8

[new-exec] fit for mkldnn and inplace op (#40955) · afa0e82c

Committed by Leo Chen on Mar 27, 2022

* fit for mkldnn and inplace op

* fix compile

* refine ut

* register op version

* fix inplace op

* fix transfer_layout

afa0e82c

Move slice to phi (#40736) · b8236b7b

Committed by hong on Mar 27, 2022

* move slice to pten

* merge develop; test=develop

* fix slice bug;

* update

* update

* fix error

* update

* fix bug

* polish code

* polish code

* polish code

* try to fix windows bug

* add gpu compile flag;

* try to fix

* remov template;

* polish code;

* fix npu bug;

* fix npu bug

* fix npu bug; test=develop

* fix slice bug;

* remove no need dep

b8236b7b

[NPU] fix npu cast ut (#40982) · f6b6b057
Committed by Aganlengzi on Mar 27, 2022
```
* [NPU] fix npu cast ut

* [NPU] fix npu cast ut
```
f6b6b057

Add StringTensor (#39830) · 0695e1ac

Committed by Jack Zhou on Mar 27, 2022

* add string tensor and case convert kernels

* Add strings empty kernel; Reorganize the structure of case convert kernel

* Add string infermeta

* Update mutable_data of string tensor

* rename kernel name

* add string copy tmp

* Fix strings copy device bug

* add utf8 gpu converter

* add string tensor c++ api

* Remove mutable_data of string tensor

* update string tensor interface

* remove charcases_flag.h

* remove some fluid headers

* Add make_ddim

* __HIPCC__ -> PADDLE_WITH_HIP

* remove fluid headers

* fix cpu compile

* remove std::hash

* Fix cudaMalloc

* Remove strings/impl directory

* Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps

* Add empty kernel test

* Remove some comments

* Modify lower/upper api encoding type: string->bool

* STRING->PSTRING; Add CreateInferLikeMeta

* Add code gen for C++ String API

* remove strings_api_utils.h

* Add ignore file (strings_api.h, strings_api.cc)

* update strings gen script

* change args order of case convert kernels

* Add comments for pstring, StringTensor

* cpstring_internal.h -> cpstring_impl.h

* Update accordding to comments:

1. Remove fluid headers
2. paddle::platform::errors -> phi::errors
3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
4. Use camel code style

* Remove all singletons in strings kernels

* fix rocm compile

* Fix py3 compile

* Fix c++ coverage

* 1. Add pstring proto type
2. Add StringTensor debug info
3. Rename case_convert_kernel to strings_lower_upper
4. Remove serialize derialize strings kernel

* DataLayout::PSTRING -> DataLayout::PSTRING_UNION

* Register pstring data type

* Fix strings api gen

* Fix dense tensor register pstring dtype

* Fix error messages

* remove line

* add pstring unittest

* remove test string api unitest

* remove empty line

* Remove some headers to decrease the size of executable file

0695e1ac

Mar 26, 2022 · 2 commits
- Move the redundant numpy() (#40931) · 7e05680c
  Committed by crystal on Mar 26, 2022
  
  7e05680c
- add double grad op example (#40963) · 0ee76f92
  Committed by Chen Weihang on Mar 26, 2022
  
  0ee76f92
Mar 25, 2022 · 19 commits

update eager code gen (#40924) · afe2fdd1

Committed by hong on Mar 25, 2022

* update

* remove useless code

* remove label smooth test

* polish code

* polish code

* polish code

* remove _in_eager_mode error;

afe2fdd1

fix lars optitmizer bug (#40892) · c006a609
Committed by duanboqiang on Mar 25, 2022
```
* fix lars optitmizer bug

* Update optimizer.py
```
c006a609

fix sync_bn error in fp16 amp-o2 (#40943) · 9ab3c76b
Committed by zhangbo9674 on Mar 25, 2022

9ab3c76b

[MLU]add allreduce max/prod/min mlu kernel (#40792) · 9261dff4
Committed by zn on Mar 25, 2022

9261dff4

Fix param@grad type error for amp in run_program (#40938) · 54632b5c
Committed by 0x45f on Mar 25, 2022

54632b5c

Fix in dygraph mode doc (#40942) · 09e5b00c
Committed by Jiabin Yang on Mar 25, 2022
```
* fix doc for enable api

* test=document_fix
```
09e5b00c

[Auto parallel] align infer accuracy for ernie generator mode (#40077) · 02146ba5
Committed by JZ-LIANG on Mar 25, 2022
```
* [Auto Parallel] Support the auto completion of while_op
* align infer  accuracy
```
02146ba5

test=document_fix (#40919) · 961ef4de
Committed by Jiaqi Liu on Mar 25, 2022

961ef4de

add cast_grad phi kernel (#40798) · b79c6a9b

Committed by zhangbo9674 on Mar 25, 2022

* add cast_grad phi kernel

* refie unittest

* refien unittest

* refine unittest

* refine include header path

* refien xpu cast unittest

* refine code

b79c6a9b

support multi_dims for tril_triu, *test=kunlun (#40712) · 9ffedcfd

Committed by z8hanghuan on Mar 25, 2022

* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* update xpu.cmake date, support multi_dims for tril_triu, *test=kunlun

9ffedcfd

change CUDA implementation of dropout OP (#40874) · 1c01d1cc
Committed by zhouweiwei2014 on Mar 25, 2022

1c01d1cc

fix paddle.vision.transforms.Resize en docs (#40719) · 236a3bc5
Committed by Liyulingyue on Mar 25, 2022
```
* Update transforms.py

* Update transforms.py

* Update transforms.py

* Update functional.py
```
236a3bc5

Refactor Dygraph Flags (#40786) · 3085d5e4

Committed by Jiabin Yang on Mar 25, 2022

* refactor eager flags

* fix flags error when we switch from eager to dygraph

* fix ci problem

* fix ci

* fix ci

* merge develop and fix code style

* merge develop and fix code style

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* merge develop

3085d5e4

fix xpu op test, *test=kunlun (#40862) · 1db9cd46
Committed by TTerror on Mar 25, 2022

1db9cd46

[OpTest] Polish optest (#40879) · d43e8433

Committed by xiongkun on Mar 25, 2022

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

* refine the logic of prepara_parameter logic

* fix Tensor(gpu) 2 Scalar segment fault.

* add multi-attribute. (test_unsqueeze_op); add python_sig_out for customizing op sig out

* fix some bugs, support python_out_sig

d43e8433

[NPU] add merged_momentum (#40875) · 2b74b739
Committed by Aganlengzi on Mar 25, 2022
```
* [NPU] add merged_momentum

* fix

* fix device
```
2b74b739

modify unit test in bn, stack and split. *test=kunlun (#40880) · 139a30ec
Committed by Zhangjingyu06 on Mar 25, 2022

139a30ec

Scalar support marking data_type in yaml (#40867) · 04087012
Committed by zyfncg on Mar 25, 2022
```
* Scalar support marking data_type in yaml

* fix code-gene bug
```
04087012

support get_item where the index is a bool scalar tensor (#40829) · 0f5e90a2

Committed by FlyingQianMM on Mar 25, 2022

* support get_item where the index is a bool scalar tensor

* add unittests for supporting get_item where the index is a bool scalar tensor

0f5e90a2

Mar 24, 2022 · 8 commits

Support intermediate for Sparse API (#40840) · 98244a9a

Committed by zyfncg on Mar 24, 2022

* support intermediate for saprse api

* close intermediate in yaml

* fix dygraph_api dep for eager

98244a9a

[AMP] Support amp for Intermediate_dygraph (#40623) · c12f7d48

Committed by zhangbo9674 on Mar 24, 2022

* approve amp for intermediate_dygraph

* add amp_utils for intermediate_dygraph

* add amp needcast check for mlu & npu

* test unittest

* add SetGradNode for set_stop_gradient && add checktensor for GradientHooks

* refine code

* refien unittest of imperative_amp for new dygraph

* inplace api skip amp

* add test_imperative_qat_amp for intermediate amp

* refine code

* refine test_amp ci strategy

* refine unittest code

* refine amp_utils code

* refine amp getpromotetype for some special op

* refine unittest code

c12f7d48

[MoE]Assign pos op (#40580) · 305f32d1

Committed by Roc on Mar 24, 2022

* # This is a combination of 10 commits.
# The first commit's message is:
add expert count op

add ut for expert_count

# This is the 2nd commit message:

update UT only for cuda

# This is the 3rd commit message:

fix for rocm

# This is the 4th commit message:

update ut

# This is the 5th commit message:

add moe module

# This is the 6th commit message:

add expert count op

add ut for expert_count

# This is the 7th commit message:

update UT only for cuda

# This is the 8th commit message:

update ut

# This is the 9th commit message:

add moe module

# This is the 10th commit message:

make expert count private

* add assign pos op

* fix upper num name

* add api _assign pos

* add ut for assign pos op

* update date

* fix for win

* update for test (timeout)

* fix ut

* update

* fix ut for number count
Co-authored-by: Nhlygit66666 <2570058140@qq.com>

305f32d1

Wrap dist api for dygraph mode (#40408) · 9d8cfc1b
Committed by lilong12 on Mar 24, 2022

9d8cfc1b

support dp for class_center_sample and margin_cross_entropy (#39852) · bff9e28e
Committed by Guoxia Wang on Mar 24, 2022

bff9e28e

test=document_fix , fix launch doc (#40848) · 2e8f9882
Committed by kuizhiqing on Mar 24, 2022
```
* test=document_fix , fix launch doc

* test=document_fix , fix typo
```
2e8f9882

Fix rnn, wmt16 docs;test=document_fix (#40783) · cc8e98c7

Committed by Jack Zhou on Mar 24, 2022

* Fix rnn, wmt16 docs;test=document_fix

* Fix wmt14 docs;test=document_fix

* Add more description;test=document_fix

cc8e98c7

[Auto Parallel] Gradient merge pass support dist attribute (#40737) · 0443c6f4
Committed by xiayanming on Mar 24, 2022
```
* [Auto Parallel] gradient merge pass support dist attribute
```
0443c6f4

Crayon鑫 / Paddle · In sync with the fork source project
