提交 · 06128b9f0c41375aa7e577c856a0e8563078dd3f · 机器未来 / Paddle

20 12月, 2021 1 次提交
- Z
  
  move the directory of fill kernels in pten (#38219) · 06128b9f
  由 zyfncg 提交于 12月 20, 2021
  
  06128b9f
17 12月, 2021 5 次提交

Support multi place constructor (#38171) · 6f439e5a

由 Jiabin Yang 提交于 12月 17, 2021

* support more eager tensor api

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* refine test in pure cpu

* refine test in pure cpu

6f439e5a

Refine some AMP operators for BERT (#37923) · d80fe268

由 sneaxiy 提交于 12月 17, 2021

* support multi precision update for LAMB

* hide some api

* fix ci uts

* fix lamb output of dygraph

* remove some changes to some PR

* try to fix Py3 CI compile error

* fix test_imperative_optimizer, add lars ut, add layer_norm ut

* fix ut, fix format

* fix ut

* fix windows ci

d80fe268

Generated CoreOpsInfos for potential use in append_op API (#38085) · e3b033f9

由 Zhanlue Yang 提交于 12月 17, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* Generated CoreOpsInfos for potential use in append_op API

* Fixed CI problem

e3b033f9

K

add op/api repeat/interleave (#37981) · a7de0e66
由 kuizhiqing 提交于 12月 17, 2021

a7de0e66
Y

[fleet_executor] run time graph on python side (#38164) · fc701369
由 Yuang Liu 提交于 12月 17, 2021

fc701369

16 12月, 2021 3 次提交

J
support eager switch system (#38170) · 8305c2be
由 Jiabin Yang 提交于 12月 16, 2021
```
* support eager switch system

* polish code
```
8305c2be

Add sparse_attention mask ,test=develop (#37973) · fa463b90

由 Liu-xiandong 提交于 12月 16, 2021

Add key_padding_mask and attn_mask in sparse_attention Api

1.Key padding mask is a tensor with dimensions [batch_size, seq_len], and attention mask is a tensor with dimensions [seq_len, seq_len]. The data types of the two masks are consistent with Q, K, and V, which are float32 or float64. If the value in Mask is 0, it means that the position needs to be masked.

2.The changed files are mainly paddle/fluid/operators/sparse_attention_op.cu and python/paddle/fluid/tests/unittests/test_sparse_attention_op.py. sparse_attention has three parts: sddmm, softmax, and dsd. Adding the mask operation only needs to modify the softmax. It has no effect on the other two parts. In addition, in order to test the mask function, related tests has been added.

fa463b90

Enabled Eager AutoCodeGen for All Existing Operators & Possible Future Operators (#37969) · 08482a86

由 Zhanlue Yang 提交于 12月 16, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Enabled Eager AutoCodeGen for All Existing Operators & Possible Future Operators

* Fixed CI issues

08482a86

15 12月, 2021 1 次提交

Synchronized auto-generated Python-C API with Dygraph Forward Functions (#38017) · 77dfb2e8

由 Zhanlue Yang 提交于 12月 15, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

77dfb2e8

14 12月, 2021 1 次提交

fix memory leak problen of set_value op (#38098) · f8202941

由 zyfncg 提交于 12月 14, 2021

* fix bug of set_value op

* fix BumpInplaceVersion

* polish some comments

* revert change of full_like

f8202941

13 12月, 2021 3 次提交
- T
  
  update xpu_memcpy (#38049) · bdf5834e
  由 taixiurong 提交于 12月 13, 2021
  
  bdf5834e
- X
  fix single card 8 unittests in new executor (#37957) · 9a4eec98
  由 xiongkun 提交于 12月 13, 2021
```
* fix single card 8 unittests in new executor

* fix

* fix
```
  9a4eec98
- W
  
  fix mac import hang, test=develop (#38051) · d3569c7e
  由 wanghuancoder 提交于 12月 13, 2021
  
  d3569c7e
10 12月, 2021 1 次提交
- L
  git ignore eager_op_function_impl.h (#38030) · 01b6bdf4
  由 Leo Chen 提交于 12月 10, 2021
```
* git ignore eager_op_function_impl.h

* test=document_fix
```
  01b6bdf4
09 12月, 2021 4 次提交
- A
  
  fix LoDTensorArray crash in Debug mode build (#37954) · 1f9a5d8f
  由 Aganlengzi 提交于 12月 09, 2021
  
  1f9a5d8f
- Z
  Fixed eager compilation issues by temporarily turn off AutoCodeGen fo… (#37992) · 34a06cf5
  由 Zhanlue Yang 提交于 12月 09, 2021
```
* Fixed eager compilation issues by temporarily turn off AutoCodeGen for specific ops

* Removed op_types
```
  34a06cf5
- J
  
  add ipu device p2 (#37840) · cb636a48
  由 jianghaicheng 提交于 12月 09, 2021
  
  cb636a48
- B
  
  Add varbase init name (#37947) · fdf62e1e
  由 Baibaifan 提交于 12月 09, 2021
  
  fdf62e1e
08 12月, 2021 2 次提交

[Eager] coreops to 495 (#37926) · aff7397b

由 wanghuancoder 提交于 12月 08, 2021

* refine a test case, test=develop

* publish python c api for eager, test=develop

* revert modify about test_allclose_layer.py, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* delete numpy includes, use pybind11 numpy.h, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* suport eager error msg, and add grad test case, test=develop

* refine, test=develop

* refine, test=develop

* generate eager core ops, only 4 ops, test=develop

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* open 500 list

* refine, test=develop

* refine, test=develop

* refine, test=develop

* fix auto code gen, test=develop

* Enabled generation for Operators without Grad/Inputs/Outputs

* refine, test=develop

* refine, test=develop

* refine, test=develop

* add to pyobject, test=develop

* Resolved operators without input

* merge pr 37837

* refine

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine,test=develop
Co-authored-by: Njim19930609 <jim19930609@gmail.com>

aff7397b

[Eager] generate eager core ops, only 4 ops (#37813) · 52f63cd2

由 wanghuancoder 提交于 12月 08, 2021

* refine a test case, test=develop

* publish python c api for eager, test=develop

* revert modify about test_allclose_layer.py, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* delete numpy includes, use pybind11 numpy.h, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* suport eager error msg, and add grad test case, test=develop

* refine, test=develop

* refine, test=develop

* generate eager core ops, only 4 ops, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

52f63cd2

07 12月, 2021 5 次提交
- Z
  Buf fix for reset grad inplace version (#37811) · cf586021
  由 Zhanlue Yang 提交于 12月 07, 2021
```
* Debug

* Fixed issue with reset_grad_inplace_version when used with clear_gradient & cross-batch accumulation

* Rearranged interfaces

* Fixed ci issues
```
  cf586021
- H
  Set runtime_include_dir in Paddle.__init__.py (#37886) · e3cca8ac
  由 Huihuang Zheng 提交于 12月 07, 2021
```
Paddle don't have to set runtime_include_dir during run CINN.
```
  e3cca8ac
- W
  [Eager] fix cmake generate error, and fix circular import (#37871) · 79c25979
  由 wanghuancoder 提交于 12月 07, 2021
```
* refine a test case, test=develop

* rm python, test=develop

* refine, test=develop

* fix cmake generate error, and fix circular import, test=develop
```
  79c25979
- J
  
  add ipu device p1 (#37841) · c9a3c669
  由 jianghaicheng 提交于 12月 07, 2021
  
  c9a3c669
- Y
  
  [fleet_executor] fix python gil problem (#37882) · c7cb7eec
  由 Yuang Liu 提交于 12月 07, 2021
  
  c7cb7eec
06 12月, 2021 1 次提交
- K
  
  heter for collective (#37613) · 1bdb8578
  由 kuizhiqing 提交于 12月 06, 2021
  
  1bdb8578
03 12月, 2021 3 次提交

W

Fix _numel func logic and add test (#37810) · 075a02d2
由 Weilong Wu 提交于 12月 03, 2021

075a02d2
R
refine structure for cuda and rocm (#37202) · a6d2fddb
由 ronnywang 提交于 12月 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
a6d2fddb

[Eager] publish python c api for eager (#37550) · 07b4fe93

由 wanghuancoder 提交于 12月 03, 2021

* refine a test case, test=develop

* publish python c api for eager, test=develop

* revert modify about test_allclose_layer.py, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* delete numpy includes, use pybind11 numpy.h, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* suport eager error msg, and add grad test case, test=develop

* refine, test=develop

* refine, test=develop

07b4fe93

02 12月, 2021 1 次提交
- L
  
  [Fleet Executor] Refine runtime graph (#37703) · 0074a3c9
  由 LiYuRio 提交于 12月 02, 2021
  
  0074a3c9
01 12月, 2021 2 次提交
- J
  Remove cpp layer (#37730) · 44def66a
  由 Jiabin Yang 提交于 12月 01, 2021
```
* optimizer __call__ to make dygraph faster

* fix return type

* remove cpp Layer
```
  44def66a
- Z
  
  Handled dispensable tensors in AutoCodeGen for Eager Dygraph (#37723) · 06c3cce9
  由 Zhanlue Yang 提交于 12月 01, 2021
  
  06c3cce9
30 11月, 2021 2 次提交
- Z
  [opt] Add regularation and Nesterov for mergerd_momentum op (#37527) · c8ffdecb
  由 zhangbo9674 提交于 11月 30, 2021
```
* add regularation and Nesterov for mergerd_momentum

* refine unittest for use_nesterov attr

* refine op check

* refine code

* fix bug

* refine code of regularization_flag

* delete useless code
```
  c8ffdecb
- L
  
  [Fleet_Executor] Passing runtime scope and place (#37603) · 87e65a99
  由 LiYuRio 提交于 11月 30, 2021
  
  87e65a99
27 11月, 2021 1 次提交

[NPU] reorganization for device API abstraction (#37110) · 72241a6a

由 Aganlengzi 提交于 11月 27, 2021

* [NPU] reorganization for device API abstraction

* [NPU] delete old files

* [NPU] fix npu_collective_helper

* [NPU] fix collective_helper

* [NPU] fix ut

* [NPU] mod memory allocation and hccl_helper

* [NPU] fix place_type

* [NPU] split enfoce.h

* move acl* call into npu_info

* merge conflict

* fix merge

* merge conflict

* merge conflict

72241a6a

26 11月, 2021 3 次提交

Z
upgrade async distributed training in pscore (#37515) · 74605fc2
由 zhaocaibei123 提交于 11月 26, 2021
```
* test

* test

* rm test

* update

* update

* update

* add unittest

* update

* update save
```
74605fc2

Added interface reset_grad_inplace_version (#37573) · dcb91fd7

由 Zhanlue Yang 提交于 11月 26, 2021

reset_inplace_version removes all inplace related records to VarBase/VariableWrapper, the essential purpose of which is to let you use inplace operations as if using its non-inplaced version, which of course will cause unexpected consequences if not used with care.

This is essentially a hack interface to satisfy one specific request

dcb91fd7

TDM2 (#37044) · 4826167c

由 wangzhen38 提交于 11月 26, 2021

* add tdm sample

* add tdm sample in c++

* update tdm sample

* modify sample count

* fix conflict

* add set_date

* fix cmake error

* fix bug of proto

* update index_dataset proto

* update cmake

* fix error cmake

* fix cmake mkldnn

* fix cmake proto

* update cmake proto

* update cmake

* update rec

* update dataset

* update dataset

* update dataset

* updata dataset

* updata dataset

* updata coverage

* updata ci

* goback4

* fix npu ci

* add xxhash dep

4826167c

25 11月, 2021 1 次提交
- L
  
  Export task node to python (#37509) · 3f815e76
  由 LiYuRio 提交于 11月 25, 2021
  
  3f815e76

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致