Commits · b75507d34203c28fef5a2f1d1df4c57ba5633695 · 机器未来 / Paddle

26 Jan, 2022 1 commit

[PTen] Unify InferMeta(Shape) Function in pten and fluid op (#38976) · b75507d3

Committed by Chen Weihang on Jan 26, 2022

* infermeta context init design

* support infermeta called in fluid op

* add hasattr and attr methods

* add dygraph GetVarPtrs support

* rename arg_map_context to arg_map_utils

* add registry for arg map func

* resolve conflict

* refactor op utils design

* polish meta config

* fix details

* remove hasattr method

* resolve conflict

* revert cmake order change

* revert some change

* change init pos

* fix compile failed

* fix typo

* fix inference failed

* fix windows compile failed

* polish format
Co-authored-by: Wang Huan <wanghuan29@baidu.com>

25 Jan, 2022 16 commits
- reconstruct directory of ps (#39191) · 2bf9b844
  Committed by yaoxuefeng on Jan 25, 2022
- change infermeta and remove makePtenTensor in reshape (#39186) · 7613129e
  Committed by YuanRisheng on Jan 25, 2022
- GetWorkspaceSize trigger modification in heuristic cudnn conv (#39184) · 4c61e141
  Committed by limingshu on Jan 25, 2022
```
* first commit

* add more changes
```
- add trace event data structure definition (#39109) · 57b2033b
  Committed by chenjian on Jan 25, 2022
```
* add trace event data structure definition

* convert enum item to string for cupti enum explanation

* modify paddle_enforce_eq description
```
- [inference] update trt convert reduce op&ut,test=develop (#39088) · 80753755
  Committed by Zhang Jun on Jan 25, 2022
```
* [inference] update convert reduce op&ut,test=develop

* update

* update

* update

* add int32 support

* add int32 support

* add comments

* trt < 7.0 do not support int32

* test=develop

* update

* test=develop
```
- [MLU]add mlu kernel for fill_constant op (#39069) · 6e871dbc
  Committed by joeqiao12 on Jan 25, 2022
```
* [MLU]add mlu kernel for fill_constant op

* delete device_context DEPS
```
- Revert "Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#38959)" (#39205) · 978558be
  Committed by niuliling123 on Jan 25, 2022
```
This reverts commit 9059ef69.
```
- [MLU]add mlu kernel for split and concat (#39020) · ac3dc0bb
  Committed by joeqiao12 on Jan 25, 2022
```
* [MLU]add mlu kernel for concat and split op

* delete device_context DEPS
```
- [fleet_executor] Dist model run method Implementation (#39194) · 20e23e1b
  Committed by Yuang Liu on Jan 25, 2022
- Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#38959) · 9059ef69
  Committed by niuliling123 on Jan 25, 2022
- Optimize nearest_interp forward (#38528) · 232bbce2
  Committed by Lijunhui on Jan 25, 2022
```
* init commit

* remove comments

* remove nchw branch

* optimize code

* apply fast div mod in 1D kernel, rm 3D kernel

* move init of FastDivMode to CPU

* 3D kernel for nchw, FastDiv for 1D kernel

* debug done. process boundary

* 2^n

* optimize

* optimize

* change code & optimize code
```
- [Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338
  Committed by Weilong Wu on Jan 25, 2022
```
* Added selected_rows and rw_lock to pten

* Renamed the unit test target to fix CI

* Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid

* Remove rw_lock.h,rw_lock_test.cc in fluid

* Use pten::RWLock and pten::AutoRDLock, fix CI

* Use pten::SelectedRows

* Use pten::SelectedRows

* Fix to pass NPU CI

* Use pten::SelectedRows, to pass NPU CI

* To fix NPU CI

* To fix NPU CI again
```
- [pnorm] fix bug in fp16 & optimize memory (#39011) · 3825b40f
  Committed by Noel on Jan 25, 2022
- [PTEN] Add xpu context. (#39098) · c1e5a393
  Committed by Wilber on Jan 25, 2022
- Add GetBasePtr interface in paddle::memory (#39145) · b2a7261d
  Committed by From00 on Jan 25, 2022
- [PTen] Migrate string tinyformat errors and part of enforce into pten (#39051) · 6ca49164
  Committed by xiongkun on Jan 25, 2022
```
* transfer: string tinyformat errors and part of enforce into pten

* remove comment

* fix by code review

* assert is not compiled in -DNDEBUG

* add string as a dependency of paddle_inference
```
24 Jan, 2022 10 commits

fix test allreduce tests (#39166) · c00303ec
Committed by sneaxiy on Jan 24, 2022

[pten] add Scale xpu kernel (#39092) · 7874d0a5

Committed by chentianyu03 on Jan 24, 2022

* add scale xpu kernel

* add scale xpu kernel

* add scale xpu kernel

* replace with pten scale kernel

* change dev_ctx

* modify float16 head path

* remove unused xpu header

[Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and... · 3bf3a6ee

Committed by YuanRisheng on Jan 24, 2022

[Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048)

* refactor elementwise add grad

* fix compile bugs

* fix unit test bugs

* fix file conflicts

* fix bugs when buildPtenContext

Removed redundant definitions of likely/unlikely (#38911) · 43919d0a

Committed by Jacek Czaja on Jan 24, 2022

* - more unlikely

* - compilation fix

* - removed redundant definition

* - fix

* - Fixes

* - compilation fix for windows

[Pten] Migration of eigen numeric extensions and functors in paddle/fluid/operators/eigen (#39124) · a1e40dc6

Committed by Feiyu Chan on Jan 24, 2022

* migration of functors in paddle/fluid/operators/eigen and paddle/fluid/platform/eigen_ext.h
* update path of data types like float16.h in includes in extensions.h

unify compare functor (#39024) · def81b4f
Committed by Zhang Ting on Jan 24, 2022

[PTEN] Move dynload from fluid to pten. (#39120) · 3c1dc6f6

Committed by Wilber on Jan 24, 2022

* move dynload from fluid to pten.

* fix ci compile

* fix windows ci compile.

* update

* update

* fix compile error

[Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7

Committed by 石晓伟 on Jan 24, 2022

* updates callers, test=develop

* updates tensor, test=develop

* fixes errors, test=develop

* remove some dtypes, test=develop

* fix errors in the base storage modification, test=develop

* fixes a bug, test=develop

* fixes the bugs in push the whole, test=develop

* updates, test=develop

* update

* update, test=develop

* fixes the mac-py3 CI, test=develop

* remove the storage impl, test=develop

* updates some codes, test=develop

* update, test=develop

* updates pten allocation, test=develop

support sparse of adam, *test=kunlun (#38483) · e106901e

Committed by z8hanghuan on Jan 24, 2022

* support sparse of adam, *test=kunlun

* add pre-commit-config.yaml

* support sparse of adam in KL2,*test=kunlun

* support sparse of adam in KL2, *test=kunlun

* modify xpu.cmake, *test=kunlun

* support sparse of adam, rm some wait, *test=kunlun

* support sparse of adam, rm some wait, *test=kunlun

* support sparse of adam, *test=kunlun

* support sparse of adam, *test=kunlun

* support sparse of adam, *test=kunlun

* support sparse of adam, *test=kunlun

* support sparse of adam, *test=kunlun

Refactored python-level trace_op to call through _C_ops instead of... · c3796061

Committed by Zhanlue Yang on Jan 24, 2022

Refactored python-level trace_op to call through _C_ops instead of Tracer::TraceOp, under eager_mode (#38338)

* Replaced core.ops with _C_ops

* Refactored python-level trace_op to call through _C_ops instead of Tracer::TraceOp, under eager_mode

* Modified trace_op interface

* Refactored trace_op logic for eager mode

* Added Eager Dygraph support for OpTest

* Fixed ci issues

* Fixed CI failures

* Fixed Coverage CI Issues

* Fixed XPU CI Issues

23 Jan, 2022 1 commit

Support test_imperative apply and Add a setter for EagerTensor (#39016) · 8c5c1046

Committed by Weilong Wu on Jan 23, 2022

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* support more eager tensor api

* fix merge compile error

* fix compile error and fit develop code

* support pure CPU

* fix some logic error in eager_mode

* support _varbase_creator in eager mode

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

* for eager mode

* refine

* support multiple constructor for eager tensor

* add place related code

* polish code

* specify randint with dtype of int64

* Support pure cpu test

* eager logic

* refine test in pure cpu

* eager logic

* eager logic

* eager logic, test=develop

* skip core.eager when in inference, test=develop

* refine, test=develop

* refine, test=develop

* call RetainGrad after run forward kernel, test=develop

* refine, test=develop

* support dygraph util, meta, guard test

* eager test case

* support inference test

* refine test and fix initializer failed

* modify eagertensor patch method

* add eagertensor.clear_gradient, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* support create varbase and fix retain grad error

* call monkey_patch_varbase in _test_eager_guard, test=develop

* fix windows error

* split clear_gradient to clear_gradient and zero_grads, test=develop

* refine, test=develop

* refine, test=develop

* support test_imperative_basic test in eager mode

* remove additional log in variable.h

* remove additional log in variable.h

* remove additional code create in merge

* eager

* fix some eager logic, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* patch_tensor_method_func, test=develop

* refine, test=develop

* eager test case, test=develop

* refine, test=develop

* eager, test=develop

* eager, test=develop

* eager optimizer, test=develop

* eager optimizer, test=develop

* eager test_imperative_optimizer_v2, test=develop

* eager, test=develop

* refine, test=develop

* refine, test=develop

* eager, test=develop

* add resize in share buffer to, test=develop

* eager, test=develop

* fix _share_buffer_to, test=develop

* refine, test=develop

* refine, test=develop

* support eager for dataloader,test=develop

* Exposed EagerTensor's set func to implement set_value func

* Rename set to _set_value, Supplement the corresponding test case

* fix test concat dev api build failed

* fix conflict

* fix conflict

* Use extern to Polish code
Co-authored-by: jim19930609 <jim19930609@gmail.com>
Co-authored-by: JiabinYang <360788950@qq.com>
Co-authored-by: Wang Huan <wanghuan29@baidu.com>
Co-authored-by: wanghuancoder <wanghuancoder@163.com>
Co-authored-by: chentianyu03 <chentianyu03@baidu.com>

22 Jan, 2022 3 commits
- add infershape utils (#39140) · 36d9a364
  Committed by Chen Weihang on Jan 22, 2022
- add get inout var ptr for dygraph (#39134) · ec24bc98
  Committed by Chen Weihang on Jan 22, 2022
- [PTen] Add attr method for ArgumentMappingContext (#39130) · 7ac2f80f
  Committed by Chen Weihang on Jan 22, 2022
```
* add attr for arg map context

* add argument fn declare

* add attr test for get attr value method

* polish details
```
21 Jan, 2022 9 commits
- [pten] fix test concat dev api build failed (#39117) · a14dc688
  Committed by chentianyu03 on Jan 21, 2022
```
* fix test concat dev api build failed

* fix conflict

* fix conflict
```
- [PTen]Separate origin Kernel and add Kernel for C++ API (#39002) · a0f586bc
  Committed by YuanRisheng on Jan 21, 2022
```
* add kernel for c++ api

* fix compile bugs

* fix kunlun compile bugs

* perfect cmake

* fix compile bugs when run ci-inference

* fix compile bugs

* add non-raw kernel for fluid op

* fix compile bugs

* fix compile bugs

* fix unit test bug
```
- [pten] add concat pten kernel (#38955) · 06803c29
  Committed by chentianyu03 on Jan 21, 2022
- Renamed selected_rows.* -> selected_rows_utils.* (#39037) · 814e5ab4
  Committed by Weilong Wu on Jan 21, 2022
- modify DivideFunctor to match ElementwiseSameDims template (#39041) · df515255
  Committed by Zhang Ting on Jan 21, 2022
- Keep strided_slice op behavior consistent with slice op when starts input is... · b47fb764
  Committed by TeslaZhao on Jan 21, 2022
```
Keep strided_slice op behavior consistent with slice op when starts input is less than -rank (#39066)
```
- [MLU]add mlu ci dockerfile (#39021) · fdab43b5
  Committed by fwenguang on Jan 21, 2022
```
* [MLU]add mlu ci dockerfile

* fix comment

* add cncl
```
- refactor unittest for kunlun (#38772) · 4f1fef60
  Committed by TTerror on Jan 21, 2022
```
* refactor unittests for kunlun

* refactor unittests for kunlun, test=kunlun
```
- [PTen]Migrate Dim and DDim from paddle::framework into pten namespace (#39053) · 4e23ba32
  Committed by Aurelius84 on Jan 21, 2022
```
* Migrate Dim and DDim from paddle::framework into pten namespace

* fix paddle::framework::Array

* fix framework::Array
```