提交 · 943dedec4c85e611780cc552783fa313d0ea4e95 · BaiXuePrincess / Paddle

01 3月, 2022 8 次提交

P

add sgd kernel; test=develop · 943dedec
由 phlrain 提交于 3月 01, 2022

943dedec
P

update · 26aac8d8
由 phlrain 提交于 3月 01, 2022

26aac8d8

[bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843) · ce8ed978

由 zhangbo9674 提交于 3月 01, 2022

* add layer norm

* add p norm

* add reduce sum

* refine layer norm register bf16 for cudnn811

* add bf16 cast for hip

* add unittest

* refine rocm

* refine layer_norm unittest

* refine reduce op

* refine unittest

* enhance atol for reduce unittest

ce8ed978

[bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332

由 zhangbo9674 提交于 3月 01, 2022

* add scale gather sum

* refine CUDA_ATOMIC_WRAPPER ADD for bf16

* add gather unittest

* solve conflict

* add scale uinttest

* add sum unittest

* solve conflict

* refine gather unittest

* refine unittest

6d26b332

R

[phi] migrate where kernel into phi (#39811) · 468a2a17
由 ronnywang 提交于 3月 01, 2022

468a2a17
L
[phi] move uniform_random to phi (#39937) · b3466387
由 Leo Chen 提交于 3月 01, 2022
```
* move uniform_random to phi

* fit selected_rows

* replace mutable_data
```
b3466387

[PHI] Support Multi Input and Output for InferShape (#39870) · e8d45583

由 zyfncg 提交于 3月 01, 2022

* add multi input for infer_shape

* support multi output for infershape

* fix split bug

* fix bug of concat

* support vector<MetaTensor*> in infrt

* fix bug

e8d45583

A
[Phi] Migrate logical_and/or/not/xor into Phi (#39942) · 8c237973
由 Aurelius84 提交于 3月 01, 2022
```
* [Phi] Migrate logical_and/or/not/xor into Phi

* fix unittest

* fix function name
```
8c237973

28 2月, 2022 3 次提交

Move index sample (#39905) · 1b585b28

由 seemingwang 提交于 2月 28, 2022

* graph engine demo

* upload unsaved changes

* fix dependency error

* fix shard_num problem

* py client

* remove lock and graph-type

* add load direct graph

* add load direct graph

* add load direct graph

* batch random_sample

* batch_sample_k

* fix num_nodes size

* batch brpc

* batch brpc

* add test

* add test

* add load_nodes; change add_node function

* change sample return type to pair

* resolve conflict

* resolved conflict

* resolved conflict

* separate server and client

* merge pair type

* fix

* resolved conflict

* fixed segment fault; high-level VLOG for load edges and load nodes

* random_sample return 0

* rm useless loop

* test:load edge

* fix ret -1

* test: rm sample

* rm sample

* random_sample return future

* random_sample return int

* test fake node

* fixed here

* memory leak

* remove test code

* fix return problem

* add common_graph_table

* random sample node &test & change data-structure from linkedList to vector

* add common_graph_table

* sample with srand

* add node_types

* optimize nodes sample

* recover test

* random sample

* destruct weighted sampler

* GraphEdgeBlob

* WeightedGraphEdgeBlob to GraphEdgeBlob

* WeightedGraphEdgeBlob to GraphEdgeBlob

* pybind sample nodes api

* pull nodes with step

* fixed pull_graph_list bug; add test for pull_graph_list by step

* add graph table;name

* add graph table;name

* add pybind

* add pybind

* add FeatureNode

* add FeatureNode

* add FeatureNode Serialize

* add FeatureNode Serialize

* get_feat_node

* avoid local rpc

* fix get_node_feat

* fix get_node_feat

* remove log

* get_node_feat return  py:bytes

* merge develop with graph_engine

* fix threadpool.h head

* fix

* fix typo

* resolve conflict

* fix conflict

* recover lost content

* fix pybind of FeatureNode

* recover cmake

* recover tools

* resolve conflict

* resolve linking problem

* code style

* change test_server port

* fix code problems

* remove shard_num config

* remove redundent threads

* optimize start server

* remove logs

* fix code problems by reviewers' suggestions

* move graph files into a folder

* code style change

* remove graph operations from base table

* optimize get_feat function of graph engine

* fix long long count problem

* remove redandunt graph files

* remove unused shell

* recover dropout_op_pass.h

* fix potential stack overflow when request number is too large & node add & node clear & node remove

* when sample k is larger than neigbor num, return directly

* using random seed generator of paddle to speed up

* fix bug of random sample k

* fix code style

* fix code style

* add remove graph to fleet_py.cc

* fix blocking_queue problem

* fix style

* fix

* recover capacity check

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* fix distributed op combining problems

* optimize

* remove logs

* fix MultiSlotDataGenerator error

* cache for graph engine

* fix type compare error

* more test&fix thread terminating problem

* remove header

* change time interval of shrink

* use cache when sample nodes

* remove unused function

* change unique_ptr to shared_ptr

* simplify cache template

* cache api on client

* fix

* reduce sample threads when cache is not used

* reduce cache memory

* cache optimization

* remove test function

* remove extra fetch function

* graph-engine data transfer optimization

* support graph_split load&query

* remove logs

* change shards to pointer vector

* use inference

* remove test code

* renorm op

* simplify renorm op

* recover local changes

* recover renorm op kernel

* fix init

* add blanklines in renorm doc

* fix import

* fix import

* add renorm to init.py

* merge

* move index_sample op

* Delete api.h

* Delete api.cc

* fix

* remove logs

* recover infer shape of grad

* recover changes

* change shape

* fix label

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix
Co-authored-by: NHuang Zhengjie <270018958@qq.com>
Co-authored-by: NWeiyue Su <weiyue.su@gmail.com>
Co-authored-by: Nsuweiyue <suweiyue@baidu.com>
Co-authored-by: Nluobin06 <luobin06@baidu.com>
Co-authored-by: Nliweibin02 <liweibin02@baidu.com>
Co-authored-by: Ntangwei12 <tangwei12@baidu.com>

1b585b28

P

move sgd to phi; test=develop · 5ad020e2
由 phlrain 提交于 2月 28, 2022

5ad020e2

[Phi] move truncated_gaussian_random kernel (#39971) · 23aa7a36

由 furnace 提交于 2月 28, 2022

* [Phi] move truncated_gaussian_random, copy kernels

* [Phi] move truncated_gaussian_random, kernel register

* [Phi] move truncated_gaussian_random, delete useless codes

23aa7a36

26 2月, 2022 3 次提交

[Pten] Refactor the copy kernel (#39731) · 9a7b9eda

由 zyfncg 提交于 2月 26, 2022

* remove SetAllocationForOutputTenosr

* add place param for copy kernel

* recover SetAllocationForOutputTenosr

* polish code

* fix empty_dev api bug

* test=allcases

* test=allcases

* fix bug

* recover empty

* recover modify

9a7b9eda

Move GumbelSoftmax OP to phi (#39873) · 581b2c64

由 From00 提交于 2月 26, 2022

* Move GumbelSoftmax OP to phi

* platform::errors -> phi::errors; GumbelSoftmaxGradInferMeta -> backend.h/cc

* Use axis util in kernel impl

* Remove namespace platform::errors

* Use GetCPUEngine in Device Context

581b2c64

F
Move BilinearTensorProduct OP to phi (#39903) · de8f2748
由 From00 提交于 2月 26, 2022
```
* Move BilinearTensorProduct OP to phi

* Set dtype for Infermeta
```
de8f2748

25 2月, 2022 8 次提交

C

move for_range into phi (#39931) · 94d8f392
由 Chen Weihang 提交于 2月 25, 2022

94d8f392

move eye、size、erfinv、pixel_shuffle OP to phi (#39712) · 639675de

由 0x45f 提交于 2月 25, 2022

* move eye OP to pten

* move size OP to pten

* merge develop

* fix merge

* move files

* move erfinv OP to phi

* remove comment

* move pixel_shuffle OP to phi

* remove comment

* fix PT_REGISTER

* fix NPU

* fix CR

* remove size_sig.cc for PR-CI-Coverage

639675de

A
[phi]migrate increment addmm multinomial cholesky InferShapes to phi (#39913) · 87b903a3
由 Aganlengzi 提交于 2月 25, 2022
```
* [phi]migrate increment addmm multinomial cholesky InferShapes to phi

* set_dtype and mod MultinomialFunctor
```
87b903a3
L

move diag_v2 to phi (#39914) · 783c4aba
由 Linjie Chen 提交于 2月 25, 2022

783c4aba

[bf16] add bf16 kernel: elementwise_add elementwise_mul elementwise_sub (#39716) · 2fedd39b

由 zhangbo9674 提交于 2月 25, 2022

* add ele_add

* add ele_mul

* add ele_sub

* sovle conflict

* fix npu

* refine ele_add

* add ele_mul unittest

* refine ele_sub

* refine ci

* refine unittest

2fedd39b

F
[Phi] mv kernel (#39861) · 2553af4f
由 furnace 提交于 2月 25, 2022
```
[Phi] mv kernel 
```
2553af4f
L
[phi] refine code of randint, randperm, unbind kernel (#39909) · 22f84122
由 Leo Chen 提交于 2月 25, 2022
```
* refine randint kernel

* refine randperm kernel

* refine unbind kernel

* support op seed
```
22f84122

[Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a

由 Chen Weihang 提交于 2月 25, 2022

* support cudnn kernel moving

* polish cmake rules

* add unittest for coverage

* remove orig kernel

* remove softmax cudnn kernel

* fix softmax test failed

* fix npu func error

* resolve conflict

* rename gpu dnn kernels

* fix name rule error

* fix compile error

* update fp16 namespace

8895379a

24 2月, 2022 5 次提交
- A
  [phi]migrate increment addmm multinomial cholesky kernels to phi (#39858) · b695fd95
  由 Aganlengzi 提交于 2月 24, 2022
```
* migrate increment addmm multinomial cholesky kernels to phi

* test pr39869

* test pr39869

* fix style and ci
```
  b695fd95
- L
  [phi] move randint to phi (#39872) · 127440c3
  由 Leo Chen 提交于 2月 24, 2022
```
* move randint to phi

* use host generator
```
  127440c3
- 0
  [Phi]Move cross OP to phi (#39829) · 6c358a7c
  由 0x45f 提交于 2月 24, 2022
```
* move cross forward OP

* move cross grad op to phi

* move infershape

* refine infershape

* rename ctx

* set dtype and layout in InferMeta

* refine code
```
  6c358a7c
- L
  [phi] move bce_loss to phi (#39868) · 6fc5d88a
  由 Linjie Chen 提交于 2月 24, 2022
```
* move bce_loss to phi

* refine PADDLE_ENFORCE

* revert PADDLE_ENFORCE

* fix ci
```
  6fc5d88a
- 【Phi】Migrate poisson op into phi (#39814) · bbe441fc
  由 zhouweiwei2014 提交于 2月 24, 2022
```
* Migrate poisson op into phi

* fix CI

* fix comment
```
  bbe441fc
23 2月, 2022 9 次提交
- L
  [phi] move randperm to phi (#39816) · 30992ea0
  由 Leo Chen 提交于 2月 23, 2022
```
* move randperm to phi

* fix npu

* fix memory::Copy
```
  30992ea0
- Y
  
  [Phi] move flip op to phi kernel (#39822) · ad294a81
  由 Yang 提交于 2月 23, 2022
  
  ad294a81
- change CUDA implementaion of bernoulli OP (#39732) · b9675acc
  由 zhouweiwei2014 提交于 2月 23, 2022
```
* change CUDA implementaion of bernoulli OP

* fix CI
```
  b9675acc
- R
  
  [phi] migrate atan2_op into phi (#39806) · b089e7cd
  由 ronnywang 提交于 2月 23, 2022
  
  b089e7cd
- L
  [phi] move unbind to phi (#39789) · dba694f4
  由 Leo Chen 提交于 2月 23, 2022
```
* move unbind to phi

* revert infer shape

* add header file

* move concat_and_split to phi
```
  dba694f4
- L
  [KP] Add elementwise add xpu after phi, test=develop (#39787) · 1a1a2ce8
  由 Liu-xiandong 提交于 2月 23, 2022
```
* [KP] Add elementwise add xpu, test=develop

* modify the File Permissions

* modify the copyright time

* modify code style

* modify code style
```
  1a1a2ce8
- A
  [Phi] Migrate lable_smooth_op into Phi (#39796) · b7bcd0f6
  由 Aurelius84 提交于 2月 23, 2022
```
* [Phi] Migrate lable_smooth_op into Phi

* fix PT->PD
```
  b7bcd0f6
- Z
  [bf16] add bf16 kernel: elementwise_div (#39602) · ca4df333
  由 zhangbo9674 提交于 2月 23, 2022
```
* add elementwise_div

* refine rocm

* refine code

* refine op register

* solve conflict

* refine unittest

* refine unittest precision

* add rocm
```
  ca4df333
- Z
  [PHI] Remove fill_any_like kernel register in fluid (#39807) · 69e9e9d5
  由 zyfncg 提交于 2月 23, 2022
```
* remove fill_any_like kernel in fluid and fix data transform bug

* support scalar in infershpe

* recover infershape in fill_and_like
```
  69e9e9d5
22 2月, 2022 4 次提交

Move real and imag op to phi (#39777) · 345cc8fa

由 From00 提交于 2月 22, 2022

* Move Real OP to phi

* Move Imag OP to phi

* Move Real and Imag InferShape to phi

* Move Real and Imag to complex_kernel

* Change PT_REGISTER_XXX to PD_REGISTER_XXX

345cc8fa

change Vector to std::vector and provide MixVector class as a helper … (#39559) · 728c0624

由 xiongkun 提交于 2月 22, 2022

* change Vector to std::vector and provide MixVector class as a helper wrapper class

* solve the multi-gpu hang problem

* remove the duplicate template instantialize

* Copy vector to cpu

* add CopyToCPU

* xxx

* final version: fix the problem of all reduce

* remove mixvector dependence

* fix

* merge

* fix code

* fix by CI

728c0624

[Phi] Migrate unfold_op into phi (#39778) · 1aa67778

由 Aurelius84 提交于 2月 22, 2022

* [Phi] Migrate unfold_op into phi

* fix im2col CPUContext template instantial

* fix unfold_op.h header include problem

* fix unittest

* fix PT->PD

1aa67778

C
[PTen->Phi PR2] Rename PT_REGISTER macro to PD_REGISTER (#39790) · 4a338796
由 Chen Weihang 提交于 2月 22, 2022
```
* unify register macro

* rename declare macro

* fix infrt error
```
4a338796

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致