提交 · d6038c22696e23dfc181643694e84f888e8001ae · Crayon鑫 / Paddle

24 2月, 2022 1 次提交
- L
  optimize performance of lookup_table_v2_op (#39856) · d6038c22
  由 Li Min 提交于 2月 24, 2022
```
* optimize block config  and fp16 atomicAdd perf for lookup_table_v2_grad.
```
  d6038c22
22 2月, 2022 1 次提交

change Vector to std::vector and provide MixVector class as a helper … (#39559) · 728c0624

由 xiongkun 提交于 2月 22, 2022

* change Vector to std::vector and provide MixVector class as a helper wrapper class

* solve the multi-gpu hang problem

* remove the duplicate template instantialize

* Copy vector to cpu

* add CopyToCPU

* xxx

* final version: fix the problem of all reduce

* remove mixvector dependence

* fix

* merge

* fix code

* fix by CI

728c0624

20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

15 2月, 2022 1 次提交

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

由 Aurelius84 提交于 2月 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

7e7e9404

08 2月, 2022 1 次提交
- S
  Make Embedding layer support more int ids type (#39381) · 60f1461a
  由 sneaxiy 提交于 2月 08, 2022
```
* add more int id type support for embedding

* add ut

* add more ut

* fix ci error
```
  60f1461a
25 1月, 2022 1 次提交

[Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338

由 Weilong Wu 提交于 1月 25, 2022

* Added selected_rows and rw_lock to pten

* Renamed the unit test target to fix CI

* Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid

* Remove rw_lock.h,rw_lock_test.cc in fluid

* Use pten::RWLock and pten::AutoRDLock, fix CI

* Use pten::SelectedRows

* Use pten::SelectedRows

* Fix to pass NPU CI

* Use pten::SelectedRows, to pass NPU CI

* To fix NPU CI

* To fix NPU CI again

2bafd338

17 1月, 2022 1 次提交

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

由 Wilber 提交于 1月 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

c48a9ad5

03 12月, 2021 1 次提交
- R
  refine structure for cuda and rocm (#37202) · a6d2fddb
  由 ronnywang 提交于 12月 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
  a6d2fddb
03 12月, 2020 1 次提交

fix gpu outofrange (#29238) · 83587916

由 tangwei12 提交于 12月 03, 2020

* fix gpu emb out of range

Change-Id: I5794ac73bd634d5ea069a6fbbd914274b6d6b7bf

* fix doc

Change-Id: I5a3350b2930a9ab2f52116c192b087307faf8fdf

83587916

28 9月, 2020 1 次提交
- Y
  enhance error messages of lookup_tale, merge_ids, data_norm (#27619) · c9a88013
  由 yaoxuefeng 提交于 9月 28, 2020
```
* enhance error messages of lookup_tale, merge_ids, data_norm

* fix

* fix error msg in .cu
```
  c9a88013
01 9月, 2020 1 次提交
- T
  add embedding 2.0 (#26649) · ebc5f997
  由 tangwei12 提交于 9月 01, 2020
```
* add embedding 2.0

* add embedding support input int32
```
  ebc5f997
22 7月, 2020 1 次提交
- D
  
  optimize embedding cuda kernel lookup_table_v2,test=develop (#25587) · 95fa383d
  由 donproc 提交于 7月 22, 2020
  
  95fa383d
11 5月, 2020 1 次提交

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

由 Chen Weihang 提交于 5月 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

29 11月, 2019 1 次提交

Add dygraph execution context (#20157) · ac854670

由 hong 提交于 11月 29, 2019

* add_dygraph_execution_context

* add dygraph infershape context and execution context; test=develop

* fix imperative bug; test=develop

* remove inputs outputs interface from execution context,
because it have same function with inputNames;
test=develop

* remove tracer_test ctest; test=develop

* fix split op bug; test=develop

* fix unitests bug; test=develop

* fix distribute test bug; test=develop

* fix ngraph compile bug; test=develop

* fix grad maker bug; test=develop

* fix load op bugs; test=develop

* fix operator.cc construct bug; test=develop

* remove useless name find in operator; test=develop

* add tracer_test; test=develop

* fix concat, split bug; test=develop

* remove tracer_test unitest; test=develop

* fix attribute check bug; test=develop

* add test code to fix converage; test=develop

* remove useless code, change check backward input in engin; test=develop

* unlock var type infer shape;test=develop

* add ShareAllLoD api; test=develop

* add dygraph infershape context unitest; test=develop

* remove increase and decrease lod in dygraph; test=develop

* addd override; test=develop

* fix increase descrease lod; test=develop

* fix paddle_enforce; test=develop

* disable lod op dygraph check; test=develop

* fix paddle enforce error; test=develop

* add comment for op_registry and OperatorBase; test=develop

* optimize the comment of op_registry; test=develop

* fix format of comment; test=develop

* fix format of comment; test=develop

* optimize the format of comment; test=develop

* optimize the format of the comment; test=develop

* optimize comment of op_registry; test=develop

ac854670

12 10月, 2019 1 次提交

enhance embedding error message test=develop (#20246) · 22823df2

由 Aurelius84 提交于 10月 12, 2019

* enhance embedding error message test=develop

* enforce .h error test=develop

* fix unittest code test=develop

* Fix fp16 dtype in embedding test=develop

* add import warnings test=develop

22823df2

24 9月, 2019 1 次提交

Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735) · 039b9710

由 Aurelius84 提交于 9月 24, 2019

* Remove constraint that last dimension is forced to be 1 by add
lookup_table_v2 test=develop

* modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop

* Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop"

This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9.

* move api into fluid.embedding test=develop

* fix example code test=develop

* move one_hot into fluid.one_hot

* modify api.spec test=develop

* fix loss shape test=develop

039b9710

05 9月, 2019 1 次提交

unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) · 3ae939e4

由 Tao Luo 提交于 9月 05, 2019

* remove assert.h

* change PADDLE_ASSERT_MSG to PADDLE_ENFORCE

test=develop

* fix tensorrt paddle_enforce

test=develop

3ae939e4

28 8月, 2019 1 次提交

Fix the correctness of async mode at distributed training (#18863) · 65c73684

由 tangwei12 提交于 8月 28, 2019

* fix correctness of the communicator

* fix a bug in send thread when sending var context is empty, test=develop

* add lookup_table_prefetch_op and prefetch optimize, test=develop

* remove remote prefetch GPU supported

* word2vec force with CPU, test=develop

* test dist remote lookup table force with CPU, test=develop

65c73684

09 8月, 2019 1 次提交
- Z
  optimize error message for "embedding" and "cross_entropy" OP (#18765) · c2063217
  由 Zhang Ting 提交于 8月 09, 2019
```
* optimize error message, test=develop

* optimize error message, test=develop
```
  c2063217
08 5月, 2019 1 次提交
- C
  update assert (#17282) · db5e74ab
  由 chengduo 提交于 5月 08, 2019
```
test=develop
```
  db5e74ab
28 3月, 2019 1 次提交
- Q
  
  fix gpu build for lookup_table_op test=develop · 34890fd3
  由 Qiao Longfei 提交于 3月 28, 2019
  
  34890fd3
30 1月, 2019 1 次提交
- Y
  Some improvements to support bert mixed precision training (#15585) · 170842cb
  由 Yibing Liu 提交于 1月 30, 2019
```
* Some improvements to support bert mixed precision training

test=develop

* Revert the cast in layer_norm

test=develop
```
  170842cb
19 12月, 2018 1 次提交
- J
  
  test=develop, fix compile error under gpu mode · 5ec9b377
  由 JiabinYang 提交于 12月 19, 2018
  
  5ec9b377
03 12月, 2018 1 次提交
- Y
  
  Print assert failure id in lookup_table_op (#14698) · c7382df8
  由 Yibing Liu 提交于 12月 03, 2018
  
  c7382df8
29 11月, 2018 1 次提交
- Q
  lookup_table gpu kernel support prefetch · 3e45a5a5
  由 Qiao Longfei 提交于 11月 29, 2018
```
test=develop
```
  3e45a5a5
21 9月, 2018 1 次提交
- Y
  
  Fix MixedVector · e1913bc5
  由 Yu Yang 提交于 9月 21, 2018
  
  e1913bc5
31 7月, 2018 2 次提交
- F
  
  Add unittests for lookup_table_op · b1af7e5d
  由 fengjiayi 提交于 7月 31, 2018
  
  b1af7e5d
- F
  
  make look_up_op supporting tensor ids · 7efdf05a
  由 fengjiayi 提交于 7月 31, 2018
  
  7efdf05a
27 7月, 2018 1 次提交

Refine regularization for selected_rows (#12369) · 2409d0f7

由 chengduo 提交于 7月 27, 2018

* refine regularization for selected_rows

* clean lookup_table

* refine rpc_server_test

* temporally disable rpc_server_test

* fix rpc_server_test

* add unit test

2409d0f7

30 4月, 2018 1 次提交
- D
  Feature/cuda9 cudnn7 (#10140) · eb6f9dd5
  由 dzhwinter 提交于 4月 30, 2018
```
* "re-commit "

* "picked up"

* "fix ci"

* "fix pdb hang up issue in cuda 9"
```
  eb6f9dd5
13 3月, 2018 2 次提交
- C
  
  refine doc · 92e2207e
  由 chengduoZH 提交于 3月 13, 2018
  
  92e2207e
- C
  
  remove concat_rows · b9397b26
  由 chengduoZH 提交于 3月 13, 2018
  
  b9397b26
12 3月, 2018 1 次提交
- C
  
  add concat rows · f1c3ecb2
  由 chengduoZH 提交于 3月 10, 2018
  
  f1c3ecb2
09 3月, 2018 1 次提交
- C
  
  enhancement look_up_table · 1509ce66
  由 chengduoZH 提交于 3月 09, 2018
  
  1509ce66
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
08 2月, 2018 1 次提交
- Y
  
  Rewrite mixed_vector.h · ef1aba39
  由 Yu Yang 提交于 2月 08, 2018
  
  ef1aba39
31 1月, 2018 1 次提交

Fix/lod (#7714) · ae7d1c1f

由 dzhwinter 提交于 1月 31, 2018

* "Need to re-design LoD "

* "add lod design"

* "fix lod gpu ptr pointer"

* "removed commented code"

* "fix CI"

* "remove set lod in pybind"

* "fix style check"

* "fix CI"

* "fix long type template error"

* "pybind reorder to use Place"

* "fix ci"

* "fix ci"

* fix ci

* "sperate as a new file"

* "fix CI"

* "fix ci"

* small fix

* "add test"

* "fix adam op"

* "fix lstmp op"

* "fix adam op"

* "follow comments"

* "fix ci"

ae7d1c1f

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致