Commits · 97cd70897a1ef664dac1e95a4f0fc0c804870ea9 · Crayon鑫 / Paddle

07 Apr, 2021 1 commit

cherry-pick:add softmax_switch for softmax_with_cross_entropy_op (#32105) · 97cd7089

Committed by chajchaj on Apr 07, 2021

* cherry-pick:add softmax_switch for softmax_with_cross_entropy_op, test=develop

* add softmax_switch for softmax_with_cross_entropy_op, test=develop

* delete using EigenMatrix in softmax_with_cross_entropy_op.h, test=develop

* add REGISTER_OP_VERSION for softmax_switch attr of softmax_with_cross_entropy_op, test=develop

* cherry-pick:add softmax_switch for softmax_with_cross_entropy_op,test=develop

* change softmax_switch to use_softmax, test=develop

* fix code format for softmax_with_cross_entropy_op.cc, test=develop

97cd7089

06 Apr, 2021 1 commit
- remove pass restrictions for skip-ln pass (#32082) · 62c21734
  Committed by Pei Yang on Apr 06, 2021
  
  62c21734
02 Apr, 2021 3 commits

Fix the nan bug when passing all zero values into clip_by_norm_op. (#30777) (#32038) · 1f8834ad
Committed by Zhen Wang on Apr 02, 2021
```
If all input grads are zero, the output of clip_by_norm will be inf or nan. This PR fixes that bug.
```
1f8834ad
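
A minimal sketch of how an all-zero input can produce nan in a clip-by-norm computation, assuming a branch-free scaling formula. The function names, the formula, and the epsilon guard are illustrative assumptions, not code taken from the Paddle source. This sketch reproduces the nan case only.

```python
import numpy as np


def clip_by_norm_naive(x, max_norm):
    """Branch-free clip-by-norm (hypothetical formulation, for illustration only)."""
    norm = np.sqrt(np.sum(np.square(x)))
    within = np.float32(norm <= max_norm)  # 1.0 if no clipping is needed, else 0.0
    with np.errstate(divide="ignore", invalid="ignore"):
        # With norm == 0 this evaluates 0 / 0, which is nan.
        scale = within + (1.0 - within) * max_norm / norm
    return x * scale


def clip_by_norm_guarded(x, max_norm, eps=1e-6):
    """Same formula with an assumed small epsilon keeping the denominator nonzero."""
    norm = np.sqrt(np.sum(np.square(x)))
    within = np.float32(norm <= max_norm)
    scale = within + (1.0 - within) * max_norm / (norm + eps)
    return x * scale


zeros = np.zeros(4, dtype=np.float32)
naive_out = clip_by_norm_naive(zeros, max_norm=1.0)      # every element is nan
guarded_out = clip_by_norm_guarded(zeros, max_norm=1.0)  # every element stays 0.0
```

The guarded variant leaves inputs that already satisfy the norm bound unchanged and still rescales larger inputs to approximately `max_norm`.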

【Paddle.Fleet】【Cherry-Pick】fix grad_clip & gaussian_random & dataset & profiler (#31945) · 186bbebf

Committed by Chengmo on Apr 02, 2021

* Remove PE special profiler (#30886)

* remove pe special profiler

* add profiler info

* add truncated gaussian random (#30922)

add truncated gaussian random

* 【Paddle.Fleet】fix dataset zip py3 bug (#31441)

* fix zip py3 bug

* 【Paddle.Fleet】Fix one ps gradient clip  (#31664)

* fix one ps gradient clip

186bbebf

[Cherry-Pick] logclean & embedding doc (#32009) · 8140485a

Committed by tangwei12 on Apr 02, 2021

* fix en doc for emb (#31980)

* fix en doc for emb, test=document_fix;
Change-Id: I4757e67caacd7189f068493ed45a7445f87ffb40

* LOG CLEAN (#31819)

* upgrade vlog

* train from dataset fetch optimize

8140485a

01 Apr, 2021 1 commit
- fix stack op grad nullptr (#31962) (#32005) · e7542a4d
  Committed by Jiawei Wang on Apr 01, 2021
  
  e7542a4d
31 Mar, 2021 2 commits

OneDNN hardswish integration (#30211) (#31870) · b934d0b8

Committed by lidanqing on Mar 31, 2021

* OneDNN hardswish integration (#30211)

* keep only conv + hardswish in this PR
Co-authored-by: jakpiase <62569058+jakpiase@users.noreply.github.com>

b934d0b8

Cherry pick bert transformer 2.0 support (#31959) · 967f4c2e

Committed by Pei Yang on Mar 31, 2021

* [Paddle-TRT] TRT inference support for BERT/Transformer in paddle 2.0 api (#31744)

* support multihead_matmul_fuse_pass_v3

* fix compile problems

* embedding_eltwise_ln pass support lookup_table_v2

* support matmul and matmul_v2 in qkv matmul

* map_matmul_to_mul_pass support 3dim

967f4c2e

25 Mar, 2021 2 commits
- fix runtime crash when rnn model inference, test=develop (#31833) (#31846) · c7a6a1f9
  Committed by winter-wang on Mar 25, 2021
  
  c7a6a1f9
- fix cache key in concat oneDNN kernel (#31820) (#31837) · d44d1730
  Committed by Wojciech Uss on Mar 25, 2021
```
* fix cache key in concat oneDNN kernel

* key simplified
```
  d44d1730
02 Mar, 2021 4 commits
- [CP] align fleet param (#31220) · d15e73b0
  Committed by lilong12 on Mar 02, 2021
```
* update, test=develop (#30692)

* align the default value of some configuration for fleet to that of single cards (#30740)

* update, test=develop
```
  d15e73b0
- Modify relu native implementation 2 (#30996) (#31348) · 98c4c780
  Committed by Wojciech Uss on Mar 01, 2021
  
  98c4c780
- Revert "add trt transpose and flatten converter (#31022) (#31139)" (#31343) · 325bfc37
  Committed by Pei Yang on Mar 02, 2021
```
This reverts commit 20e68a22.
```
  325bfc37
- add clip_by_norm on kunlun, *test=kunlun (#30862) (#31331) · 0a8ebb0d
  Committed by cucuzg on Mar 02, 2021
  
  0a8ebb0d
01 Mar, 2021 6 commits

[Cherry pick] cherry-pick #31102 #30750 #30626 (#31336) · ff4612a3

Committed by Thunderbrook on Mar 01, 2021

* solve build gpu task core (#30626)

* build gpu task core

* format

* dump to cpu (#30750)

* dump to cpu

* format

* format

* format

* support multi node in heterps (#31102)

* push multi node

* multi node

* MultiThread

* remove log

* solve bug in 30829

* optimizer

ff4612a3

[Cherry-pick] Fix dtype unmatched in custom op API #31306 · a891032f
Committed by Chen Weihang on Mar 01, 2021
```
[Cherry-pick] Fix dtype unmatched in custom op API

cherry-pick of #31305
```
a891032f
[Cherry-pick] inference modification for custom operator (#31283) (#31300) · 628f0856
Committed by 石晓伟 on Mar 01, 2021

628f0856
fix heter compile (#30518) (#31186) · 227a6775
Committed by yaoxuefeng on Mar 01, 2021

227a6775
cherry-pick (#31279) · 6330fc94
Committed by Wilber on Mar 01, 2021

6330fc94

[Cherry-pick] The 4th part of new custom op (#31282) · 777d1a45

Committed by Chen Weihang on Mar 01, 2021

* modify custom op dependent from paddle_framework to paddle_custom_op (#31195)

* [Custom Op] Remove unsupport dtypes (#31232)

* remove remove_unsupport_dtype

* remove remove_unsupport_dtype

* remove test dtype

* add more include

* change dtype.h's enum as enum class to avoid conflict with inference lib

* make enum as enum class

* remove additional test

* merge develop

* polish code

* [Custom OP] Support stream set on Custom Op (#31257)

* [Custom OP] change the user header file format, test=develop (#31274)

* [Custom OP]add PD_THROW and PD_CHECK for User Error message (#31253)

* [Custom OP]add PD_THROW and PD_CHECK for User error message

* PD_THROW and PD_CHECK, fix comment

* fix Windows error message

* fix Windows error message

* fix CI

* [Custom OP]add MSVC compile check on Windows (#31265)

* fix test_check_abi
Co-authored-by: Zhou Wei <52485244+zhouwei25@users.noreply.github.com>
Co-authored-by: Jiabin Yang <marsyang199376@gmail.com>
Co-authored-by: 石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
Co-authored-by: zhouwei25 <zhouwei25@baidu.com>

777d1a45

27 Feb, 2021 1 commit

[Cherry-Pick] Split Macros and Add modeling unittest (#31266) · 52f7e773

Committed by Aurelius84 on Feb 27, 2021

* [CustomOp] Add Modeling with Custom op unittest (#31218)

* add unittest for static/dygraph/dy2stat

* add PE unittest

* remove useless code

* add unittest in CMakeList.txt

* [CustomOp] Split build op macro & polish details (#31229)

* split build op macro & polish details

* revert register api del

* fix other unittest

* [CustomOP]Support Incremental compilation and Add Version management (#31228)

* Support Incremental compilation and Add Version management

* replace hash with hashlib

* fix test_op_num unittest

* Revert "fix test_op_num unittest"

This reverts commit 2f78de976e1d7ca60915b2310717b38a32ae204a.
Co-authored-by: Chen Weihang <chenweihang@baidu.com>

52f7e773

26 Feb, 2021 5 commits
- [cherry-pick]fix error message & label check in softmax_with_cross_entropy (#31123) · 536d9a3b
  Committed by Guanghua Yu on Feb 26, 2021
```
* fix error message & label check in softmax_with_cross_entropy

* fix error message & label check in softmax_with_cross_entropy

* fix print comment

* fix ignore_index check in softmax_with_cross_entropy
```
  536d9a3b
- Cherry-pick-29260, change import math.h to cmath (#29260) (#31212) · 84a5ed9f
  Committed by pangyoki on Feb 26, 2021
```
ATT，cherry pick PR #29260
```
  84a5ed9f
- [Cherry-pick] The Second part of new custom op extension in 2.0.1 (#31237) · d3e60959
  Committed by Chen Weihang on Feb 26, 2021
```
[Cherry-pick] The Second part of new custom op extension in 2.0.1
```
  d3e60959
- Fleet distributed strategy support pure fp16 (#30754) (#31238) · 03babe17
  Committed by WangXi on Feb 26, 2021
  
  03babe17
- loglevel adjustment for distributed training (#31236) · 188bcbb7
  Committed by tangwei12 on Feb 26, 2021
```
Change-Id: I6210ce9c60bed48f3323c47b16500302b66cedf2
```
  188bcbb7
25 Feb, 2021 4 commits

[cherry pick]Fix windows error (#31207) · 256103a5
Committed by wangchaochaohu on Feb 25, 2021
```
cherry-pick #31068
```
256103a5
Add cublas_handle() to expose cublas_handle to ops (#31157) (#31190) · c7b32fe1
Committed by liu zhengxi on Feb 25, 2021
```
* add get_cublas_handle() api

* update format

* add unittests

* alter function name
```
c7b32fe1
[Cherry-pick] Double grad for clip op #31109 · 5bd7c82b
Committed by qingqing01 on Feb 25, 2021
```
Cherry-pick double grad for clip
```
5bd7c82b

fix entry (#31079) (#31182) · 8177ece5

Committed by tangwei12 on Feb 25, 2021

* fix entry

* fix distributed lookup table fuse case

* fix entry bug at first time

* move entry from paddle.fluid -> paddle.distributed

* fix ut with paddle.enable_static()
Co-authored-by: malin10 <malin10@baidu.com>
Co-authored-by: malin10 <malin10@baidu.com>

8177ece5

24 Feb, 2021 2 commits
- [Paddle-TRT] support group_norm (#31040) (#31188) · fe00d32a
  Committed by Pei Yang on Feb 24, 2021
  
  fe00d32a
- added support for fake_quantize_dequantize_abs_max op in quantization… (#30896) (#31162) · 011a6a51
  Committed by alncat on Feb 24, 2021
```
* added support for fake_quantize_dequantize_abs_max op in quantization inference pass

* remove const_cast to pass ci

* remove compare operator to pass ci-coverage

* added detailed error message for unregistered tensorrt_subgraph_pass
```
  011a6a51
23 Feb, 2021 8 commits
- [CustomOp] New custom operator extension mechanism in 2.0.1 (#31097) · a19154ca
  Committed by Chen Weihang on Feb 23, 2021
```
[CustomOp] New custom operator extension mechanism in 2.0.1

Cherry-pick New custom operator basic implementation related PRs
```
  a19154ca
- add trt transpose and flatten converter (#31022) (#31139) · 20e68a22
  Committed by Pei Yang on Feb 23, 2021
  
  20e68a22
- [cherry-pick] Fix softmax cross entropy integer overflow. (#30590) (#31134) · 30a2e7f0
  Committed by Zhong Hui on Feb 23, 2021
```
[BUG FIX] Fix softmax cross entropy overflow problem.
```
  30a2e7f0
- [cherry-pick 2.0.1] [kunlun] fix xpu bind threaded executor (#31116) · 29467060
  Committed by WangXi on Feb 23, 2021
```
* [Kunlun] Add condition_variable and notify() in BindThreadedSSAGraphExecutor (#30586)

* [Kunlun] fix dead lock for exec_op_count_ (#30718)

* Fix the problem that the number of ops executed by xpu is wrong (#30961)
Co-authored-by: liuyuhui <liuyuhui@baidu.com>
```
  29467060
- [Cherry-pick] fix ELU output for nan, test=develop (#31135) · b582be2d
  Committed by Qi Li on Feb 23, 2021
```
ATT, cherry pick of #31132
```
  b582be2d
- A fix for oneDNN matmul kernel. Fixes issue #30309 for oneDNN 1.6 (#31066) · f5007051
  Committed by Wojciech Uss on Feb 22, 2021
```
* A fix for oneDNN matmul kernel. Fixes issue #30309 (#30723)

* A fix for #30309 with oneDNN 1.6
```
  f5007051
- test=develop, save/load, shrink (#30625) (#31107) · 36710ebc
  Committed by tangwei12 on Feb 23, 2021
```
* test=develop, save/load, shrink
Co-authored-by: seiriosPlus <tangwei12@baidu.com>
Co-authored-by: 123malin <malin10@baidu.com>
```
  36710ebc
- update merge pr #31060（update trt int8 calibrator to IEntropyCalibratorV2） (#31121) · 1d2bd35e
  Committed by Shang Zhizhou on Feb 23, 2021
  
  1d2bd35e

Crayon鑫 / Paddle is in sync with the fork source project
