提交 · 14c95700c4875c774a6e2fa936dde0ce1193b225 · PaddlePaddle / Paddle

08 11月, 2022 14 次提交
- C
  
  Support cuda 11 with jetson (#47741) · 14c95700
  由 chalsliu 提交于 11月 08, 2022
  
  14c95700
- Z
  [Paddle Inference] allow fold fill_constant && allow nms3 into trt in int8 model (#47551) · c3a69111
  由 zhoutianzi666 提交于 11月 08, 2022
```
* allow fold fill_constant && allow nms3 into trt in int8 model
* use unordered_map
* fix CI failing
```
  c3a69111
- H
  update AUTHOR. test=kunlun (#47682) · 51507430
  由 houj04 提交于 11月 08, 2022
```
* update AUTHOR. test=kunlun

* update AUTHOR.
```
  51507430
- N
  [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition (#47642) · 888272b5
  由 Nyakku Shigure 提交于 11月 08, 2022
```
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition

* fix an increment
```
  888272b5
- Z
  
  fix examplce code of slice api (#47735) · e5bb8785
  由 zyfncg 提交于 11月 08, 2022
  
  e5bb8785
- P
  Split quant (#47449) · 130db92a
  由 Paulina Gacek 提交于 11月 08, 2022
```
* Split kernel registered, tests for uint/int added

* Split quantized

* Split output scales calculated only once

* NearestInterp test fix reversed

* DequantizeOutputs corrected
```
  130db92a
- J
  removing dependent to fluid/framework/eigen.h in phi (#47675) · c7cd8d98
  由 jzhang533 提交于 11月 08, 2022
```
* removing dependent to fluid/framework/eigen.h in phi

* more fix according to PR-CI-Py3 fail
```
  c7cd8d98
- T
  remove dist xpu tests for R200 (#47381) · ef21b58b
  由 tianshuo78520a 提交于 11月 08, 2022
```
* disable distributed xpu tests

* test=kunlun

* test=document_fix;test=kunlun

* test=document_fix;test=kunlun

* test=document_fix;test=kunlun

* test=document_fix;test=kunlun
```
  ef21b58b
- C
  support pow double grad op (#47691) · 6fe9dfb2
  由 Charles-hit 提交于 11月 08, 2022
```
* support pow_double_grad op

* add unit test for pow double grad

* fix pow double grad

* optimize pow double grad kernel

* fix pow double grad kernel
```
  6fe9dfb2
- Z
  [Paddle-TRT]Fix cast converter bug , use setOutputType() instaead (#46289) · 18adbbd0
  由 zhoutianzi666 提交于 11月 08, 2022
```
* fix cast bug
```
  18adbbd0
- W
  
  remove <fluid/eager/api/utils/global_utils.h> from phi (#47739) · 42d9fe2f
  由 Wang Xin 提交于 11月 08, 2022
  
  42d9fe2f
- C
  
  normalize autotune tests dir (#47726) · 6bab3343
  由 Chen Weihang 提交于 11月 08, 2022
  
  6bab3343
- T
  
  fix cinn_instruction_run_op_test when FLAGS_use_system_allocator=True (#47731) · a4a9ce0e
  由 TeFeng Chen 提交于 11月 08, 2022
  
  a4a9ce0e
- T
  Fix undefined symbol: shm_open (#47421) · 50c3632f
  由 Tomasz Socha 提交于 11月 08, 2022
```
* Fix undefined symbol: shm_open

* Fix for Windows

* Exclude APLLE
```
  50c3632f
07 11月, 2022 24 次提交
- Y
  Define ConvRunner to wrapper the call of cudnn conv functions. (#47576) · c331e2ce
  由 Yiqun Liu 提交于 11月 07, 2022
```
* Define ConvRunner to wrapper the call of cudnn conv functions.

* Use ConvKind in SearchAlgorithm.
```
  c331e2ce
- H
  suqeeze2 + transpose2 fuse onednn (#47592) · fa874a46
  由 Hui Zhang 提交于 11月 07, 2022
```
* suqeeze2 transpose2 fuse onednn

* format

* fix output shape

* fix conflict

* format

* format

* remove useless

* remove log

* simply pass

* fix comment

* fix

* fix msg

* fix error msg

* format
```
  fa874a46
- W
  
  remove hardcoded -Wunused-variable compiler flags (#47706) · 45bc4542
  由 Wang Xin 提交于 11月 07, 2022
  
  45bc4542
- L
  
  fix nlu compilation (#47707) · 75f34bb7
  由 Leo Chen 提交于 11月 07, 2022
  
  75f34bb7
- Q
  support kldiv_loss/kldiv_loss_grad for kunlun (#47638) · 5f0a8adc
  由 QingshuChen 提交于 11月 07, 2022
```
*test=kunlun
```
  5f0a8adc
- T
  Test FLAGS_enable_cudnn_frontend In CUDA117 CI (#47635) · 87753ee8
  由 tianshuo78520a 提交于 11月 07, 2022
```
* test=cuda117

* test=cuda11

* test=document_fix;test=cuda117

* test=document_fix
```
  87753ee8
- C
  
  update error msg ci check rule, test=document_fix (#47708) · dcc4b46f
  由 Chen Weihang 提交于 11月 07, 2022
  
  dcc4b46f
- Z
  [AutoParallel]fp16 pass support assign op (#47649) · 6c51e493
  由 zhaoyingli 提交于 11月 07, 2022
```
* fp16 pass support assign op

* choose assign op exec mode

* add unittest

* add cmakelist
```
  6c51e493
- P
  
  disable WITH_CUDNN_DSO (#47674) · c65f0565
  由 pangyoki 提交于 11月 07, 2022
  
  c65f0565
- Y
  add roll and roll_grad kernels and strided_slice and strided_slice_grad... · 5a4d2186
  由 ykkk2333 提交于 11月 07, 2022
```
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368)

* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
```
  5a4d2186
- W
  [Eager] eager tensor support pickler (#47025) · 8a7e54d5
  由 wanghuancoder 提交于 11月 07, 2022
```
* test_paddle_multiprocessing support eager tensor pickler
```
  8a7e54d5
- W
  refine python lib link (#47681) · eb102189
  由 wanghuancoder 提交于 11月 07, 2022
```
* refine python lib link
```
  eb102189
- Y
  
  [Paddle inference] fix mixed precision (#47654) · 624ffdf2
  由 Yuanle Liu 提交于 11月 07, 2022
  
  624ffdf2
- R
  
  call InitDevices only once (#47678) · 0cbdcdda
  由 ronnywang 提交于 11月 07, 2022
  
  0cbdcdda
- N
  [CodeStyle] refine pre-commit-config.yaml (#47693) · b4a3cca1
  由 Nyakku Shigure 提交于 11月 07, 2022
```
* sort hooks

* add `name` for remove-tabs
```
  b4a3cca1
- W
  Get three grad lists in CPP to avoid gpu idle time (#47665) · 01bfe786
  由 WangZhen 提交于 11月 07, 2022
```
* Get three grad lists in CPP to avoid gpu idle time

* Support legacy mode
```
  01bfe786
- J
  [Fluid Clean] remove paddle.fluid.dygraph.nn.conv2D (#47441) · 0b3b4918
  由 JYChen 提交于 11月 07, 2022
```
* remove paddle.fluid.dygraph.nn.conv2D

* fix ut

* fix conv fp16 UT
```
  0b3b4918
- H
  [Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d
  由 HongyuJia 提交于 11月 07, 2022
```
* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

* Call SetDnnFallback function in the base class

* activation fallback to plain kernel

* fix default GetExpectedKernelType find wrong kernel

* search cudnn kernel instead of fallback

* fix cudnn_handle bug

* remove tanh use_cudnn

* restore tanh use_cudnn

* debug tanh

* fix tanh bug

* delete activation cudnn kernel

* polish code
```
  908a381d
- N
  [CodeStyle][E262][E265] make comments start with `# ` (#47687) · c9a7cadf
  由 Nyakku Shigure 提交于 11月 07, 2022
```
* [CodeStyle][E262][E265] make comments start with `# `

* flake8 config
```
  c9a7cadf
- Q
  
  [cusotm device] add python inference api, test=develop (#46460) · 6074c50a
  由 Qi Li 提交于 11月 07, 2022
  
  6074c50a
- W
  
  Refactor collective communication all_gather, all_reduce, broadcast & barrier C++ API (#47481) · e1a1c354
  由 Wen Sun 提交于 11月 07, 2022
  
  e1a1c354
- S
  [PHI] Migrate batch_norm (#47652) · 2337e609
  由 Sławomir Siwek 提交于 11月 07, 2022
```
* init changes

* bnorm

* method signature

* change order

* bnorm

* removed unused args
```
  2337e609
- Z
  [AutoParallel] update naive data parallel completion (#47578) · 9db507f1
  由 zhaoyingli 提交于 11月 07, 2022
```
* expand op donot use naive data parallel

* fix unittest
```
  9db507f1
- S
  [PHI] Migrate depthwise_conv2d_grad and conv3d_grad kernels (#47686) · b0c38568
  由 Sławomir Siwek 提交于 11月 07, 2022
```
* remove fwd funcs

* migrate conv grads
```
  b0c38568
05 11月, 2022 2 次提交
- Y
  
  update the split logic for uniform (#47670) · 383f1c4f
  由 Yuang Liu 提交于 11月 05, 2022
  
  383f1c4f
- Y
  
  Use an unified FLAGS_check_nan_inf_level to control the result of checking infinite. (#47672) · 54bc3b46
  由 Yiqun Liu 提交于 11月 05, 2022
  
  54bc3b46

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功