提交 · 2096448bb1268ffef91be4e013842cc685605683 · PaddlePaddle / Paddle

27 10月, 2022 1 次提交

make all cpp tests dynamic linked to libpaddle.so [except windows] (#47088) · 2096448b

由 Leo Chen 提交于 10月 27, 2022

* make all cpp tests dynamic linked to libpaddle.so

* add comments

* keep old cc_test for some tests

* fix some ut

* make some ut use cc_test_old

* fix typos and fit for win32

* fix lib path

* fix some tests

* skip lite test

* fit for rocm

* fit for cinn

* fit for mac

* fit for win32

* skip inference ut

* skip  windows

* fix coverage

2096448b

24 10月, 2022 1 次提交
- W
  [CodeStyle] fix macos inconsistent-missing-override warnings and add -Werror (#47264) · c5fe109b
  由 Wang Xin 提交于 10月 24, 2022
```
* fix macos inconsistent-missing-override warnings

* fix inconsistent-missing-override error in test
```
  c5fe109b
11 10月, 2022 1 次提交

Fix build on gcc-11 (#46808) · 28ef0fff

由 Leding Li 提交于 10月 11, 2022

* Add missing <cstddef>
  * Fix some iterator type
  * Currently use host protobuf to bypass build failure

28ef0fff

08 9月, 2022 1 次提交
- C
  
  fix warning (#45870) · 23998b75
  由 chenjian 提交于 9月 08, 2022
  
  23998b75
07 9月, 2022 1 次提交
- R
  
  Fix bug for AutoGrowthBestFitAllocator build (#45806) · fcbb307c
  由 Ruibiao Chen 提交于 9月 07, 2022
  
  fcbb307c
01 9月, 2022 1 次提交
- L
  remove circular dependency of device_context and allocator (#45455) · 934171ae
  由 Leo Chen 提交于 9月 01, 2022
```
* refine cmake of framework

* add deps for dense tensor

* fix deps

* remove alloc(ctx)

* add depends on mkldnn
```
  934171ae
17 8月, 2022 1 次提交
- W
  fix multi stream error. (#45196) · a79d4a75
  由 Wilber 提交于 8月 17, 2022
```
* fix multi stream error.
```
  a79d4a75
01 8月, 2022 2 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

GPUGraph merge to develop (#44594) · 798670bb

由 danleifeng 提交于 8月 01, 2022

798670bb

29 7月, 2022 2 次提交

L
unify fluid::CUDADeviceContext and phi::GpuContext (#44723) · 88490567
由 Leo Chen 提交于 7月 29, 2022
```
* remove cudaDeviceContext

* remove more template

* fix rocm compile
```
88490567

move CUDAStream to phi (#44529) · da3743fd

由 Leo Chen 提交于 7月 29, 2022

* init

* move CUDAStream to phi

* fix compilation

* merge develop

* add stream_owned_ member

* split cuda_stream.h

* fix cpu compile

* fix constructor

* fix bug

* fix windows compile

* fix inference test_levit

* fix windows tests

da3743fd

19 7月, 2022 1 次提交

compile phi/backends into one static library (#44373) · 1047cb17

由 Leo Chen 提交于 7月 19, 2022

* compile into one static library

* fix xpu compile

* fix xpu compile

* fix inference compile

* fix inference compile

* add custom test

* revert one file

1047cb17

14 7月, 2022 2 次提交

refine allocation cmake (#44241) · dc5a0420

由 Leo Chen 提交于 7月 14, 2022

* build into one static library

* move memory/detail to memory/allocation

* fix bug

* fix profiler

* fix framework_proto

* fix deps

* fix inference compilation

* fix rocm compile

* follow comments

* fix buddy_allocator_test

dc5a0420

R
[CustomDevice] add custom ccl 1/2 (#44294) · d88e77a7
由 ronnywang 提交于 7月 14, 2022
```
* [CustomDevice] add custom ccl api

* add ut
```
d88e77a7

11 7月, 2022 1 次提交
- 王
  
  [NPU] add npu support for new executor. test=develop (#43403) · 5988553f
  由王明冬提交于 7月 11, 2022
  
  5988553f
06 7月, 2022 1 次提交
- H
  
  minor fix VLOG for xpu. test=kunlun. (#44099) · 502062da
  由 houj04 提交于 7月 06, 2022
  
  502062da
04 7月, 2022 1 次提交
- R
  
  Remove boost::static_visitor (#44024) · 01fedf4f
  由 Ruibiao Chen 提交于 7月 04, 2022
  
  01fedf4f
26 6月, 2022 1 次提交
- S
  
  format all files in fluid using new config (#43776) · 576236a0
  由 Sing_chan 提交于 6月 26, 2022
  
  576236a0
24 6月, 2022 2 次提交

王

add xpu support for new static alone executor. test=develop (#43076) · b2704837
由王明冬提交于 6月 24, 2022

b2704837

record memory and op supplement info (#43550) · 8dd0a3b9

由 chenjian 提交于 6月 24, 2022

* record memory and op supplement info

* update

* update

* fix a bug

* fix memory recording

* fix a bug

* update

* update

* fix a bug

* update

* fix a bug

* fix a bug

* fix a bug

* Revert "fix a bug"

This reverts commit c1d4df52762ba9ae7c7e27cd2ba4fc3a7ed9c7a5.

* fix a bug

* fix format

* fix

8dd0a3b9

14 6月, 2022 1 次提交
- W
  fix cmake-lint problems. (#43406) · 59f89236
  由 Wilber 提交于 6月 14, 2022
```
* cmake-lint

* update
```
  59f89236
10 6月, 2022 1 次提交
- R
  Refactor DeviceContextPool (#42901) · 114723c9
  由 Ruibiao Chen 提交于 6月 10, 2022
```
* Refactor DeviceContextPool

* Adjust header file order
```
  114723c9
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
04 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：cmake-format (#43057) · 92568edb
  由 Sing_chan 提交于 6月 04, 2022
  
  92568edb
02 6月, 2022 1 次提交

Support CUDA Graph for partial graph in dygraph mode (#42786) · d05b940a

由 sneaxiy 提交于 6月 02, 2022

* support CUDAGraph for partial graph

* add ut

* fix ci

* fix ut again because of eager mode

* fix kunlun ci

* fix win ci

d05b940a

01 6月, 2022 1 次提交
- R
  Add pinned memory to host memory stats (#43096) · c4b7c485
  由 Ruibiao Chen 提交于 6月 01, 2022
```
* Add pinned memory to HostMemoryStats

* Add macro for WrapStatAllocator

* Fix CI errors
```
  c4b7c485
27 5月, 2022 1 次提交
- R
  Support memory stats for CPU (#42945) · 21f11d35
  由 Ruibiao Chen 提交于 5月 27, 2022
```
* Support memory stats for CPU

* Add UTs

* Fix typos

* Fix typos
```
  21f11d35
19 5月, 2022 1 次提交
- C
  [CompileOpt] Refine enforce code and remove boost/variant include (#41093) · ca359fec
  由 Chen Weihang 提交于 5月 19, 2022
```
* refine enforce code

* refine enforce code

* fix compile failed

* fix infrt failed
```
  ca359fec
27 4月, 2022 1 次提交

[CustomDevice] op_test supports custom device (#42227) · 4df02fdf

由 Aganlengzi 提交于 4月 27, 2022

* [DO NOT MERGE] test op_test

* update with more related modifications

* split op_test.py to use test=allcases for testing

* split op_test.py to use test=allcases for testing

4df02fdf

25 4月, 2022 1 次提交
- R
  
  Do not reset default stream for StreamSafeCUDAAllocator (#42149) · 6553a9d7
  由 Ruibiao Chen 提交于 4月 25, 2022
  
  6553a9d7
07 4月, 2022 1 次提交
- L
  Profile Executors (#41100) · dfb47986
  由 liutiexing 提交于 4月 07, 2022
```
* Profile Executors

* update

* fix ut

* fix names

* update

* update
```
  dfb47986
05 4月, 2022 1 次提交

[new-exec] enable the new standalone executor by default (#41179) · 93ea1297

由 Leo Chen 提交于 4月 05, 2022

* enable new executor by default

* enable stream safe allocator

* test=document_fix;test=coverage

* do not use scope in op kernel

* fit empty program for new executor

* fix communication depend

* fix test_sync_batch_norm

* skip unsupported place

* refine datatransfer

* fit for dirtributed program

* fix dependencpy

* fix some ut

93ea1297

01 4月, 2022 1 次提交
- F
  Fix compilation errors for gcc-54 (#41228) · 8aef685b
  由 From00 提交于 4月 01, 2022
```
* Fix compilation error for gcc-54

* Remove const for gpuStream_t
```
  8aef685b
30 3月, 2022 1 次提交

Add new APIs for GPU memory monitoring (max_memory_allocated,... · afe02e9d

由 From00 提交于 3月 30, 2022

Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657)

* Add new API memory_reserved

* Add memory_allocated, max_memory_reserved and max_memory_allocater

* Fix CI error

* Fix CI error

* Enhance UT

* Add FLAGS_memory_stats_opt

* Add STATS macro functions

* Add StatAllocator

* Fix CI errors

* Add UT

* Fix CI errors

afe02e9d

27 3月, 2022 1 次提交

Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy (#40886) · 0ad2e192

由 From00 提交于 3月 27, 2022

* Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy

* Set FLAGS_use_stream_safe_cuda_allocator to false

* Update

* Remove unnecessary code

* Fix CI errors

* Add UT

0ad2e192

25 3月, 2022 1 次提交

support multi_dims for tril_triu, *test=kunlun (#40712) · 9ffedcfd

由 z8hanghuan 提交于 3月 25, 2022

* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* update xpu.cmake date, support multi_dims for tril_triu, *test=kunlun

9ffedcfd

23 3月, 2022 1 次提交

Performance optimization for StreamSafeCudaAllocator (#40718) · d8bff988

由 From00 提交于 3月 23, 2022

* Performance optimize

* Optimize GetAllocator, RWLock and ProcessUnfreedAllocation

* Remove test file

* Fix CI error

* Fix CI errors

* Fix CI errors

d8bff988

18 3月, 2022 1 次提交
- A
  
  [NPU] fix no allocator error (#40687) · 8c713223
  由 Aganlengzi 提交于 3月 18, 2022
  
  8c713223
14 3月, 2022 1 次提交

[multiprocessing] Add paddle.incubate.multiprocessing for sharing tensors ... · e553f758

由 Zhong Hui 提交于 3月 14, 2022

[multiprocessing] Add paddle.incubate.multiprocessing for sharing tensors  between python processes. (#37302)

* Add support for paddle.multiprocessing
* move multiprocessing to incubate.

e553f758

03 3月, 2022 1 次提交

Support cuda graph in StreamSafeCudaAllocator (#39594) · 4c0511fa

由 From00 提交于 3月 03, 2022

* Support cuda graph in StreamSafeCudaAllocator

* Fix CI error

* Arrange AllocatorFacade

* Fix CI error

* Fix CI error

* Fix ROCM Compile error

* Fix ROCM Compile error

4c0511fa

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功